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Introduction 
1 


Successful microcomputer-based 
designs are judicious 
blends of hardware and software. The User's 
Manual 
addresses 
both subjects in varying degrees of detail. 


This publication is the definitive source of information 
describing 
the iAPX 86 components. 
Software topics 
are given moderately 
detailed coverage. 
The manual 
serves as a reference source during system design and 
implementation. 


Intel's Literature 
Guide, updated bi-monthly and avail- 
able at no cost, lists all other manuals and reference 
material. Of particular interest to iAPX 86,88 designers 
are: AP-I13, Getting Started 
with the Numeric 
Data 
Processor; 
AP-I06, 
Multiprogramming 
with 
iAPX 
86,88 Microsystems; The Peripheral Design Handbook, 
and the iAPX 88 Book. 


The manual contains 
four chapters, 
two appendices, 
and a numerics 
supplement. 
The remainder 
of this 
chapter 
describes 
the architecture 
of the iAPX 86 
and 88. 


Chapter 2 describes the iAPX 86 and iAPX 88 Central 
Processing Units. Chapter 3 describes the 8089 Input! 
Output Processor. 
These two chapters 
are identically 
organized and focus on providing a functional descrip- 
tion of the iAPX 86,88 and 89, plus related 
Intel 
products. 


Hardware 
reference 
information-electrical 
charac- 
teristics, timing and physical interfacing-for 
the iAPX 
86,88 processors 
is concentrated 
in Chapter 4. 


Appendix 
A is a collection 
of iAPX 86 application 
notes; these provide design and debugging examples. 
Additional application 
notes are available through In- 
tel's Literature 
Department 
(see Literature 
Guide). 


Appendix B contains iAPX component data sheets and 
several systems data sheets. The entire Intel catalog of 
data sheets is available in: 1981 Component Data Cata- 
log and 1981 Systems Data Catalog. 


The Numerics Supplement provides detailed informa- 
tion on the 8087 numeric processor 
extension 
to the 
iAPX 86110and 88/10 CPUs. 


MICROSYSTEM 
80 
NOMENCLATURE 


The increase 
in microcomputer 
system and software 
complexity has prompted Intel to introduce a new fam- 
ily of microprocessor 
products 
to reduce application 
complexity 
and cost. This new generation 
of Intel 
microprocessors 
is powerful and flexible and includes 
many processor 
enhancements. 
These include CPUs, 
numeric floating point extensions, 
I/O processors, 
and 
all the support chips required for a full function system. 


As Intel's 
product 
line has evolved, 
its component- 
based product numbering system has become inappro- 
priate for all the possible VLSI computer 
solutions 
offered. While the components retain their names, Intel 
has moved to a new system-based 
naming scheme to 
accommodate 
these new VLSI systems. 


We have adopted the following prefixes for our product 
lines, 
all of them 
under 
the general 
heading 
of 
Microsystem 
80: 


iAPX 
- 
Processor 
Series 
iRMX 
- 
Operating Systems 
iSBC 
- 
Single Board Computers 
iSBX 
- 
MULTIMODULE 
Boards 


Concentrating 
on the iAPX Series, two processor lines 
are currently defined: 


iAPX 86 
- 
8086 CPU-based 
system 
iAPX 88 
- 
8088 CPU-based 
system 


Configuration 
options 
within each iAPX system are 
identified by adding a suffix, for example: 


iAPX 86/10 - 
CPU Alone (8086) 
iAPX 86111- 
CPU + lOP (8086 + 8089) 
iAPX 88/20 - 
CPU with Math Extension 
(8088, 8087) 
iAPX 88/21 - 
CPU with Math Extension + lOP 
(8088, 8087 + 8089) 


This improved numbering system will enable us to pro- 
vid~ you with a more meaningful view of the capabili- 
ties of our evolving Microsystem 
80. 


IAPX 86 AND IAPX 88 ARCHITECTURE 
- 
THE FOUNDATION 
FOR THE FUTURE 


Overview 


iAPX 
86,88 is an evolving family of microprocessors 
and 
peripherals. The family partitions processing functions among 
general data processors (8086 and 8088), specialized coproces- 
sors like the 8087 numeric data processor, and 110 channel 
processors (the 8089). 


Four key architectural 
concepts 
shaped the data processor 
designs. 
All four 
reflect the family's 
role as vehicles 
for 
modular, 
high level language 
programming 
(in addition 
to 
assembly 
language 
programming). 
The four architectural 


concepts are memory segmentation, 
the operand 
addressing 


structure, 
the 
operation 
register 
set, and 
the instruction 
encoding 
scheme. 
They are distinct 
departures 
from 
the 
minicomputer 
architectural 
styles of the 1960's and 1970's. 


These earlier architectures 
(minicomputers) 
were designed 
for assembly language 
programming 
which emphasizes 
reg- 
ister based data and linear programs. 
Over the last decade, 


large software 
development 
projects shifted their program- 
ming to high level languages 
which employ 
modular 
pro- 


gramming 
and 
memory 
based 
data. 
The 
iAPX 
86,88 
memory 
segmentation 
scheme is intended for modular 
pro- 
grams. It supports 
the static and dynamic 
memory 
require- 
ments of program 
modules, as well as their communication 
needs. The iAPX 
86,88 registers are designed for fast high 
level languag~ 
execution. 
The scheme employs 
specialized 
registers and implicit register usage. You will derive signifi- 
cant 
performance 
and 
memory 
utilization 
improvements 
directly from these architectural 
features. 


The four concepts 
are discussed 
in the following 
sections. 


They are: 


• 
Memory 
segmentation 
for modular 
programming, 
evolution 
to memory 
management 
and protection 


• 
Addressing 
structure 
for high level programming 
languages 


• 
Instruction 
set encoding 
for memory efficiency 
and execution 
speed 


Memory Segmentation for Modular Programming 


Large programs 
(I O-IOOK bytes) are not generally written in 
assembly language. 
They are developed in individually 
com- 
piled modules 
in high level languages. 
Modular 
program 
development 
techniques, 
program libraries, compatible 
link- 
ing, and project management 
tools are often requirements 
in 
such 
an 
environment. 
A complex 
application 
program 
might be composed 
of multiple processes, with each process 
constructed 
from 
multiple 
modules. 
Processes 
send mes- 
sages to each other for communication, 
while modules gen- 


erally share common 
data when needed. Ideally, these inter- 
module 
'communication 
paths 
are 
well 
structured 
and 


disciplined. 


The iAPX 86,88 segmentation 
scheme is optimized 
for the 


reference 
patterns 
of computer 
programs. 
Four 
segment 


registers 
are provided 
in a segment 
register file. Memory 


references 
are relative 
to automatically 
selected code seg- 
ment 
(CS) amI data 
segment 
(DS) registers. 
The module 


shares a stack segment 
(SS) with all other modules 
of the 


process (task). The module may share a global data segment 
with other modules in the process; for example, 
to send and 
receive messages between modules. This segment is accessed 
explicitly with the extra segment (ES) register. 


This scheme is highly efficient because 
constant 
program 


references 
to code 
and 
data, 
as well as the stack, 
have 


automatic 
segment 
selection. 
This 
results 
in minimized 
instruction 
length. Only 16 bits are required to address any- 
where in the full megabyte 
address 
range. Only infrequent 
inter-module 
communications 
require the extra prefix bits to 


explicitly override the automatic 
segment selection. 


There are two other significant 
advantages 
to the segment 


register concept. 
First, it separates 
segment 
base addresses 


from offset addresses 
which are relative to the segment base. 


Only offset addresses 
are used within object modules. 
This 


supports position-independent, 
dynamically relocatab,le mod- 


ules. You merely have to alter the CS and DS register contents 
to move a module, rather than relinking the whole task and 
reloading. 
This structure employs short addresses (16 rather 


than ~bit) 
for efficient use of memory. 


The second advantage 
of iAPX 86,88 segmentation 
is that it 


can be extended 
to include memory management 
and multi- 


level protection. 
The contents 
and width 
of segmentation 


registers are independent 
of the rest of the instruction 
set. 


The architecture 
can be made to address additional 
memory 


and 
provide 
access 
rights 
and 
limit checking. 
Using the 


mainframe 
concept 
of memory 
based segment 
tables, this 


structure 
can also support 
virtual 
memory. 
Further, 
since 


only four 
registers 
are active 
in the file at a time, 
these 


features can be accomplished 
on the CPU chip itself, avoid- 
ing the access delays of off-chip memory 
management. 


In summary, 
memory 
segmentation 
has several 
ultimate 


benefits for the end user. It provides for simplified hardware 
and 
faster, 
modular 
software 
development, 
more 
easily 


maintainable 
code, 
and 
provides 
an orderly 
way for the 


architecture 
to grow. 


Addressing Structure for High Level 
Programming 
Languages 


The iAPX 86,88 architecture 
employs an operand 
address- 
ing 
scheme 
complementing 
the 
memory 
segmentation 
scheme. There are four components 
in an address. They are 


the segment, 
base, index, and displacement. 
The segment 
component 
was just described. 
A base register is dedicated to 


both the data and stack segments. These base registers may 
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also be used when accessing the extra (global) segment. They 
are used for holding 
the base address 
of a data structure. 


Two 
index 
registers 
are provided 
for use with the base 
registers to dynamically 
select any element from a based data 


structure. 
Eight or sixteen-bit 
fixed displacements 
may be 


added 
to any of these address 
forms. 
The complete 
regis- 


ter file is shown in Figure 
2 and the addressing 
structure 
is 


shown 
in Figure 
3. 


Referring 
to Figure 3, an iAPX 86,88 operand 
address 
con- 


tains up to four components: 
a segment 
(S), a base (B), an 


index (I), and a displacement 
(d). The segment component 
is 
automatically 
selected 
for the code, 
data, 
and 
stack 
seg- 
ments. 
An explicit 
segment 
selection 
is required 
for data 


references 
in the extra 
segment. 
Any combination 
of the 


remaining three address components 
is permitted in virtually 


all memory 
reference 
instructions, 
with at least one always 
being present. 
. 


___ 
S__ [~J 


+ 
B 
BASE 


+ 
INDEX 


r--~ 
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~_ __ 
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DISPLACEMENT 
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I 
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ADDRESS 


Block and string data are extensions to this scheme. They use 
different assumptions 
for source and destination 
segments, 


but the segments 
are still implicitly 
accessed. 
Immediate 


operands 
are also supported. 


The iAPX 
86,88 is a two operand 
machine 
(source 
and 


destination). 
It supports 
source/ destination 
operand combi- 


nations 
of register/memory, 
memory/register, 
memory / 


memory 
(string 
operations 
only), immediate/ 
register, 
and 
immediate/memory. 
The various address 
combinations 
of 


S, B, I, and d correspond 
to common 
data structures 
used in 


high level language 
programming. 
Such data structures 
can 


therefore 
be implemented 
easily in assembly 
language 
as 


well. 


Figure 3 shows the correspondence 
between the most com- 


mon iAPX 
86,88 address 
modes and various data types in 


high 'level programming 
languages. 
The 
S component 
is 
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CODE 
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One Mega-Byte 
Address 
Range 


implicit; the stack base (BP) assumes the stack segment; no B 
component, 
or use of the data base (BX), assumes the data 


segment. 
The less commonly 
used address 
modes are not 
shown. 


The stack base (BP) is a concept borrowed from the family of 
P-machines 
"developed" 
as ideal 
PASCAL 
vehicles. 
P- 
machines 
term this register the "mark 
pointer". 
It always 
points to the base of the current 
local data area in the stack 
segment. 
This permits 
efficient 
local addressing 
in block- 
structured 
languages such as PASCAL 
and PL/ M. In these 
languages, 
procedures 
are invoked by pushing their parame- 
ters on the stack, calling the procedure, 
and then allocating 


their local data area on the stack. The iAPX 
86,88 return 
instruction 
then removes the parameters 
from the stack, as is 
done in the P-machines. 


Operation Register Set for Computation 


The 
Intel iAPX 
86,88 line is truly 
a complete 
family 
of 
microprocessors. 
The iAPX 86/ 10 and iAPX 88/ 10 are the 
general data processor members of the family, while the 8089 
is the I/O processor 
family member. 
In addition, 
the CPU 


itself has an interface for attaching 
coprocessors. 
Coproces- 
sors provide specialized operation 
set extensions 
that benefit 
the 
application 
by performing 
special 
purpose 
logic 
to 
increase performance. 


The iAPX 86/20 Numeric 
Data Processor 
is an example of 
this c~ncept. Using an 8086 with an 8087 coprocessor 
(CPU 
extension) it provides a one hundred-fold 
performance 
boost 
over the iAPX 86/ 10 for a wide range of numeric operations. 
The full computational 
capability 
of the iAPX 86,88 family 
can therefore span a much broader range than is possible with 
a single microprocessor. 
This technique has been used success- 
fully in the mainframe and minicomputer 
industries to provide 


instruction set options for scientific, commercial, text process- 
ing, or other special purpose applications. 


An 8087 extends 
the iAPX 
86 or iAPX 
88 architecture 
to 
include additional 
data types, registers, and instructions. 
The 
8086 or 8088, with an 8087 coprocessor, 
operates 
on 16,32, 


and 
64-bit 
integers, 
32, 
64 and 
80-bit 
floating 
point 
numbers, 
and up to 18 digit packed 
BCD numbers. 
Data 
conversions 
and calculations 
are performed 
in the 8087 
and are transparent 
to the programmer. 


The iAPX 86/ 10 and iAPX 88/ 10 CPUs alone can perform 
arithmetic 
operations 
on signed and unsigned 
8 and 16-bit 
binary 
integers 
as well as packed 
and unpacked 
decimal 
integers. The full complement 
of logical operations 
are pro- 
vided as well. Interesting 
new features are the string opera- 
tions. Six primitive 
string instructions 
(move, skip, search, 
compare, 
set, and translate) 
are standard. 
When combined 
with special control 
operators, 
complex 
string 
manipula- 
tions are possible with two or three instructions. 


Instruction Set Encoding for Memory Efficiency 
and Execution Speed 


The iAPX 86 uses a byte oriented 
instruction 
stream while 
operating 
with a 16-bit data bus. To accomplish 
this, the 
processor 
is subdivided 
into two independent 
parallel pro- 
cessors called the bus interface unit (BIU) and the execution 
unit (EU). 
The iAPX 
88 employs 
an identical 
execution 
unit 
and is 100% code compatible 
with iAPX 
86, yet it 


interfaces 
to an 8-bit wide data bus BIU. The bus interface 
unit is an independent 
processor 
that prefetches 
instruc- 
tions. 
Instruction 
fetch 
time 
is therefore 
mostly 
over- 
lapped 
with other iAPX 
86,88 processor 
activity. 
The bus 


interface 
unit 
permits 
either 
instructions 
or data 
to be 
placed 
in memory 
without 
regard 
to word 
boundaries. 


(An array 
of five byte records 
in PASCAL 
can be refer- 
enced without 
requiring 
an additional 
byte of padding 
to 


word 
align 
the records.) 
Processor 
subdivision 
into 
the 
BIU and EU has the additional 
benefit 
of minimizing 
the 


effect of wait states and bus hold time on CPU efficiency. 


Instruction 
set encoding 
is substantially 
improved 
when 
instructions 
are composed 
in byte 
multiples 
instead 
of 


words. Instructions 
in the iAPX 86,88 vary from one to six 
bytes in length 
(not counting 
optional 
prefix 
bytes). 
The 


average 
instruction 
is three bytes long. In a word aligned 
machine 
the same information 
would 
occupy 
four bytes. 


This and the features 
described 
above give the iAPX 86,88 


roughly 
a 30% program 
space savings 
over other 
archi- 
tectures. 


Beyond 
efficient 
support 
for 
high 
level languages, 
the 
iAPX 
86 and 
iAPX 
88 establish 
the foundation 
for the 


family to build on in the 1980's. The family uses increasing 
levels of integration 
to significantly 
reduce software, 
hard- 
ware, and development 
investment. 


The iAPX 86/ 10 and iAPX 88/ 10 general purpose 
proces- 


sors employ 
external 
module 
integration. 
Specialized 
sys- 
tem functions 
are distributed 
among 
optimized 
compo- 
nents and removed 
from the host processor. 
The CPU is 


freed to become the system manager 
and resource allocator 


rather 
than doing "all things for all programs". 
The family 


also includes 
the 8087 Numeric 
Data 
Processor 
and the 
8089 I/O 
Channel 
Processor. 


These processors 
are optimized 
to address 
the three main 


functions 
in a computer 
environment: 
data processing 
and 
control, 
arithmetic 
computation~ 
and input/ output. 
The 


8087 and 8089 are described 
below. 


The 8087 Numeric 
Processor 
Extension 
(NPX) 
adds over 
50 numeric 
opcodes 
and eight 80-bit registers 
to the host 
processor 
to provide 
more 
extensive 
data 
and 
numeric 


processing 
capability. 
It performs 
floating point and trans- 
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cendental 
(trigonometric) 
functions, 
processes 
decimal 


operands 
up to 18 digits without 
roundoff, 
and performs 


exact 
arithmetic 
on integers 
up to 64 bits long. Another 


feature 
of the NDP, 
with important 
benefits to you, is that 


it is compatible 
with the proposed 
IEEE 
floating 
point 


standards. 
It can be used in applications 
requiring 
high 


speed computation 
such as numerical 
analysis, accounting 


and financial 
applications, 
the sciences, 
and engineering. 
Throughput 
increases 
in such applications 
up to 100 times 
current 
speeds are typical (See Figure 4.) 


The 8089 Input/ 
Output 
Processor 
(lOP) is an independent 


microprocessor 
that 
optimizes 
input I output 
operations. 
The objective 
of the lOP is to remove all II 0 details from 


application 
software. 
It responds 
to CPU 
direction 
but 


executes 
its own instruction 
stream 
in parallel 
with other 


processors. 
110 transfers 
of either 8 or l6-bit data can be 


BASE 
INTERFACE 
UNIT 


done at rates up to 1.25 megabytes 
per second. 
The lOP 
therefore 
combines 
the attributes 
of both 
a CPU 
and a 


oMA controller 
to provide a powerful 
110 subsystem. 
An 
important 
feature 
of the lOP 
is that 
it can be physically 


isolated 
from the application 
CPU. The advantage 
to you 
is that 
110 subsystem 
changes 
or upgrades 
can be made 
without 
any impact to application 
software. 
(See Figure 5.) 


Summarizing, 
there 
are 
several 
advantages 
to external 


module 
integration: 


• 
System 
tasks may be allocated 
to special purpose 
pro- 
cessors designed 
for optimal 
task handling 


• 
Simultaneous 
operation 
(parallel 
processing) 
provides 


highest system performance 


• 
Isolated 
system functions 
minimize the effect of modifi- 


cations, 
local failures, 
or errors on the rest of the system 


• 
The iAPX 
86,88 family of processors 
allows division of 
the application 
into small, manageable 
tasks for parallel 
development, 
while providing 
built-in 
hardware 
facili- 
ties for coordinating 
processor 
interaction. 
With 
the 
iAPX 
86,88 approach 
you can implement 
high perfor- 
mance systems far more quickly 
and easily than would 
otherwise 
be possible. 


DEVELOPMENT TOOLS 


Development 
Systems 


Development 
systems are a unique combination 
of hardware 
and software tools which increase your product development 
productivity. 
With 
Intel 
development 
products, 
you 
will 
shorten 
the 
development 
cycle and 
reduce 
your 
time to 
market. 


Development 
systems from Intel provide an upgradable 
spec- 
trum of tools ranging from stand alone development 
systems 
to future networks 
of specialized work stations. 
Intel elimi- 
nates your risk of development 
system obsolescence by gua- 
ranteeing product 
upgradability 
and compatibility. 
Ibis gua- 
rantee protects your capital investment. 


For small to medium size projects, the Intellec'· development 
system is available in many configurations 
at low cost. For 
small projects, these systems have nominal program 
memory 
with floppy 
disks as peripheral 
storage devices. Minimum 
configurations 
may be upgraded 
to provide increased perfor- 
mance, 
increased 
memory, 
and increased 
mass storage via 
hard 
disk. 
These 
more 
powerful 
configurations 
support 
medium sized projects. 


The Intellec Series 11/85 is a good example of such a system. 
It is a complete 
microcomputer 
development 
system inte- 


grated into one compact package. The Model 225 includes a 
CPU with 64K bytes of RAM, 
4K bytes of ROM, 
a 2000 
character CRT, detachable full ASCII keyboard, and a 250K 
byte floppy disk drive. The powerful ISIS-II Disk Operating 
System software allows you to efficiently develop and debug 
iAPX 86,88 programs. 
Optional 
storage peripherals 
provide 
over 2 million and 7.3 million bytes of storage on floppy and 
hard disk, respectively. 


Distributed 
development 
configurations 
address the range of 
medium to large sized projects. These configurations 
connect 
multiple standalone 
development 
systems to more powerful 
support 
resources such as mainframes 
and their peripherals. 


In addition 
to the Intellect!!ldevelopment 
system, Intel offers 
several products to help you debug and test your hardware and 
software. 
In-eircuit -Emulators, 
such as ICE-86'· 
and ICE- 
88'·, are available to emulate your product environment. 
They 
increase 
development 
productivity 
substantially. 
Another 
software tool, RBF-89, helps you debug 8089 software under 
ICE control. 
With these tools, software 
development 
time 
can 
be reduced 
dramatically 
- 
lowering 
your 
total 
in- 
vestment. 


High Level Languages 


Programming 
languages are the key to developing an applica- 
tion. Intel programming 
languages 
serve three purposes 
in 
your design. First, they are your primary design tool. Intel's 
breadth 
of languages· and extended 
features 
give you the 
maximum 
ability to properly design and plan your program. 
Second, Intel languages are a communication 
vehicle between 
programmers 
during implementation 
and later during modifi- 
cation. Standard 
high level languages allow programmers 
to 
better communicate 
what the programs 
do. Third, Intel lan- 
guages are designed in conjunction 
with Intel microsystems to 
provide the greatest code efficiency and execution speed. Intel 
languages 
speed implementation 
of your design and reduce 
maintenance 
costs. 


MDS-311 is a set of software development 
tools for iAPX 86 
and iAPX 
88 applications. 
It is a complete 
set of software 
products 
that run on the Intellec Model-800 
and Series-II 
development 
systems. The software 
tools provided 
include 
PLI M-86, high level programming 
language, and the ASM- 
86 assembler. Two utilities, LINK86 and L0C86, 
are supplied 
to link separately 
compiled 
or assembled 
program 
modules 
into executable tasks. The Library Manager, 
LIB86, lets you 
maintain 
a library of iAPX 86 or iAPX 88 object modules. 


These modules 
can then be linked in with new programs 
without 
being recompiled. 
This simplifies and speeds your 
development. 
Common code (e.g. a subroutine) 
only has to be 
developed and compiled once. Intel code converters, 
such as 
CONV86, 
are very useful tools for migrating 
8080 or 8085, 


Z80, and 6809 assembly language programs to the iAPX 86 or 
iAPX 
88. They convert 
assembly 
source code to ASM86 
source code. This will help you make a rapid transition and cut 
redevelopment 
costs substantially. 


Intel will provide a variety oflanguages 
for both systems and 
applications 
to facilitate development 
of your product. 
You 
can choose the language (or languages) which best suits your 
product 
needs and the expertise 
of your staff. ASM86, 
the 
assembly language, and PLI M-86, the systems oriented high 
level language, are both currently available. PASCAL, 
FOR- 
TRAN, 
and BASIC will be offered in the near future, and 
COBOL is planned after that. 


Intel's languages 
also run on your final product. 
Your pro- 


duct's function is significantly increased when packaged with 
language translators. They allow your customers to tailor your 
products 
for their environment. 
Intel's languages 
will save 
implementation 
time and free resources to work on the value- 


added portion of your product. 


SINGLE BOARD COMPUTERS 
ACCELERATE 
YOUR 
MICROSYSTEM 
SUCCESS 


In addition 
to the increased 
integration 
of functions 
in 
VLSI components, 
there is a strong trend today to imple- 
ment microsystem 
applications 
with single board 
compu- 


ters. This allows the design engineer to: 


• Easily configure reliable and cost-effective systems 
using iSBC and iSBX standard products. 


• Overcome the shortage of qualified engineers and 


technicians. 


• Get the end product to market quickly. 


• Focus on the application. 


• Offset the increasing cost of capital. 


In addition, using iSBC single board computers and iSBX 
expansion products in your design reduces the number of 
risks that you must face in all phases of the product life 
cycle. The four major risk areas that Intel iSBC and iSBX 
products will help you overcome are as follows: 


Using a fully tested board computer, which incorpo- 
rates the key elements of processor, memory and I/O, 
helps overcome today's critical shortage of engineers, 
programmers 
and 
technicans. 
Implementing 
iSBC 


boards and iSBX MULTI MODULES in your design 
reduces increasing capital costs in production, QC, and 
test. It is estimated that using iSBC boards can save up 
to $200,000 per board design. 


2. Time to Market Dictates Success or Failure 


With inflation running at its current rate, the amount of 
time it takes to get a product from an idea to the market 
becomes critical. A delay of a few months can collapse 
your return on investment. 


Experience shows that the first company that gets its 
product to the marketplace usually dominates that 
market. You can get your product to the market months 
earlier using standard off-the-shelf iSBC, iSBX and 
Real-Time 
Executive 
(iRMX) 
Software 
modules. 


Intel's large board manufacturing and distribution cap- 
ability enables you to respond to your market demand 
rapidly and in a cost-effective mann~r. 


3. Solution Completeness 
ancf Project Credibility 


Microprocessor based solutions for today's problems 
are commonplace and are expected to succeed. A broad 
spectrum of compatible system components 
in the 


iSBC, iSBX, and iRMX product line increase the prob- 
ability of being right the first time. General purpose 
iSBC board solutions are easy to customize through the 
use of iSBX modules from Intel, or your own design. 


4. Coping with the Technology/Complexity 


Avalanche 


iSBC and iSBX products incorporate the latest in VLSI 
shortly after their initial introduction. With increasing 
system complexity Intel's design process and testing 
reduces the risk of "gremlin" bugs which multiply with 
complexity and evadediagnosis.Standards usedthrough- 
out the product family such as the de facto industry 
standard 
MULTIBUS, 
EIA, IEEE etc. provide a 


smooth transition for your product to new and chang- 
ing processor, memory and I/O technologies. 


Intel's single board computer product family is continuing 
to reduce your risk and protect your investment in the 
future by expanding iSBC and iSBX products in three 
dimensions: processors, memory, and I/O. 
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Intel's iAPX 86,88 multiple processor family is designed 
for modular programming in highlevel'as wellas assembly 
languages. 


• 
Its memory segmentation'scheme is optimized for the 
reference needs 'ofcomputer programs, and is separate 
from the operand addressing sttucture. 
' 


• 
The structure for addressin~ 'operands within segments 
directly supports the various data types found in high 
,level program,ming languages., 
, 


• 
The family provides an operation register set to support 
general computation requirements. It also provides for 
optimized operati9n register sets to do specialized data 
processing functions with its inherent multi- and copro- 
cessor support. 


• The family uses optimized instruction 
enc~ding for 
high performance and memory efficiency 
• The fa~ily is well supported with development t~ols 
and single board computer products. 


This architecture provides the foundation for solving'the 
application needs in the 1980's.It makes a noted departure 
from architectures of the 1960's and 1970's - 
based on 
Intel's intent to minimize software and hardware product 
costs for you, the end user. 


The iAPX 86 and 
iAPX 88 Central 
Processing 
Units 
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CHAPTER 2 
THE 8086 AND 8088 
CENTRAL PROCESSING 
UNITS 


This chapter describes the mainstays of the 8086 
microprocessor 
family: the 8086 and 8088 central 
processing units (CPUs). The material is divided 
into ten sections 
and generally 
proceeds 
from 
hardware to software topics as follows: 


1. Processor Overview 


2. Processor Architecture 


3. Memory 


4. 
Input/Output 


5. Multiprocessing Features 


6. Processor Control and Monitoring 


7. Instruction Set 


8. Addressing Modes 


9. Programming 
Facilities 


10. Programming 
Guidelines and Examples 


The chapter describes the internal operation 
of 
the CPUs in detail. The interaction 
of the pro- 
cessors with other devices is discussed in func- 
tional terms; electrical characteristics, 
timing, and 
other information 
needed to actually 
interface 
other devices with the 8086 and 8088 are provided 
in Chapter 4. 


The 8086 and 
8088 are closely related 
third- 
generation microprocessors. 
The 8088 is designed 


with an 8-bit external data path to memory and 
I/O, while the 8086 can transfer 16 bits at a time. 
In almost every other respect the processors are 
identical; 
software 
written 
for 
one CPU 
will 
execute on the other without alteration. The chips 
are contained 
in standard 
40-pin 
dual 
in-line 
packages (figure 2-1) and operate from a single 
+5V power source. 


The 8086 and 8088 are suitable for an exception- 
ally wide spectrum 
of microcomputer 
applica- 


tions, and this flexibility is one of their most 
outstanding 
characteristics. 
Systems can range 
from 
uniprocessor 
minimal-memory 
designs 
implemented with a handful of chips (figure 2-2), 
to multiprocessor 
systems with up to a megabyte 
of memory (figure 2-3). 


Vcc 


A015 


A18/53 


A17/54 


A18/55 


A19/56 


BHE/S7 


MN/MX 


iii5 


HOLD 
(RQ/GTo) 


HLDA 
(RO/il'fi) 


WR 
(LiiCK) 


M/iO 
(~I 


OTIA 
(!11 


DEN 
(Sol 


ALE 
(0501 


INTA 
(0511 


TEST 


READY 


RESET 


Vcc.,, 


A18/53 


A17/84 


A18/55 


A19/56 


550 
(HIGHI 


MN/MX 


RQ 


HOLD 
IRQ/GTO) 


HLOA 
(RQ/GT1) 


WR 
(LiiCKI 


loiM 
1521 


OTIA 
Isi) 


DEN 
ISoI 


ALE 
(0501 


INTA 
1051) 


TEST 


READY 


RESET 


Figure 2-1.8086 
and 8088 Central Processing 
Units 
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The large application 
domain 
of the 8086 and 
8088 is made possible primarily by the processors' 
dual operating 
modes (minimum and maximum 
mode) 
and 
built-in 
multiprocessing 
features. 
Several of the 40 CPU pins have dual functions 
that are selected by a strapping pin. Configured 
in minimum 
mode, 
these pins transfer 
control 
signals 
directly 
to 
memory 
and 
input/output 
devices. In maximum mode these same pins take 
on different functions that are helpful in medium 
to large ystems, especially systems with multiple 
processors. 
The control 
functions 
assigned 
to 
these pins in minimum mode are assumed by a 
support chip, the 8288 Bus Controller. 


The CPUs are designed to operate with the 8089 
Input/Output 
Processor 
(lOP) 
and other 
pro- 
cessors in multiprocessing 
and distributed 
pro- 


cessing systems. When used in conjunction 
with 
one or more 8089s, the 8086 and 8088 expand 
the applicability 
of microprocessors 
into 
1/0- 
intensive data processing systems. Built-in coor- 
dinating 
signals and instructions, 
and electrical 
compatibility with Intel's Multibus ™ 
shared bus 
architecture, 
simplify 
and 
reduce 
the cost of 
developing multiple-processor 
designs. 


Both CPUs are substantially more powerful than 
any microprocessor 
previously offered by Intel. 


Actual 
performance, 
of 
course, 
varies 
from 
application to application, but comparisons to the· 
industry standard 
2-MHz 8080A are instructive. 
The 8088 is from four to six times more powerful 
than the 8080A; the 8086 provides seven to ten 
times the 8080A's performance 
(see figure 2-4). 
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Figure 2-4. Relative Performance 
of the 
8086 and 8088 


The 8086's advantage over the 8088 is attributable 
to its I6-bit external data bus. In applications that 
manipulate 
8-bit quantities 
extensively, 
or that 
are execution-bound, 
the 8088 can approach 
to 
within IOOJo of the 8086's processing throughput. 


The high performance 
of the 8086 and 8088 is 
realized by combining a 16-bit internal data path 
with a pipelined architecture 
that allows instruc- 
tions to be prefetched 
during spare bus cycles. 


Also contributing 
to performance 
is a compact 
instruction format that enables more instructions 
to be fetched in a given amount of time. 


Software 
for high-performance 
8086 and 8088 
systems need not be written in assembly language. 
The CPUs are designed to provide direct hard- 
ware support 
for programs 
written in high-level 
languages such as Intel's PL/M-86. 
Most high- 
level languages 
store variables 
in memory; 
the 
8086/8088 symmetrical 
instruction 
set supports 
direct operation on memory operands, 
including 
operands on the stack. The hardware addressing 
modes 
provide 
efficient, 
straightforward 
implementations 
of based variables, 
arrays, 
ar- 
rays of structures 
and other high-level language 
data constructs. 
A powerful 
set of memory-to- 
memory string operations is available for efficient 
character 
data 
manipulation. 
Finally, 
routines 
with critical performance 
requirements 
that can- 
not be met with PL/M-86 
may be written 
in 
ASM-86 (the 8086/8088 assembly language) and 
linked with PL/M-86 code. 


While the 8086 and 8088 are totally new designs, 
they make the most of users' existing investments 
in 
systems 
designed 
around 
the 
8080/8085 
microprocessors. 
Many 
of 
the 
standard 
Intel 
memory, peripheral control and communication 
chips are compatible with the 8086 and the 8088. 
Software is developed in the familiar 
Intellec® 


Microcomputer 
Development 
System 
environ- 
ment, and most existing programs, whether writ- 
ten in ASM-80 or PL/M-80, 
can be directly con- 


verted to run on the 8086 and 8088. 


Microprocessors 
generally execute a program by 
repeatedly cycling through the steps shown below 
(this description is somewhat simplified): 


I. 
Fetch the next instruction from memory. 


2. 
Read 
an 
operand 
(if 
required 
by 
the 
instruction) . 


3. 
Execute the instruction. 


4. 
Write 
the 
result 
(if 
required 
by 
the 
instruction) . 


In previous CPUs, most of these steps have been 
performed serially, or with only a single bus cycle 
fetch overlap. The architecture 
of the 8086 and 
8088 CPUs, 
while performing 
the same steps, 
allocates them to two separate 
processing units 


within the CPU. The execution unit (EU) executes 
instructions; 
the bus interface unit (BIU) fetches 


instructions, 
reads operands 
and writes results. 


The two units can operate independently 
of one 


another and are able, under most circumstances, 
to extensively overlap instruction 
fetch with exe- 


cution. The result is that, in most cases, the time 
normally 
required 
to 
fetch 
instructions 
"dis- 


appears" 
because the EU executes instructions 


that have already been fetched by the BIU. Figure 
2-5 illustrates this overlap and compares it with 
traditional 
microprocessor 
operation. 
In 
the 


example, 
overlapping 
reduces the elapsed time 


required to execute three instructions, 
and allows 


two additional 
instructions 
to be pre fetched as 


well. 
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The execution units of the 8086 and 8088 are iden- 
tical (figure 2-6). A l6-bit arithmetic/logic 
unit 


(ALU) in the EU maintains the CPU status and 
control 
flags, 
and 
manipulates 
the 
general 


registers and instruction 
operands. 
All registers 
and data paths in the EU are 16 bits wide for fast 
internal transfers. 


The EU has no connection to the system bus, the 
"outside 
world." 
It obtains instructions 
from a 
queue maintained by the BIU. Likewise, when an 
instruction 
requires 
access to memory 
or to a 


peripheral 
device, the EU requests the BIU to 


obtain 
or 
store 
the 
data. 
All 
addresses 


manipulated by the EU are 16bits wide. The BIU, 
however, 
performs 
an address 
relocation 
that 


gives the EU 
access to the 
full megabyte 
of 


memory space (see section 2.3). 


The BIUs of the 8086 and 8088 are functionally 
identical, 
but 
are 
implemented 
differently 
to 


match 
the 
structure 
and 
performance 


characteristics of their respective buses. 


The BIU performs all bus operations for the EU. 
Data is transferred between the CPU and memory 
or I/O devices upon demand from the EU. Sec- 
tions 2.3 and 2.4 describe the interaction 
of the 


BIU with memory and I/O devices. 


In addition, during periods when the EU is busy 
executing instructions, 
the BIU "looks 
ahead" 


and fetches more instructions from memory. The 
instructions are stored in an internal RAM array 
called the instruction 
stream 
queue. 
The 8088 


instruction 
queue holds up to four bytes of the 


instruction stream, while the 8086 queue can store 
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up to six instruction 
bytes. These queue sizes 
allow the BIU to keep the EU supplied with pre- 
fetched 
instructions 
under 
most 
conditions 
without monopolizing 
the system bus. The 8088 
BIU fetches another 
instruction 
byte whenever 
one byte in its queue is empty and there is no 
active request for bus access from the EU. The 
8086 BIU operates 
similarly except that it does 
not initiate a fetch until there are two empty bytes 
in its queue. The 8086 BIU normally obtains two 
instruction bytes per fetch; if a program transfer 
forces fetching from an odd address, 
the 8086 
BIU automatically 
reads one byte from the odd 
address 
and 
then 
resumes 
fetching 
two-pyte 
words from the subsequent even addresses. 


Under most circumstances 
the queues contain at 
least one byte of the instruction 
stream and the 
EU does not have to wait for instructions 
to be 
fetched. The instructions 
in the queue are those 


stored in the memory locations immediately adja- 
cent to and higher than the instruction currently 
being executed. That is, they are the next logical 
instructions 
so long as execution proceeds seri- 
ally. 
If 
the 
EU 
executes 
an 
instruction 
that 
transfers 
control 
to another 
location, 
the BIU 
resets the queue, fetches the instruction from the 
new address, passes it immediately to the EU, and 
then begins refilling the queue from the new loca- 
tion. In addition, 
the BIU suspends instruction 
fetching whenever the EU requests a memory or 
I/O read or write (except that a fetch already in 
progress is completed before executing the EU's 
bus request). 


Both CPUs have the same complement 
of eight 
16-bit general registers (figure 2-7). The general 
registers are subdivided 
into two sets of four 
registers each: the data registers (sometimes called 
the H & L group for "high" 
and "low"), 
and the 
pointer and index registers (sometimes called the 
P & I group). 


The data registers are unique in that their upper 
(high) 
and 
lower 
halves 
are 
separately 
addressable. 
This means that each data register 
can be used interchangeably 
as a 16-bit register, 
or as two 8-bit registers. The other CPU registers 
always are accessed as 16-bit units only. The data 
registers can be used without constraint 
in most 
arithmetic 
and 
logic operations. 
In addition, 
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some instructions 
use certain registers implicitly 


(see table 2~1)thus allowing compact yet powerful 
encoding. 


REGISTER 
OPERATIONS 


AX 
Word Multiply, Word Divide, 
Word I/O 


AL 
Byte Multiply, 
Byte Divide, Byte 
I/O, Translate, Decimal Arithmetic 


AH 
Byte Multiply, 
Byte Divide 


BX 
Translate 


CX 
String Operations, 
Loops 


CL 
Variable Shift and Rotate 


DX 
Word Multiply, Word Divide, 
Indirect I/O 


SP 
Stack Operations 


SI 
String Operations 


DI 
String Operations 


The pointer 
and index registers can also par- 
ticipate in most arithmetic and logic operations. 
In fact, all eight general registers fit the definition 
of "accumulator" 
as used in first and second 
generation microprocessors. 
The P & I registers 
(except for BP) also are used implicitly in some 
instructions as shown in table 2-1. 


The megabyte of 8086 and 8088 memory space is 
divided into logical segm~nts of up to 64k bytes 
each. (Memory segmentation 
is described in sec- 
tion 2.3.) The CPU 
has direct access to four 
segments at a time; their base addresses (starting 
locations) are contained 
in the segment registers 
(see figure 2-8). The CS register points to the cur- 
rent code segment; instructions are fetched from 
this segment. The SS register points to the current 
stack segment;' stack operations are, performed on 
locations in this segmenl. The DS register points 
to the current data segment'; it generally contains 
program variables. The ES register points to the 
current, extra segment, which also is typically used 
for data storage. 


The segment registers are accessible to programs 
and can be manipulated with several instructions. 
Good programming practice and consideration of 
compatibility with future Intel hardware and soft- 
ware products dictate that the segment registers 
~e used in a disciplined fashion. Section 2.10 pro- 
vides guidelines for segment register use. 
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The 16-bit instruction pointer (lP) is analogous to 
the 
program 
counter 
(PC) 
in the 
8080/80llS 
CPUs. The instruction pointer is updated by the 
BIU so that it contains 
the offset (distance in 
bytes) of the next instruction from the beginning 
of the current code segment; Le., IP points to the 
next instruction. 
During 
normal 
execution, 
IP 
contains the offset of the next instruction 
to be 
fetched by the BIU; whenever IP is saved on the 
stack, however, it firs! is automatically 
adjusted 
to point to the next instruction 
to be executed. 


Programs do not have direct access to the instruc- 
tion pointer, 
but instructions 
cau'se it to change 
and to be saved on and restored from the stack. 


The 8086 and 8088 have six I-bit status 
flags 
(figure 2-9) that the, EU, posts to, reflect certain 
properties of the result of an arithmetic or logic 
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operation. 
A 'group 
of instructions 
is available 
that 
allows 
a program 
to 
alter 
its execution 
depending on the state of these flags, that is, on 
the result of a prior' operation. 
Different instruc- 
tions affect the status flags differently; in general, 
however, 
the 
flags 
reflect 
the 
following 
conditio~s: 


H 


I. 
If AF (the auxiliary carry flag) is set, there 
has been a carry out of the low, nibble into 
the high nibble or a borrow from the high 
nibble into the low nibble of an 8-bit quantity 
(low-order byte of a 16-bit quantity). 
This 
flag 
is 
.used 
by 
decimal 
arithmetic 
instructions. 


2. 
If CF (the carry flag) is set, there has been a 
carry out of, or a borrow into, the high-order 
bit of the result (8- or 16-bit). The flag is used 
by 
instructions 
that 
add 
and 
subtract 
multibyte numbers. 
Rotate instructions 
can 
also isolate a bit in memory or a register by 
plaCing it in the carry flag. 


3. 
If OF (the oyer flow flag) is set, an arithmetic 
overflow has occurred; that is, a significant 
digit has been lost because the size of the 
result exceeded the capacity of its destination 
location. An Interrupt On Overflow instruc- 
tion is available that will generate an inter- 
nlPt in this situation. 


4. 
If SF (the sign flag) is set, the high-order bit 
of the result is a 1. Since negative binary 
numbers are represented in the 8086 and 8088 
in standard 
two's complement notation, 
SF 
indicates the sign of the result (0 = positive, 
I = negative). 


5. 
If PF (the parity flag) is set, the result has 
even parity, an even number of I-bits. This 
flag can be used to check for data transmis- 
sion errors. 


6. 
If ZF (the zero flag) is set, the result of the 
operation is O. 


Three additional control flags (figure 2-9) can be 
set and cleared by programs 
to alter processor 
operations: 


1. 
Setting DF (the direction flag) causes string 
instructions 
to auto-decrement; 
that is, to 
process strings from high addresses to low 
addresses, or from "right to left." 
Clearing 
DF 
causes 
string 
instructions 
to 
auto- 
increment, or to process strings from "left to 
right. " 


2. 
Setting IF (the interrupt-enable 
flag) allows 
the CPU 
to recognize external 
(mask able) 
interrupt requests. Clearing IF disables these 
interrupts. 
IF has no affect on either non- 
maskable 
external 
or internally 
generated 
interrupts. 


3. 
Setting TF (the trap flag) puts the processor 
into single-step mode for debugging. In this 
mode, the CPU automatically 
generates an 
internal 
interrupt 
after 
each 
instruction, 
allowing a program to be inspected as it exe- 
cutes instruction by instruction. 
Section 2.10 
contains an example showing the use of TF in 
a single-step and breakpoint routine. 


8080/8085 
Registers 
and Flag 
Correspondence 


The registers, flags and program counter in the 
8080/8085 CPUs all have counterparts 
in the 8086 
and 8088 (see figure 2-10). The A register (ac- 
cumulator) 
in the 8080/8085 corresponds 
to the 
AL register in the 8086 and 8088. The 8080/8085 
H & L, B & C, and D & E registers correspond to 
registers BH, BL, CH, CL, DH and DL, respec- 
tively, in the 8086 and 8088. The 8080/8085 SP 
(stack pointer) and PC (program 
counter) have 
their counterparts 
in the 8086/8088 SP and IP. 


The AF, CF, PF, SF, and ZF flags are the same in 
both 
CPU 
families. 
The 
remaining 
flags and 
registers are unique to the 8086 and 8088. This 
8080/8085 to 8086 mapping allows most existing 
8080/8085 program code to be directly translated 
into 8086/8088 code. 


Both processors have a strap pin (MN/MX) 
that 


defines the function of eight CPU pins in the 8086 
and nine pins in the 8088. Connecting MN/MX to 
+5V places the CPU in minimum mode. In this 
configuration, 
which 
is 
designed 
for 
small 
systems (roughly one or two boards), 
the CPU 
itself provides the bus control signals needed by 
memory 
and 
peripherals. 
When 
MN/MX 
is 


strapped 
to ground, 
the CPU is configured 
in 
maximum mode. In this configuration 
the CPU 
encodes control signals on three lines. An 8288 
Bus Controller 
is added 
to decode the signals 


from the CPU and to provide an expanded set of 
control signals to the rest of the system. The CPU 
uses the remaining 
free lines for a new set of 
signals designed to help coordinate 
the activities 


of other processors 
in the system. Sections 2.5 


and 2.6 describe the functions of these signals. 


The 
8086 and 
8088 can accommodate 
up to 
1,048,576 bytes of memory in both minimum and 
maximum 
mode. 
This 
section 
describes 
how 


memory 
is functionally 
organized 
and 
used. 


There 
are 
substantial 
differences 
in the 
way 


memory components are actually accessed by the 
two processors; 
these differences, 
which are in- 
visible to programs, 
are covered in section 4.2, 


External Memory Addressing. 


From a storage point of view, the 8086 and 8088 
memory spaces are organized as identical arrays 
of 8-bit bytes (see figure 2-11). Instructions, 
byte 
data and word data may be freely stored at any 
byte address without regard for alignment thereby 
saving memory 
space by alloWing code to be 


densely packed in memory (see figure 2-12). Odd- 
addressed (unaligned) 
word variables, 
however, 
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transfer 
16-bits at a time. Instruction 
alignment 
does not materially 
affect 
the performance 
of 
either processor. 


Following Intel convention, 
word data always is 


stored with the most-significant 
byte in the higher 
memory location 
(see figure 2-13). Most of the 
time this storage 
convention 
is "invisible" 
to 


anyone working with the processors; 
exceptions 
may occur when monitoring 
the system bus or 


when reading memory dumps. 


A special class of data is stored as doublewords; 
i.e., 
two consecutive 
words. 
These are called 


pointers and are used to address data and code 
that 
are 
outside 
the 
currently-addressable 


segments. The lower-addressed word of a pointer 
contains an offset value, and the higher-addressed 
word contains a segment base address. Each word 
is stored conventionally with the higher-addressed 
byte containing the most-significant 
eight bits of 


the word (see figure 2-14). 


8086 and 8088 programs "view" 
the megabyte of 


memory space as a group of segments that are 
defined by the application. 
A segment is a logical 
unit of memory that may be up to 64k bytes long. 
Each segment is made up of contiguous memory 
locations 
and 
is an 
independent, 
separately- 
addressable 
unit. Every segment is assigned (by 


software) 
a base address, 
which is its starting 


location in the memory space. All segments begin 
on 16-byte memory 
boundaries. 
There 
are no 
other restrictions on segment locations; segments 
may be adjacent, 
disjoint, 
partially overlapped, 


or fully overlapped 
(see figure 2-15). A physical 
memory location may be mapped into (contained 
in) one or more logical segments. 


The segment registers point to (contain the base 
address values of) the four currently addressable 
segments 
(see 
figure 
2-16). 
Programs 
obtain 
access to code and data 
in other segments by 


changing the segment registers to point to the 
desired segments. 


Every application 
will define and use segments 


differently. 
The currently 
addressable 
segments 


provide a generous 
work space: 64k bytes for 


code, a 64k byte stack and 128k bytes of data 
storage. 
Many 
applications 
can be written 
to 
simply initialize the segment registers and then 
forget 
them. 
Larger 
applications 
should 
be 
designed with careful consideration 
given to seg- 
ment definition. 
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The 
segmented 
structure 
of 
the 
8086/8088 


memory space supports modular software design 
by discouraging huge, monolithic programs. The 
segments also can be used to advantage in many 
programming 
situations. 
Take, for example, the 


case of an editor for several on-line terminals. A 
64k text buffer (probably an extra segment) could 
be assigned to each terminal. 
A single program 


could maintain all the buffers by simply changing 
register ES to point to the buffer of the terminal 
requiring service. 


It is useful to think of every memory location as 
having 
two 
kinds 
of 
addresses, 
physical 
and 


logical. A physical address is the 20-bit value that 
uniquely 
identifies 
each 
byte 
location 
in the 


megabyte memory space. Physical addresses may 
range from OH through FFFFFH. 
All exchanges 


between the CPU and memory components 
use 


this physical address. 


Programs deal with logical, rather than physical 
addresses and allow code to be developed without 
prior knowledge of where the code is to be located 
in memory and facilitate dynamic management of 
memory resources. A logical address consists of a 
segment base value and an offset value. For any 
given memory location, 
the segment base value 


locates the first byte of the containing 
segment 
and the offset value is the distance, in bytes, of 
the target 
location 
from 
the beginning 
of the 
segment. 
Segment 
base and 
offset 
values 
are 
unsigned 
16-bit quantities; 
the lowest-addressed 
byte in a segment has an offset of O. Many dif- 
ferent 
logical addresses 
can map to the same 
physical 
location 
as shown 
in figure 2-17. In 
figure 2-17, physical memory location 
2C3H is 
contained in two different overlapping segments, 
one beginning at 2BOH and the other at 2COH. 


Whenever the BIU accesses memory-to 
fetch an 
instruction 
or to obtain 
or store a variable-it 
generates 
a 
physical 
address 
from 
a 
logical 
address. This is done by shifting the segment base 
value four bit positions and adding the offset as 
illustrated in figure 2-18. Note that this addition 
process 
provides 
for 
modulo 
64k 
addressing 
(addresses wrap around 
from the end of a seg- 
ment to the beginning of the same segment). 


The BIU obtains the logical address of a memory 
location from different sources depending on the 
type of reference that is being made (see table 


2-2). Instructions always are fetched from the cur- 
rent code segment; IP contains the offset of the 
target instruction 
from the beginning of the seg- 
ment. Stack instructions 
always operate 
on the 
current stack segment; SP contains the offset of 
the top of the stack. 
Most variables 
(memory 
operands) 
are assumed to reside in the current 
data segment, although 
a program 
can instruct 
the BIU to access a variable in one of the other 
currently addressable 
segments. The offset of a 
memory variable is calculated 
by the EU. This 
calculation 
is based 
on 
the 
addressing 
mode 
specified in the instruction; 
the result is called the 
operand's 
effective 
address 
(EA). 
Section 
2.8 
covers addressing 
modes and effective address 
calculation in detail. 


Strings 
are 
addressed 
differently 
than 
other 
variables. The source operand of a string instruc- 
tion is assumed to lie in the current data segment, 
but another currently addressable segment may be 
specified. Its offset is taken from register SI, the 
source index register. The destination operand of 
a string instruction 
always resides in the current 
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Instruction 
Fetch 
CS 
NONE 
IP 
Stack Operation 
SS 
NONE 
SP 
Variable (except following) 
OS 
Cs,Es,sS 
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String Source 
OS 
CS,ES,SS 
SI 
String Destination 
ES 
NONE 
01 
BP Used As Base Register 
SS 
CS,Ds,Es 
Effective Address 


extra segment; its offset is taken from 01, the 
destination index register. The string instructions 
automatically adjust SI and DI as they process the 
strings one byte or word at a time. 


When register BP, the base pointer 
register, is 
designated as a base register in an instruction, 
the 


variable is assumed to reside in the current stack 
segment. Register BP thus provides a convenient 
way to address data on the stack; BP can be used, 
however, to access data in any of the other cur- 
rently addressable segments. 


In most cases, the BIU's segment assumptions are 
a convenience 
to programmers. 
It is possible, 


however, for a programmer to explicitly direct the 
BIU to access a variable in any of the currently 
addressable 
segments (the only exception is the 
destination operand of a string instruction which 
must be in the extra segment). This is done by 
preceding an instruction 
with a segment override 
prefix. This one-byte machine instruction tells the 
BIU which segment register to use to access a 
variable referenced in the following instruction. 


The segmented memory structure of the 8086 and 
8088 makes it possible to write programs that are 
position-independent, 
or dynamically relocatable. 


Dynamic relocation 
allows a multiprogramming 
or multitasking system to make particularly effec- 
tive use of available memory. Inactive programs 
can be written to disk and the space they occupied 
allocated 
to other programs. 
If a disk-resident 


program is needed later, it can be read back into 
any available 
memory 
location 
and 
restarted. 


Similarly, if a program needs a large contiguous 
block of storage, and the total amount is available 
only in nonadjacent 
fragments, 
other program 
segments can be compacted 
to free up a con- 


tinuous space. This process is shown graphically 
in figure 2-19. 


In order to be dynamically re!ocatable, a program 
must not load or alter its segment registers and 
must not transfer directly to a location outside the 
current code segment. In other words, all offsets 
in the program 
must be relative to fixed values 


contained in the segment registers~ This allows the 
program 
to be m6v'ed' anywhere in memory as 
long as the segment registers are updated to point 
to the new base addressesi Section 2.10 contains 
an 
example: 
that iillustrates 
dynamic 
code 
relocation. 
'." 


Stack Implementation 
1 
• 


Stacks in the 8086 and 8088 are implemented in 
memory and are located by the statk 
segment 
register'(SS) and the Mack pointer register (SP). A 
system may,have an unlimited number of stacks, 
and astad< may 'be up'to 64k bytes long, the max- 
imum length of a segment. (An attempt to expand 
a stack beyond 64k bytes overwrites the beginning 
of the stack.) One stack is directly addressable at 
a time; this is th€,curreIH stack, often referred to 
simply as "the" 
stack'. SS contains 
the base 
address of the' current stack {uid'SP 'points to the 
tdp of the stack (TOS). In other wor-ds, SP con~ 
tains the offset of the top' of the stack from the 


CODE 
SEGMENT 


STACK 
SEGMENT 


DATA 
SEG'MENT 


EXTRA 
SEGMENT 


stack segment's base address. Note, however, that 
the stack's base address (contained in SS) is not 
the "bottom" 
of the stack. 


8086 and 8088 stacks are 16 bits wide;' instruction's 
that operate 'on a stack add and remove stack 
items one word at a time. An item is pushed onto 
the stack (see figure 2-20) by decrementing 
SP by 
2 and writing the item at the new TOS. Art item is 
popped off the sta2k by copying it from TOS and 
then incrementing 
SP by 2. In other wotds, the 
stack grows down 
in memory 
toward 
its base 
address" Stack operations 
never move items on 
the stack, nor do they erase .them. The top of the 
stack changes only as a re~ult of updating 
the 
stack pointer. 


Dedicated 
and Reserved 
Memory 
Locations· 
! 
' 


Two areas in extreme low and high memory are 
dedicated to 'specific processor 
functions 
or are 
reserved by' Intel Corporation 
for use by Intel 
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POP BX 


hardware 
and software 
products. 
As shown in 


figure 2-21, the location are: OH throgh 7FH (128 
bytes) and FFFFOH through FFFFFH (16 bytes). 
These areas are used for interrupt 
and system 


reset 
processing 
8086 
and 
8088 
application 


systems should not use these areas for any other 
purpose. 
Doing 
so may 
make 
these 
systems 
incompatible with future Intel products. 


8086/8088 
Memory 
Access 
Differences 


The 8086 can access either 8 or 16 bits of memory 
at a time. 
If an instruction 
refers to a word 


variable and that variable is located at an even- 
numbered address, the 8086 accesses the complete 
word in one bus cycle. If the word is located at an 
odd-numbered 
address, 
the 
8086 accesses the 


word one byte at a time in two consecutive bus 
cycles. 


To maximize throughput 
in 8086-based systems, 


16-bit data shQuld be stored at even addresses 
(should be word-aligned). This is particularly true 
of stacks. Unaligned stacks can slow a system's 
response to interrupts. 
Nevertheless, 
except for 


the 
performance 
penalty, 
word 
alignment 
is 


totally transparent 
to software. This allows max- 
imum 
data 
packing 
where 
memory 
space 
is 


constrained. 


The 8086 always fetches the instruction stream in 
words from even addresses except that the first 
fetch after a program transfer to an odd address 
obtains 
a 
byte. 
The 
instruction 
stream 
is 


disassembled inside the processor and instruction 
alignment 
will not 
materially 
affect 
the 
per- 
formance of most systems. 


The 8088 always accesses memory in bytes. Word 
operands are accessed in two bus cycles regardless 
of their alignment. 
Instructions 
also are fetched 
one byte at a time. Although alignment of word 
operands does not affect the performance 
of the 
8088, locating 16-bit data on even addresses will 
insure maximum throughput 
if the system is ever 
transferred to an 8086. 


The 8086 and 8088 have a versatile set of in- 
put/output 
facilities. Both processors provide a 
large I/O space that is separate from the memory 


Figure 2-21. Reserved and Dedicated Memory 
and I/O Locations 


space, and instructions that transfer data between 
the CPU and devices located in the 1/0 
space. 
1/0 
devices also may be placed in the memory 
space to bring the power of the full instruction set 
and 
addressing 
modes 
to 
input/output 
pro- 
cessing. For high-speed transfers, 
the CPUs may 
be used with traditional 
direct memory 
access 
controllers or the 8089 Input/Output 
Processor. 


The 8086/8088 1/0 space can accommodate up to 
64k 8-bit ports or up to 32k 16-bit ports. The IN 
and OUT (input and output) instructions transfer 
data 
between 
the 
accumulator 
(AL 
for 
byte 
transfers, 
AX 
for 
word 
transfers) 
and 
ports 
located in the 1/0 space. 


The 1/0 space is not segmented; to access a port, 
the BIU simply places the port address (O-64k)on 
the lower 16 lines of the address bus. Different 
forms of the 1/0 instructions allow the address to 
be specified as a fixed value in the instruction or 
as a variable taken from register DX. 


Locations 
F8H through 
FFH (eight of the 64k 
locations) in the 1/0 space are reserved by Intel 
Corporation 
for use by future Intel hardware and 
software products. 
Using these locations for any 


other 
purpose 
may 
inhibit 
compatibility 
with 
future Intel products. 


The 8086 can transfer either 8 or 16 bits at a time 
to a device located in the 1/0 
space. A 16-bit 
device should be located at an even address so 
that the word will be transferred 
in a single bus 
cycle. An 8-bit device may be located at either an 
even 
or 
odd 
address; 
however, 
the 
internal 
registers in a given device must be assigned all- 
even or all-odd addresses. 


The 8088 transfers 
one byte per bus cycle. If a 
16-bit device is used in the 80881/0 
space, it must 
be capable 
of transferring 
words 
in the same 
fashion, Le., eight bits at a time in two bus cycles. 
(The 8089 Input/Output 
Processor can provide a 
straightforward 
interface between the 8088 and a 
16-bit 1/0 device.) An 8-bit device may be located 
at odd or even addresses in the 8088 I/O space 
and internal registers may be assigned consecutive 
addresses (e.g., IH, 2H, 3H). Assigning all-odd 
or all-even addresses to these registers, however, 
will simplify transferring 
the system to an 8086 
CPU. 


1/0 devices also may be placed in the 8086/8088 
memory space. As long as the devices respond like 
memory components, the CPU does not know the 
difference. 


Memory-mapped 
1/0 
provides 
additional 
pro- 
gramming 
flexibility. 
Any 
instruction 
that 
references memory may be used to access an 1/0 
port located in the memory space. For example, 
the MOY (move) instruction 
can transfer 
data 
between any 8086/8088 register and a port, or the 
AND, OR and TEST instructions may be used to 
manipulate 
bits in I/O device registers. In addi- 
tion, memory-mapped 
1/0 can take advantage of 
the 
8086/8088 
memory 
addressing 
modes. 
A 
group of terminals, for example, could be treated 
as an array in memory 
with an index register 


selecting a terminal in the array. Section 2.10 pro- 
vides examples of using the instruction 
set and 
addressing modes with memory-mapped 
1/0. 


Of course, a price must be paid for the added pro- 
gramming 
flexibility that memory-mapped 
1/0 
provides. Dedicating part of the memory space to 
I/O 
devices reduces 
the number 
of addresses 
available for memory, although with a megabyte 
of memory 
space this should rarely be a con- 
straint. Memory reference instructions 
also take 
longer to execute and are somewhat less compact 
than the simpler IN and OUT instructions. 


When configured 
in minimum 
mode, 
the 8086 
and 8088 provide HOLD (hold) and HLDA (hold 
acknowledge) 
signals that 
are compatible 
with 
traditional DMA controllers such as the 8257 and 
8237. A DMA controller 
can request use of the 
bus for direct transfer 
of data between an 110 
device and memory 
by activating 
HOLD. 
The 
CPU wil1complete the current bus cycle, if one is 
in progress, and then issue HLDA, granting the 
bus to the DMA controller. 
The CPU wil1 not 
attempt to use the bus until HOLD goes inactiv'e. 


The 8086 addresses 
memory 
that 
is physically 
organized in two separate banks, one containing 
even-addressed bytes and one containing odd-ad- 
dressed bytes. 
An 8-bit DMA controller 
must 
alternately 
select these banks to access logically 
adjacent bytes in memory. The 8089 provides a 
simple way to interface a high-speed 8-bit device 
to an 8086-based system (see Chapter 3). 


The 8086 and 8088 are designed to be used with 
the 8089 in high-performance 
110 applications. 
The 
8089 
conceptually 
res em bles 
a 
microprocessor 
with two DMA channels and an 
instruction set specifically tailored for 110 opera- 
tions. Unlike simple DMA controllers, 
the 8089 
can service 110 devices directly, 
removing this 
task from the CPU. In addition, 
it can transfer 
data on its own bus or on the system bus, can 
match 
8- or 16-bit peripherals 
to 8- or l6-bit 
buses, and can transfer 
data 
from memory to 
memory 
and from 
I/O 
device to I/O 
device. 
Chapter 3 describes the 8089 in detail. 


As 
microprocessor 
prices 
have 
declined, 
multiprocessing 
(using two or more coordinated 
processors in a system) has become an increas- 
ingly attractive 
design alternative. 
Performance 
can be substantially 
improved 
by distributing 
system tasks among separate, 
concurrently 
exe- 
cuting processors. 
In addition, 
multiprocessing 
encourages a modular approach to design, usually 
resulting in systems that are more easily main- 
tained and enhanced. 
For example, 
figure 2-22 
shows a multiprocessor 
system 
in which 
I/O 
activities have been delegated to an 8089 lOP. 
Should an 110 device in the system be changed 
(e.g., a hard disk substituted 
for a floppy), the 
impact of the modification 
is confined to the I/O 
subsystem and is transparent 
to the CPU and to 
the application software. 


The 
8086 
and 
8088 
are 
designed 
for 
the 
multiprocessing 
environment. 
They have built-in 
features that help solve the coordination 
prob- 
lems 
that 
have 
discouraged 
multiprocessing 
system development in the past. 


When configured 
in maximum 
mode, 
the 8086 
and 8088 provide the LOCK (bus lock) signal. 
The BIU activates LOCK when the EU executes 
the 
one-byte 
LOCK 
prefix 
instruction. 
The 
LOCK signal remains active throughout 
execu- 
tion of the instruction 
that follows the LOCK 
prefix. Interrupts 
are not affected by the LOCK 
prefix. If another 
processor requests use of the 
bus 
(via 
the 
request/grant 
lines, 
which 
are 
discussed shortly), the CPU rec'ords the request, 
but does not honor it until execution of the locked 
instruction has been co~pleted. 


Note that the LOCI( signal remains active for the 
duration 
of a single 
instruction. 
If two con- 
secutive 
instructions 
are 
each 
preceded 
by a 
LOCK prefix, 
there 
wil1stil1 
be an unlocked 
period between these instructions. 
In the case of a 
locked repeated 
string instruction, 
LOCK does 
remain 
active 
for 
the 
duration 
of 
the 
block 
operation. 


When the 8086 or 8088 is configured in minimum 
mode, 
the LOCK signal is' not available. 
The 
LOCK prefix can be used, however, to delay the 
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generation 
of an HLDA 
response to a HOLD 


request until execution of the locked instruction is 
completed. 


The LOCK signal provides information 
only. It is 


the responsibility 
of 
other 
processors 
on 
the 


shared bus to not attempt to obtain the bus while 
LOCK is active. 
If the system uses 8289 Bus 
Arbiters to control access to the shared bus, the 
8289's accept LOCK as an input and do not relin- 
quish the bus while this signal is active. 


LOCK may be used in multiprocessing systems to 
coordinate access to a common resource, such as 
a buffer or a pointer. If access to the resource is 
not 
controlled, 
one 
processor 
can 
read 
an 


erroneous value from the resource when another 
processor is updating it (see figure 2-23). 


Access can be controlled (see figure 2-24) by using 
the LOCK prefix in conjunction 
with the XCHG 


(exchange register with memory) instruction. The 
basis for controlling access to a given resource is a 
semaphore, a software-settable 
flag or switch that 
indicates 
whether 
the 
resource 
is "available" 
(semaphore=O) or "busy" 
(semaphore=l). 
Pro- 


cessors that share the bus agree by convention not 
to use the resource unless the semaphore indicates 


that it is available. They likewise agree to set the 
semaphore when they are using the resource and 
to clear it when they are finished. 


The XCHG 
instruction 
can obtain 
the current 


value of the semaphore and set it to "busy" 
in a 
single 
instruction. 
The 
instruction, 
however, 


requires two bus cycles to swap 8-bit values. It is 
possible for another processor to obtain the bus 
between these two cycles and to gain access to the 
partially-updated 
semaphore. 
This 
can 
be 


prevented 
by preceding 
the XCHG 
instruction 
with a LOCK prefix, as illustrated in figure 2-25. 
The bus lock establishes control over access to the 
semaphore and thus to the shared resource. 


The 
8086 and 
8088 (in 
either 
maximum 
or 


minimum mode) can be synchronized to an exter- 
nal event with the WAIT (wait for TEST) instruc- 
tion and the TEST input signal. When the EU 
executes aWAIT 
instruction, 
the result depends 


on the state of the TEST input line. If TEST is 
inactive, the processor 
enters an idle state and 


repeatedly 
retests the TEST 
line at five-clock 


intervals. If TEST is active, execution continues 
with the instruction following the WAIT. 


SHARED POINTER 


BUSCYCLE 
IN MEMORY 


~ 


~ 


~ 


~ 


The ESe (escape) instruction provides a way for 
another processor to obtain an instruction and/or 
a memory operand from an 8086/8088 program. 
When used in conjunction with WAIT and TEST, 
ESe 
can initiate. a "subroutine" 
that executes 
concurrently 
in another 
processor 
(see figure 
2-26). 


Six bits in the ESe instruction may be specified by 
the programmer 
when the instruction 
is written. 


By monitoring 
the 8086/8088 
bus and control 


lines, another 
processor 
can capture 
the ESe 
instruction when it is fetched by the BIU. The six 
bits may then direct the external processor to per- 
form some predefined activity. 


Figure 2-23. Uncontrolled 
Access to Shared 
Resource 


If 
the 
8086/8088 
is configured 
in maximum 


mode, the external processor, having determined 
that an ESe has been fetched, can monitor QSO 
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and QS I (the queue status lines, discussed in sec- 
tion 2.6) and determine when the ESe instruction 
is executed. If the instruction references memory 
the external processor can then monitor the bus 
and 
capture 
the 
operand's 
physical 
address 


and/or the operand itself. 
GETSEMA· 
PHORE & 
SET "BUSY" 
Note that fetching an ESe instruction is not tan- 
tamount 
to executing it. The ESe 
may be pre- 


ceded by a jump 
that causes the queue to be 


reinitialized. 
This event also can be determined 


from the queue status lines. 


SET 
MOV 
SEMAPHORE~ 
SEMAPHORE 
"AVAILABLE" 


When the 8086 or 8088 is configured in maximum 
mode, the HOLD and HLDA lines evolve into 
two more sophisticated 
signals called RQ/GTO 


and RQ/GTI. 
These are bidirectional 
lines that 


can be used to share a local bus between an 8086 
or 8088 and two other processors via a handshake 
sequence. 


( 
EXIT 


The request/grant 
sequence is a three-phase cycle: 


request, 
grant and release. First, 
the processor 


desiring the bus pulses a request/grant 
line. The 


epu 
returns a pulse on the same line indicating 


that it is entering the "hold 
acknowledge" 
state 


and is relinquishing the bus. The BIU is logically 
disconnected from the bus during this period. The 


0-8 


EU, 
however, 
will continue 
to 
execute 
instruc- 
tions 
until 
an instruction 
requires 
bus 
access 
or 
the 
queue 
is 
emptied, 
whichever 
occurs 
first. 


When 
the other 
processor 
has 
finished 
with 
the 
bus, 
it sendsia 
final 
pulse 
to the 
8086/8088 
in-' 


dicating 
that 
the request 
has ended 
and 
that 
the 
CPU may reclaim 
the bus. 


RQ/GTO 
has higher 
priority 
than 
RQ/GTI. 
If 
requests 
arrive 
simultaneously 
on both 
lines, 
the 
~nt....£2es 
to 
the 
processor 
on 
RQ/GTO 
and 
RQ/GTI 
is acknowledged 
after 
the bus 1}as been 
returned 
to 
the 
CPU. 
If, 
however, 
a 
request 
arrives 
on RQ/GTO 
while the CPU 
is processing 
a 
prior 
request 
on RQ/GTl, 
the second 
r~uest 
is 
not 
honored 
until 
the 
processor 
onRQ/GTI 
releases the bus. 


Intel 
has 
designed 
a 
general-purpose 
multiprocessing 
bus 
called 
the Multibus. 
This 
i~ 
the standard 
design 
used 
in iSBCTt-; single-board 
microcomputer 
products 
.. Many 
other 
manufac- 
turers 
offer 
products 
that are compatible 
with the 
Multibus 
.architecture 
as well. When 
the 8086 and 
8088 are configured 
in maximum 
mode, 
the 8288 
Bus Controller 
outputs 
signals that are electrically 
compatible 
with the Multibus 
protocol. 
Designers 
of multiprocessing 
systems 
may want 
to consider 
using 
the Multibus 
architecture 
in the design 
of 
their 
products 
to 
reduce 
development 
cost 
and 


time, 
and 
to obtain 
compatibility 
with 
the wide 
variety 
of boards 
available 
in the iSBC 
product 
line. 
. 


The 
Multibus 
architecture 
provides 
a versatile 
communications 
channel 
that can be used to coor- 
dinate 
a wide variety 
of computing 
modules 
(see 
figure 
2-27). 
Modules 
in a Multibus 
system 
are 
designated 
as 
masters 
or 
slaves. 
Masters 
may 
obtain 
use of the bus and initiate 
data transfers 
on 
it. Slaves 
are the objects 
of data 
transfers 
only. 
The Multibus 
architecture 
allows 
both 
8- and 
16- 
bit masters 
to be intermixed 
in a system. 
In addi- 
tion 
to 16 data 
lines, 
the bus design 
provides 
20 
address 
lines, 
eight multilevel 
interrupt 
lines, 
and 
control 
and arbitration 
lines. 
An auxiliary 
power 
bus 
also 
is provided 
to route 
standby 
power 
to 
memories 
if the normal 
supply 
fails. 


The 
Multibus 
architecture 
maintains 
its 
own 
clock, 
independent 
of the clocks 
of the modules 
it 
links together. 
This allows different 
speed masters 
to share 
the bus 
and 
allows 
masters 
to operate 
asynchronously 
with 
respect 
to each 
other. 
The 
arbitration 
logic 
of 
the 
bus 
permit 
slow-speed 
masters 
to compete 
equably 
for 
use of the bus. 


Once 
a module 
l1as obtained 
the bus, 
however, 


transfer 
speeds 
are 
dependent 
only 
on 
the 
capabilities 
of 
the 
transmitting 
and 
receiving 
modules. 
Finally, 
the Multibus 
standard 
defines 
the 
form 
factors 
and 
physical 
requirements 
of 
modules 
that 
communicate 
on 
this 
bus. 
For 
a 
complete 
description 
of 
the 
Multibus 
architec- 
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ture, 
refer to the Intel Multibus 
Specification 
(document 
number 
9800683) and 
Application 
Note 28A, "Intel Multibus Interfacing." 


Multiprocessor 
systems require a means of coor- 
dinating the processors' 
use of the shared bus. 
The 8289 Bus Arbiter works in conjunction 
with 


the 8288 Bus Controller 
to provide this control 


for 8086- and 8088-based systems. It is compati- 
ble with the Multibus architecture and can be used 
in other shared-bus designs as well. 


The 8289 eliminates race conditions, 
resolves bus 


contention 
and 
matches 
processors 
operating 


asynchronously 
with respect to each other. Each 


processor on the bus is assigned a different pri- 
ority. When simultaneous 
requests 
for the bus 


arrive, the 8289 resolves the contention and grants 
the bus to the processor with the highest priority; 
three 
different 
prioritizing 
techniques 
may be 


used. Chapter 4 discusses the 8289 in more detail. 


2.6 Processor Control and 
Monitoring 


The 8086 and 8088 have a simple and versatile 
interrupt 
system. Every interrupt 
is assigned a 


type code that identifies it to the CPU. The 8086 


and 8088 can handle up to 256 different interrupt 
types. 
Interrupts 
may be initiated 
by devices 
external to the CPU; in addition, they also may be 
triggered by software interrupt 
instructions 
and, 


under certain conditions, 
by the CPU itself (see 
figure 
2-28). Figure 
2-29 illustrates 
the 
basic 


response of the 8086 and 8088 to an interrupt. 
The next sections elaborate 
on the information 
presented in this drawing. 


The 8086 and 8088 have two lines that external 
devices may use to signal interrupts 
(INTR and 
NMI). 
The 
INTR 
(Interrupt 
Request) 
line is 


usually driven by an Intel® 8259A Programmable 
Interrupt Controller (PIC), which is in turn con- 
nected to the devices that need interrupt services. 
The 8259A is a very flexible circuit that is con- 
trolled by software commands 
from the 8086 or 
8088 (the PIC appears as a set of I/O ports to the 
software). 
Its main job 
is to accept 
interrupt 


requests from the devices attached 
to it, deter- 
mine which requesting 
device has the highest 


priority, 
and then activate the 8086/8088 
INTR 
line if the selected device has higher priority than 
the device currently 
being serviced (if there is 


one). 


When INTR is active, the CPU takes different 
action depending 
on the state of the interrupt- 
enable flag (IF). No action takes place, however, 
until the currently-executing 
instruction has been 
--- 
8259A ----- 


completed. * Then, 
if IF is clear (meaning that 
interrupts 
signaled on INTR are masked or dis- 
abled), the CPU ignores the interrupt request and 
processes the next instruction. The INTR signal is 
not latched by the CPU, so it must be held active 
until a response 
is received or the request 
is 
withdrawn. 
If interrupts on INTR are enabled (if 
IF is set), then the CPU recognizes the interrupt 
request and processes it. Interrupt requests arriv- 
ing on INTR can be enabled by executing an STI 
(set interrupt-enable 
flag) instruction, 
and dis- 
abled by executing a CLi (clear interrupt-enable 
flag) instruction. 
They also may be selectively 
masked (some types enabled, some disabled) by 
writing commands 
to the 8259A. It should be 
noted that in order to reduce the likelihood of 
excessive 
stack 
buildup, 
the 
STI 
and 
IRET 
instructions 
will reenable 
interrupts 
only after 
the end of the following instruction. 


The CPU acknowledges the interrupt request by 
executing two consecutive interrupt acknowledge 
(INT A) bus cycles. If a bus hold request arrives 
(via the HOLD or request/grant 
lines) during the 
INTA cycles, it is not honored 
until the cycles 
have been completed. 
In addition, 
if the CPU is 
configured 
in maximum 
mode, 
it activates the 
LOCK signal during these cycles to indicate to 
other processors that they should not attempt to 
obtain the bus. The first cycle signals the 8259A 
that the request has been honored. 
During the 
second INTA cycle, the 8259A responds by plac- 
ing a byte on the data bus that contains the inter- 
rupt 
type 
(0-255) associated 
with 
the 
device 
requesting service. (The type assignment is made 
when the 8259A is initialized by software in the 
8086 or 8088.) The CPU reads this type code and 
uses 
it 
to 
call 
the 
corresponding 
interrupt 
procedure. 


An external interrupt request also may arrive on 
another 
CPU 
line, 
NMI 
(non-mask able inter- 
rupt). This line is edge-triggered 
(lNTR is level- 
triggered) and is generally used to signal the CPU 
of a "catastrophic" 
event, such as the imminent 
loss of power, 
memory 
error 
detection 
or bus 
parity error. Interrupt 
requests arriving on NMI 
cannot be disabled, are latched by the CPU, and 
have higher priority than an interrupt request on 
INTR. 
If an interrupt 
request arrives on both 
lines during the execution of an instruction, 
NMI 
will be recognized first. Non-maskable 
interrupts 
are predefined as type 2; the processor does not 
need to be supplied with a type code to call the 
NMI procedure, and it does not run the INT A bus 
cycles in response to a request on NMI. 


The time required for the CPU to recognize an 
external 
interrupt 
request 
(interrupt 
latency) 
depends on how many clock periods remain in the 
execution 
of 
the 
current 
instruction. 
On 
the 
average, 
the 
longest 
latency 
occurs 
when 
a 
multiplication, 
division or variable-bit 
shift or 
rotate instruction is executing when the interrupt 
request 
arrives 
(see 
section 
2.7 
for 
detailed 
instruction 
timing 
data). 
As 
mentioned 
pre- 
viously, in a few cases, worst-case latency will 
span two instructions rather than one. 


An INT (interrupt) instruction generates an inter- 
rupt immediately upon completion 
of its execu- 
tion. The interrupt type coded into the instruction 
supplies the CPU with the type code needed to 
call the procedure to process the interrupt. 
Since 
any type code may be specified, software inter- 
rupts may be used to test interrupt 
procedures 
written to service external devices. 


*There 
are a few cases in which 
an interrupt 
request 
is not recognized 
until after the following instruction. 
Repeat, 
LOCK 
and segment 
override 
prefixes 
are considered 
"part 
of" 
the instructions 
they prefix; 
no interrupt 
is recognized 
between 
execution 
of a prefix 
and 
an instruction. 
A MOV 
(move) 
to segment 
register 
instruction 
and 
a POP 
segment 
register 
instruction 
are treated 
similarly: 
no interrupt 
is recognized 
until after 
the following 
instruction. 
This mechanism 
protects 


a program 
that 
is changing 
to a new stack 
(by updating 
SS and SP). 
If an interrupt 
were recognized 
after 
SS had 
been 
changed, 
but before 
SP had been altered, 
the processor 
would 
push the flags, CS and IP into the wrong 
area of memory. 


It follows 
from 
this that 
whenever 
a segment 
register 
and another 
value must 
be updated 
together, 
the segment 
register 


should 
be changed 
first, 
followed 
immediately 
by the instruction 
that changes 
the other 
value. 
There 
are also two cases, 


WAIT 
and repeated 
string 
instructions, 
where 
an interrupt 
request 
is recognized 
in the middle 
of an instruction. 
In these 
cases, interrupts 
are accepted 
after 
any completed 
primitive 
operation 
or wait test cycle. 


If the overflow flag (OF) is set, an INTO (inter- 
rupt on overflow) instruction 
generates a type 4 


interrupt 
immediately 
upon 
completion 
of its 
execution. 


The 
CPU 
itself 
generates 
a type 0 interrupt 


immediately 
following 
execution 
of a DIY or 


IDlY (divide, integer divide) instruction 
if the 


calculated 
quotient 
is larger than the specified 
destination. 


If the trap flag (TF) is set, the CPU automatically 
generates 
a type 
1 interrupt 
following 
every 
instruction. 
This is called single-step execution 


and is a powerful debugging tool that is discussed 
in more detail shortly. 


All internal interrupts (lNT, INTO, divide error, 
and single-step) share these characteristics: 


I. 
The interrupt type code is either contained in 
the instruction or is predefined. 


2. 
No INTA bus cycles are run. 


3. 
Internal interrupts cannot be disabled, except 
for single-step. 


4. 
Any internal 
interrupt 
(except single-step) 


has higher priority than any external inter- 
rupt 
(see table 
2-3). If interrupt 
requests 


arrive on NMI and/or 
INTR during execu- 


tion of an instruction that, causes an internal 
interrupt 
(e.g., 
divide error), 
the internal 


interrupt is processed first. 


The interrupt 
pointer (or interrupt 
vector) table 


(figure 2-30) is the link between an interrupt type 
code and the procedure that has been designated 
to service interrupts 
associated 
with that code. 


The interrupt pointer table occupies up to the first 
Ik bytes of low memory. There may be up to 256 
entries in the table, one for each interrupt 
type 


AVAILABLE 
INTERRUPT 
POINTERS 
(224) 


RESERVED 
INTERRUPT 
POINTERS 
(27) 


DEDICATED 
INTERRUPT 
POINTERS 
(5) 


3FFH-------- ..• 


- 
TY~~Ji~(A~(~:)ER 
- 


3FCHI- 
-f 


- 
TYPE 33 POINTER: - 
(AVAILABLE) 


- 
TYPE 32 POINTER: - 
(AVAILABLE) 


- 
TYPE 31 POINTER: - 
(RESERVED) 


CS BASE ADDRESS 
----- 
IP OFFSET 


- 
TYPE 5 POINTER: - 
(RESERVED) 


- 
TYPE 4 POINTER: 
- 
OVERFLOW 


1.By~~~~i 
iNO~f:~~~ION 
- 
TYPE 2 POINTER: 
- 
NON·MASKABLE 
- 
TYPE 1 POINTER: 
- 
SINGLE·STEP 
- 
TYPE 0 POINTER: 
- 
DIVIDE 
ERROR 


OOOH 1-16 BITs_I 


that can occur in the system. Each entry in the 
table 
is a doubleword 
pointer 
containing 
the 


address of the procedure that is to service inter- 
rupts of that type. The higher-addressed 
word of 


the pointer contains the base address of the seg- 
ment containing 
the procedure. 
The lower-ad- 


dressed 
word 
contains 
the 
procedure's 
offset 


from the beginning of the segment. Since each 
entry is four bytes long, the CPU can calculate the 
location of the correct entry for a given interrupt 
type by simply multiplying (type*4). 


. 
INTERRUPT 
PRIORITY 


Divideerror, INT n, INTO 
highest 
NMI 
INTR 
Single-step 
lowest 


Space at the high end of the table that would be 
occupied by entries for interrupt types that cannot 
occur in a given application may be used for other 
purposes. The dedicated and reserved portions of 
the interrupt pointer table (locations OH through 
7FH), however, should not be used for any other 
purpose to insure proper system operation and to 
preserve compatibility with future Intel hardware 
and software products. 


After pushing the flags onto the stack, the 8086 or 
8088 activates 
an interrupt 
procedure 
by exe- 
cuting the equivalent of an intersegment indirect 
CALL instruction. 
The target of the "CALL" 
is 


the address 
contained 
in the interrupt 
pointer 


table element located at (type*4). The CPU saves 
the address of the next instruction by pushing CS 
and IP onto the stack. These are then replaced by 
the second and first words of the table element, 
thus transferring control to the procedure. 


If 
multiple 
interrupt 
requests 
arrive 
simulta- 


neously, the processor activates the interrupt pro- 
cedures in priority order. Figure 2-31 shows how 
procedures would be activated in an extreme case. 
The processor is running in single-step mode with 
external interrupts enabled. During execution of a 
divide instruction, 
INTR is activated. 
Further- 


more the instruction 
generates 
a divide error 


interrupt. 
Figure 2-31 shows that the interrupts 


are recognized 
in turn, 
in the order 
of their 


priorities except for INTR. 
INTR is not recog- 


nized until after the following instruction because 
recognition of the earlier interrupts cleared IF. Of 
couse interrupts could be reenabled in any of the 
interrupt response routines if earlier response to 
INTR is desired. 


As figure 2-31 shows, all main-line code is exe- 
cuted in single-step mode. Also, because of the 
order 
of interrupt 
processing, 
the opportunity 


exists in each occurrence of the single-step routine 
to 
select 
whether 
pendil}g 
interrupt 
routines 


(divide error and INTR routines in this example) 
are executed at full speed or in single-step mode. 


When an interrupt 
service procedure 
is entered, 


the flags, CS, and IP are pushed onto the stack 
and TF and IF are cleared. The procedure 
may 


reenable 
external 
interrupts 
with the STI (set 


interrupt-enable 
flag) instruction, 
thus allowing 
itself to be interrupted 
by a request on INTR. 


(Note, however, that interrupts 
are not actually 


enabled until the instruction 
following 
STI has 


executed.) An interrupt procedure always may be 
interrupted 
by 
a 
request 
arriving 
on 
NMI. 


Software- 
or 
processor-initiated 
interrupts 


occurring within the procedure also will interrupt 
the procedure. 
Care must be taken in interrupt 


procedures that the type of interrupt 
being ser- 
viced by the procedure 
does not itself inadver- 


tently occur within the procedure. 
For example, 


an attempt to divide by 0 in the divide error (type 
0) interrupt procedure may result in the procedure 
being reentered 
endlessly. 
Enough 
stack space 


must be available to accommodate 
the maximum 


depth of interrupt 
nesting that can occur in the 


system. 


Like all procedures, 
interrupt 
procedures should 


save any registers they use before updating them, 
and restore them before terminating. 
It is good 


practice 
for an interrupt 
procedure 
to enable 


external interrupts 
for all but "critical 
sections" 


of code (those sections that cannot be interrupted 
without 
risking 
erroneous 
results). 
If external 
interrupts 
are disabled 
for too long in a pro- 


cedure, interrupt 
requests on INTR can poten- 
tially be lost. 


I 


RnR~ 
lI•••n 1nAA ~~NTRdl 
PRnr.~~~ING 
UNITS 
lF = 1 


DIVIDE 
• 
INTR 
INSTRUCTION 


DIVIDE ERROR RECOGNIZED 
! 


PUSH FLAGS 
PUSH CS & IP 
CLEAR IF & TF 
EXECUTE NEXT 
INSTRUCTION 
ISINGLE STEP RECOGNIZED 


~ 


~ 
PUSH FLAGS 
PUSH CS& IP 
CLEAR IF &TF 


DIVIDE ERROR 
PROCEDURE 
I 
I 
SINGLE STEP 
PROCEDURE" 


POP CS & IP 
POP FLAGS 
I 


TF=1,IF=1 
I 
POP CS & IP 
POP FLAGS 


INTR RECOGNIZED 


~ 
TF=O,IF=O 
I 


PUSH FLAGS 
PUSH CS & IP 
CLEAR IF &TF 


EXECUTE NEXT 
INSTRUCTION 
I SINGLE STEP RECOGNIZED 


I 
! 
I 
I 
! 
I 
PUSH FLAGS 
I 
PUSH CS & IP 
I 
CLEAR IF &TF 


I 
INTR 
PROCEDURE 
I 
I 
I 
I 
I 
SINGLE STEP 
PROCEDURE" 


POP CS & IP 
POP FLAGS 
I 


TF=1,IF=1 
I 
POPCS 
& IP 


TF CAN BE SET IN THE 
POP FLAGS 


SINGLE STEP PROCEDURE 
IF SINGLE 
STEPPING 
OF 
I 
THE DIVIDE ERROR DR INTR 
PROCEDURE 
IS DESIRED. 
TF=O,IF=O 


All interrupt 
procedures 
should 
be terminated 
with an IRET (interrupt 
return) instruction. 
The 
IRET instruction assumes that the stack is in the 
same condition as it was when the procedure was 
entered. It pops the top three stack words into IP, 
CS and the flags, thus returning to the instruction 
that was about to be executed when the interrupt 
procedure was activated. 


The actual processing done by the procedure 
is 
dependent upon the application. 
If the procedure 
is servicing an external device, it should output a 
command to the device instructing it to remove its 
interrupt 
request. 
It 
might 
then 
read 
status 
information 
from the device, determine the cause 
of the interrupt and then take action accordingly. 
Section 2.10 contains three typical interrupt pro- 
cedure examples. 


Software-initiated 
interrupt 
procedures 
may be 
used as service routines ("supervisor 
calls") for 
other programs 
in the system. In this case, the 
interrupt procedure is activated when a program, 
rather than an external device, needs attention. 
(The "attention" 
might be to search a file for a 
record, 
send 
a message 
to another 
program, 
request an allocation of free memory, etc.) Soft- 
ware interrupt procedures can be advantageous in 
systems that dynamically relocate programs dur- 
ing execution. Since the interrupt pointer table is 
at 
a 
fixed 
storage 
location, 
procedures 
may 
"call" 
each other through 
the table by issuing 
software interrupt 
instructions. 
This provides a 
stable 
communication 
"exchange" 
that 
is 
independent 
of procedure 
addresses. 
The inter- 
rupt procedures may themselves be moved so long 
as the interrupt pointer table always is updated to 
provide the linkage from the "calling" 
program 
via the interrupt type code. 


When TF (the trap flag) is set, the 8086 or 8088 is 
said to be in single-step mode. In this mode, the 
processor automatically 
generates a type 1 inter- 
rupt after each instruction. 
Recall that as part of 
its interrupt 
processing, the CPU automatically 
pushes the flags onto the stack and then clears TF 
and IF. Thus the processor is not in single-step 
mode when the single-step interrupt procedure is 
entered; it runs normally. 
When the single-step 
procedure 
terminates, 
the 
old 
flag 
image 
is 
restored from the stack, placing the CPU back 
into single-step mode. 


Single-stepping is a valuable debugging tool. It 
allows the single-step procedure to act as a "win- 
dow" 
into the system through 
which operation 
can 
be observed 
instruction-by-instruction. 
A 
single-step interrupt procedure, 
for example, can 
print or display register contents, the value of the 
instruction 
pointer 
(it 
is on 
the 
stack), 
key 
memory variables, etc., as they change after each 
instruction. 
In this way the exact flow of a pro- 
gram can be traced in detail, and the point at 
which discrepancies 
occur 
can be determined. 
Other possible services that could be provided by 
a single-step routine include: 


• 
Writing a message when a specified memory 
location or I/O port changes value (or equals 
a specified value). 


• 
Providing 
diagnostics 
selectively (only for 
certain instruction addresses for instance). 


• 
Letting a routine execute a number of times 
before providing diagnostics. 


The 8086 and 8088 do not have instructions 
for 
setting or clearing TF directly. Rather, TF can be 
changed 
by modifying 
the 
flag-image 
on the 
stack. The PUSHF 
and POPF 
instructions 
are 
available 
for 
pushing 
and 
popping 
the 
flags 
directly (TF can be set by ORing the flag-image 
with OIOOH and 
cleared 
by ANDing 
it with 
FEFFH). After TF is set in this manner, the first 
single-step 
interrupt 
occurs 
after 
the 
first 
instruction 
following the IRET from the single- 
step procedure. 


If the processor is single-stepping, it processes an 
interrupt (either internal or external) as follows. 
Control is passed normally (flags, CS and IP are 
pushed) to the procedure designated to handle the 
type of interrupt 
that has occurred. 
However, 
before the first instruction 
of that procedure 
is 
executed, 
the 
single-step 
interrupt 
is "recog- 
nized" and control is passed normally (flags, CS 
and IP are pushed) to the type 1 interrupt 
pro- 
cedure. When single-step procedure 
terminates, 
control 
returns 
to the previous 
interrupt 
pro- 
cedure. Figure 2-31 illustrates 
this process in a 
case where two interrupts 
occur when the pro- 
cessor is in single-step mode. 


A type 3 interrupt is dedicated to the breakpoint 
interrupt. A breakpoint is generally any place in a 
program where normal execution is arrested 
so 


that some sort of special processing may be per- 
formed. 
Breakpoints 
typically are inserted into 
programs during debugging as a way of display- 
ing registers, memory locations, 
etc., at crucial 
points in the program. 


The INT 3 (breakpoint) 
instruction 
is one byte 
long. This makes it easy to "plant" 
a breakpoint 
anywhere in a program. Section 2.10 contains an 
example that shows how a breakpoint may be set 
and how a breakpoint 
procedure may be used to 
place the processor into single-step mode. 


The breakpoint 
instruction 
also may be used to 
"patch" 
a program 
(insert 
new instructions) 
without recompiling or reassembling it. This may 
be done by saving an instruction byte, and replac- 
ing it with an INT 3 (CCH) machine instruction. 
The breakpoint procedure would contain the new 
machine 
instructions, 
plus code to restore the 
saved instruction 
byte and decrement IP on the 
stack 
before 
returning, 
so that 
the 
displaced 
instruction 
would 
be executed after 
the patch 
instructions. 
The breakpoint 
example in section 
2.10 illustrates these principles. 


Note that patching a program requires machine- 
instruction 
programming 
and should be under- 
taken with considerable caution; it is easy to add 
new bugs to a program in an attempt to correct 
existing ones. Note also that a patch is only a tem- 
porary measure to be used in exceptional condi- 
tions. The affected code should be updated and 
retranslated as soon as possible. 


The 8086/8088 RESET line provides an orderly 
way to start or restart an executing system. When 
the processor detects the positive-going edge of a 
pulse on RESET, it terminates all activities until 
the signal goes low, at which time it initializes the 
system as shown in table 2-4. 


Since the code segment register contains FFFFH 
and the instruction pointer contains OH, the pro- 
cessor 
executes 
its 
first 
instruction 
following 
system 
reset 
from 
absolute 
memory 
location 
FFFFOH. 
This 
location 
normally 
contains 
an 
intersegment direct JMP instruction whose target 
is the actual beginning of the system program. 
The LOC-86 utility supplies this JMP instruction 
from information 
in the program that identifies 
its first instruction. 
As external (maskable) inter- 


rupts are disabled 
by system reset, the system 
software should reenable interrupts as soon as the 
system is initialized to the point where they can be 
processed. 


CPU COMPONENT 
CONTENT 


Flags 
Clear 
Instruction 
Pointer 
OOOOH 
CS Register 
FFFFH 
OS Register 
OOOOH 
SS Register 
OOOOH 
ES Register 
OOOOH 
Queue 
Empty 


When configured 
in maximum 
mode, the 8086 
and 8088 provide information 
about instruction 
queue operations on lines QSOand QS 1. Table 2-5 
interprets 
the four 
states 
that 
these lines can 
represent. 


The queue status lines are provided for external 
processors 
that 
receive 
instructions 
and/or 
operands via the 8086/8088 ESC (escape) instruc- 
tion (see sections 2.5 and 2.8). Such a processor 
may monitor the bus to see when an ESC instruc- 
tion is fetched 
and then track 
the instruction 
through the queue to determine' when (and if) the 
instruction is executed. 


Table 2-5. Queue Status Signals 
(Maximum 
Mode Only) 


QSo 
QS1 
QUEUE 
OPERATION 
IN LAST 


CLKCYCLE 


0 
0 
No operation; 
default 
value 


0 
1 
First 
byte 
of 
an 
instruction 
was 
taken 
from 
the queue 


1 
0 
QlJeue 
was reinitialized 


1 
1 
Subsequent 
byte 
of 
an 
instruction 
was taken 
from 
the queue 


When the HLT (halt) instruction (see section 2.7) 
is executed, the 8086 or 8088 enters the halt state. 
This condition 
may be interpreted 
as "stop 
all 


operations 
until an external interrupt 
occurs or 
the system is reset." No signals are floated during 
the halt state, and the content of the address and 
data 
buses is undefined. 
A bus hold 
request 
arriving on the HOLD line (minimum mode) or 
either 
request/grant 
line (maximum 
mode) 
is 
acknowledged 
normally 
while the processor 
is 
halted. 


The halt state can be used when an event prevents 
the system from functioning correctly. An exam- 
ple 
might 
be 
a 
power-fail 
interrupt. 
After 
recognizing that loss of power is imminent, 
the 
CPU 
could 
use the 
remaining 
time 
to move 
registers, flags and vital variables to (for example) 
a battery-powered 
CMOS RAM area and then 
halt until the return of power was signaled by an 
interrupt or system reset. 


When configured 
in maximum 
mode, the 8086 
and 8088 emit eight status si.gnalsthat can be used 
by external devices. Lines SO, S1 and ~ 
identify 
the type of bus cycle that the CPU is starting to 
execute 
(table 
2-6). 
These 
lines are 
typically 
decoded by the 8288 Bus Controller. 
S3 and S4 
indicate which segment register was used to con- 
struct the physical address being used in this bus 
cycle (see table 2-7). Line S5 reflects the state of 
the interrupt-enable 
flag. S6 is always O. S7 is a 
spare line whose content is undefined. 


S2 
S1 
So 
TYPES 
OF BUS CYCLE 


0 
0 
0 
Interrupt 
Acknowledge 
0 
0 
1 
Read I/O 
0 
1 
0 
Write 
1/0 
0 
1 
1 
HALT 
1 
0 
0 
Instruction 
Fetch 
1 
0 
1 
Read Memory 
1 
1 
0 
Write Memory 
1 
1 
1 
Passive; 
no bus cycle 


S4 
S3 
SEGMENT 
REGISTER 


0 
0 
ES 
0 
1 
SS 
1 
0 
CS or none (1/0 
or Interrupt 
Vector) 
1 
1 
OS 


The 8086 and 
8088 execute exactly 
the same 
instructions. 
This 
instruction 
set 
includes 
equivalents to the instructions typically found in 
previous microprocessors, 
such as the 8080/8085. 
Significant new operations include: 


• 
multiplication 
and division 
of signed and 
unsigned binary numbers as well as unpacked 
decimal numbers, 


• 
move, 
scan 
and 
compare 
operations 
for 
strings up to 64k bytes in length, 


• 
non-destructive bit testing, 


• 
byte translation from one code to another, 


• 
software-generated 
interrupts, and 


• 
a 
group 
of 
instructions 
that 
can 
help 
coordinate 
the activities of multiprocessor 
systems. 


These 
instructions 
treat 
different 
types 
of 
operands uniformly. Nearly,every instruction can 
operate on either byte or word data. 
Register, 


memory 
and 
immediate 
operands 
may 
be 
specified interchangeably in most instructions (ex- 
cept, of course, that immediate values may only 
serve 
as 
"source" 
and 
not 
"destination", 


operands). In particular, memory variables can be 
added to, subtracted 
from, 
shifted, 
compared, 
and so on, in place, without moving them in and 
out of registers. This saves instructions, 
registers, 
and execution 
time in assembly language 
pro- 
grams. 
In 
high-level 
languages, 
where 
most 
variables are memory based, compilers, such as 
PL/M-86, 
can produce faster and shorter object 
programs. 


The 8086/8088 instruction 
set can be viewed as 
existing at two levels: the assembly level and the 
machine level. To the assembly, language 
pro- 
grammer, 
the 8086 and 8088 appear 
to have a 
repertoire of about 
100 instructions. 
One MOY 
(move) instruction, 
for example, transfers a byte 
or a word from a register or a memory location or 
an immediate 
value to either 
a register 
or a 
memory 
location. 
The 
8086 and 
8088 CPUs, 
however, recognize 28 different 
MOY machine 
instructions 
("move 
byte register to memory," 


"move word immediate to register," 
etc.). The 
ASM-86 assembler translates 
the assembly-level 
instructions 
written by a programmer 
into the 


The two levels of the instruction 
set address 
two 
different 
requirements: 
efficiency 
and simplicity. 
The numerous-there 
are about 
300 in all-forms 
of machine-level 
instructions 
allow 
these instruc- 


tions 
to make 
very efficient 
use of storage. 
For 
example, 
the machine 
instruction 
that increments 
a memory 
operand 
is three 
or 
four 
bytes 
long 
because 
the 
address 
of 
the 
operand 
must 
be 
encoded 
in 
the 
instruction. 
To 
increment 
a 
register, 
however, 
does 
not 
require 
as 
much 


information, 
so the instruction 
can be shorter. 
In 
fact, 
the 
8086 
and 
8088 
have 
eight 
different 
machine-level 
instructions 
that 
increment 
a dif- 
ferent 
16-bit register; 
these 
instructions 
are only 
one byte long. 


If a programmer 
had 
to write one instruction 
to 
increment 
a 
register, 
another 
to 
increment 
a 
memory 
variable, 
etc., 
the 
benefit 
of 
compact 
instructions 
would 
be offset 
by the difficulty 
of 
programming. 
The 
assembly-level 
instructions 
simplify 
the programmer's 
view of the instruction 


set. The programmer 
writes one form of the INC 
(increment) 
instruction 
and 
the 
ASM-86 
assembler 
examines 
the 
operand 
to 
determine 


which machine-level 
instruction 
to generate. 


This 
section 
presents 
the 
8086/8088 
instruction 


set from 
two 
perspectives. 
First, 
the 
assembly- 
level 
instructions 
are 
described 
in 
functional 


terms. 
The 
assembly-level 
instructions 
are 
then 


presented 
in a reference 
table 
that 
breaks 
out all 


permissible 
operand 
combinations 
with execution 
times 
and 
machine 
instruction 
length, 
plus 
the 
effect 
that 
the instruction 
has on the CPU 
flags. 
Machine-level 
instruction 
encoding 
and decoding 


are covered 
in section 
4.2. 


The 14 data 
transfer 
instructions 
(table 
2-8) move 


single 
bytes 
and 
words 
between 
memory 
and 
registers 
as well as between 
register 
AL or AX and 
I/O 
ports. 
The 
stack 
manipulation 
instructions 
are included 
in this group 
as are instructions 
for 
transferring 
flag 
contents 
and 
for 
loading 
seg- 
ment registers. 


MOV 
PUSH 
POP 
XCHG 
XLAT 


Move byte or word 
Push word onto stack 
Pop word off stack 
Exchange 
byte or word 
Translate 
byte 


IN 
OUT 
Input byte or word 
Output byte or word 


LEA 
LDS 
LES 


Load effective 
address 
Load pointer 
using DS 
Load pointer 
using ES 


LAHF 
SAHF 
PUSHF 
POPF 


Load AH register 
from flags 


Store AH register 
in flags 
Push flags onto stack 
Pop flags off stack 


MOY transfers 
a byte or a word 
from 
the source 
operand 
to the destination 
operand. 


PUSH 
decrements 
SP (the stack 
pointer) 
by two 
and 
then 
transfers 
a 
word 
from 
the 
source 
operand 
to the top of stack now pointed 
to by SP. 
PUSH 
often 
is used 
to place 
parameters 
on the 
stack before 
calling 
a procedure; 
more generally, 
it is the basic means 
of storing 
temporary 
data on 
the stack. 


POP transfers 
the word at the current 
top of stack 
(pointed 
to by SP) 
to the 
destination 
operand, 


and 
then 
increments 
SP by two 
to point 
to the 
new top of stack. 
POP 
can be used to move tem- 


porary 
variables 
from 
the 
stack 
to registers 
or 
memory. 


XCHG (exchange) switches the contents 
of the 
source and destination 
(byte or word) operands. 
When used in conjunction with the LOCK prefix, 
XCHG can test and set a semaphore that controls 
access to a resource shared by multiple processors 
(see section 2.5). 


XLAT 
(translate) 
replaces 
a byte 
in the AL 
register with a byte from a 256-byte, user-coded 
translation table. Register BX is assumed to point 
to the beginning of the table. The byte in AL is 
used as an index into the table and is replaced by 
the byte at the offset in the table corresponding to 
AL's binary value. The first byte in the table has 
an offset of O. For example, if AL contains 5H, 
and the sixth element of the translation table con- 
tains 33H, then AL will contain 33H following 
the instruction. 
XLAT is useful for translating 
characters from one code to another, 
the classic 
example being ASCII to EBCDIC or the reverse. 


IN transfers a byte or a word from an input port 
to the AL register or the AX register, respectively. 
The port number may be specified either with an 
immediate byte constant, allowing access to ports 
numbered 
0 through 
255, or with a number 
previously 
placed in the DX register, 
allowing 
variable access (by changing the value in DX) to 
ports numbered from 0 through 65,535. 


OUT transfers 
a byte or a word from the AL 
register or the AX register, respectively, to an out- 
put port. The port number may be specified either 
with an immediate byte constant, allowing access 
to ports 
numbered 
0 through 
255, or with a 
number previously placed in register DX, allow- 
ing variable access (by changing the value in DX) 
to ports numbered from 0 through 65,535. 


These instructions 
manipulate 
the addresses of 
variables rather 
than the contents 
or values of 
variables. They are most useful for list process- 
ing, based variables, and string operations. 


LEA (load effective address) transfers the offset 
of the source operand (rather than its value) to the 
destination operand. The source operand must be 
a memory operand, 
and the destination 
operand 
must be a 16-bit general register. LEA does not 
affect any flags. The XLAT and string instruc- 
tions 
assume 
that 
certain 
registers 
point 
to 
operands; LEA can be used to load these registers 
(e.g., loading BX with the address of the translate 
table used by the XLAT instruction). 


LDS (load pointer using DS) transfers 
a 32-bit 
pointer variable from the source operand, 
which 
must be a memory operand, 
to the destination 
operand and register DS. The offset word of the 
pointer is transferred 
to the destination operand, 
which may be any 16-bit general register. The seg- 
ment word of the pointer is transferred to register 
DS. Specifying SI as the destination operand is a 
convenient way to prepare 
to process a source 
string that is not in the current 
data 
segment 
(string instructions assume that the source string 
is located in the current data segment and that SI 
contains the offset of the string). 


LES (load pointer 
using ES) transfers 
a 32-bit 
pointer variable from the source operand, which 
must be a memory operand, 
to the destination 
operand and register ES. The offset word of the 
pointer is transferred 
to the destination operand, 
which may be any 16-bit general register. The seg- 
ment word of the pointer is transferred to register 
ES. Specifying DI as the destination operand is a 
convenient way to prepare to process a destina- 
tion string that is not in the current extra segment. 
(The destination 
string must be located 
in the 
extra segment, and DI must contain the offset of 
the string.) 


LAHF (load register AH from flags) copies SF, 
ZF, AF, PF and CF (the 8080/8085 
flags) into 
bits 7, 6, 4, 2 and 0, respectively, of register AH 


(see figure 2-32). The content of bits 5, 3 and 1 is 
undefined; the flags themselves are not affected. 
LAHF 
is provided 
primarily 
for 
converting 
8080/8085 
assembly language programs to run on 
an 8086 or 8088. 


SAHF (store register AH into flags) transfers bits 
7,6,4,2 
and 0 from register AH into SF, ZF, AF, 
PF 
and 
CF, 
respectively, 
replacing 
whatever 
values these flags previously had. OF, DF, IF and 
TF are not affected. This instruction is provided 
for 8080/8085 
compatibility. 


PUSHF decrements SP (the stack pointer) by two 
and then transfers all flags to the word at the top 
of stack pointed to by SP (see figure 2-32). The 
flags themselves are not affected. 


POPF transfers specific bits from the word at the 
current top of stack (pointed to by register SP) 
into 
the 
8086/8088 
flags, 
replacing 
whatever 
values the flags previously contained 
(see figure 
2-32). SP is then incremented by two to point to 
the new top of stack. PUSHF and POPF allow a 
procedure to save and restore a calling program's 
flags. They also allow a program to change the 
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U = UNDEFINED; VALUE IS INDETERMINATE 
o = OVERFLOW FLAG 
D = DIRECTION FLAG 
I = INTERRUPT ENABLE FLAG 
T = TRAP FLAG 
S = SIGN FLAG 
Z = ZERO FLAG 
A = AUXILIARY CARRY FLAG 
P = PARITY FLAG 
C = CARRY FLAG 


setting of TF (there is no instruction for updating 
this flag directly). The change is accomplished by 
pushing the flags, altering bit 8 of the memory- 
image and then popping the flags. 


8086 and 8088 arithmetic 
operations 
(table 2-9) 
may be performed 
on four types of numbers: 


unsigned 
binary, 
signed 
binary 
(integers), 


unsigned packed decimal and unsigned unpacked 
decimal (see table 2-10). Binary numbers may be 8 
or 16 bits long. Decimal numbers are stored in 
bytes, two digits per byte for packed decimal and 
one digit per byte for unpacked decimal. The pro- 
cessor always assumes that the operands specified 
in arithmetic instructions contain data that repre- 
sent valid numbers 
for the type of instruction 
being 
performed. 
Invalid 
data 
may 
produce 
unpredictable results. 


ADDITION 


ADD 
Add byte or word 
ADC 
Add byte or word with carry 
INC 
Increment byte or word by 1 
AAA 
ASCII adjust for addition 
DAA 
Decimal adjust for addition 


SUBTRACTION 


SUB 
Subtract byte or word 


. SBB 
Subtract byte or word with 
borrow 
DEC 
Decrement byte or word by 1 
NEG 
Negate byte or word 
CMP 
Compare byte or word 
AAS 
ASCII adjust for subtraction 
DAS 
Decimal adjust for subtraction 
MULTIPLICATION 


MUL 
Multiply byte or word unsigned 
IMUL 
Integer multiply byte or word 
AAM 
ASCII adjust for multiply 
DIVISION 


DIV 
Divide byte or word unsigned 
IDIV 
Integer divide byte or word 
AAD 
ASCII adjust for division 
CBW 
Convert byte to word 
CWO 
Convert word to doubleword 


BIT PATTERN 
UNSIGNED 
SIGNED 
UNPACKED 
PACKED 
HEX 
BINARY 
BINARY 
DECIMAL 
DECIMAL 


07 
000 
0 0 1 1 1 
7 
+7 
7 
7 


89 
1 0001001 
137 
-119 
invalid 
89 


C5 
1 1000101 
197 
-59 
invalid 
invalid 


Unsigned binary numbers may be either 8 or 16 
bits long; all bits are considered in determining a 
number's magnitude. The value range of an 8-bit 
unsigned 
binary 
number 
is 0-255; 16 bits can 
represent values from 0 through 
65,535. Addi- 


tion, 
subtraction, 
multiplication' 
and 
division 
operations 
are 
available 
for 
unsigned 
binary 
numbers. 


Signed binary numbers (integers) may be either 8 
or 16 bits long. The high-order (leftmost) bit is 
interpreted as the number's sign: 0 = positive and 
I = negative. Negative numbers are represented 
in standard 
two's 
complement 
notation. 
Since 


the high-order bit is used for a sign, the range of 
an 8-bit integer is -128 
through 
+127; 
16-bit 


integers 
may 
range 
from 
-32,768 
through 
+32,767. 
The value zero 
has a positive 
sign. 
Multiplication 
and division operations 
are pro- 
vided for signed binary numbers. 
Addition and 


subtraction 
are 
performed 
with 
the 
unsigned 
binary 
instructions. 
Conditional 
jump 
instruc- 
tions, 
as well as an "interrupt 
on overflow" 
instruction, 
can be used following an unsigned 
operation 
on an integer to detect overflow into 
the sign bit. 


Packed decimal numbers are stored as unsigned 
byte quantities. The byte is treated as having one 
decimal digit in each half-byte (nibble); the digit 
in the high-order half-byte is the most significant. 
Hexadecimal 
values 0-9 are valid in each half- 
byte, and the range of a packed decimal number is 
0-99. Addition and subtraction 
are performed in 
two steps. First an unsigned binary instruction is 
used to produce an intermediate result in register 
AL. Then an adjustment 
operation is performed 


which changes the intermediate value in AL to a 
final correct packed decimal result. Multiplica- 
tion and division adjustments 
are not available 


for packed decimal numbers. 


Unpacked 
decimal 
numbers 
are stored 
as un- 
signed byte quantities. 
The magnitude 
of the 


number is determined 
from the low-order half- 
byte; hexadecimal 
values 0-9 are valid and are 


interpreted 
as decimal numbers. 
The high-order 


half-byte must be zero for multiplication and divi- 
sion; it may contain any value for addition and 
subtraction. 
Arithmetic 
on 
unpacked 
decimal 


numbers is performed in two steps. The unsigned 
binary addition, 
subtraction 
and multiplication 


operations 
are used to produce an intermediate 


result in register AL. An adjustment 
instruction 


then changes the value in AL to a final correct 
unpacked decimal number. Division is performed 
similarly, except that the adjustment is carried out 
on the numerator 
operand 
in register AL first, 


then a following unsigned binary division instruc- 
tion produces a correct result. 


Unpacked 
decimal 
numbers 
are similar to the 


ASCII character representations 
of the digits 0-9. 


Note, however, that the high-order 
half-byte of 


an 
ASCII 
numeral 
is always 
3H. 
Unpacked 


decimal arithmetic may be performed 
on ASCII 


numeric 
characters 
under 
the 
following 


conditions: 


• 
the 
high-order 
half-byte 
of 
an 
ASCII 
numeral 
must 
be 
set 
to 
OH 
prior 
to 


multiplication or division. 


• 
unpacked 
decimal 
arithmetic 
leaves 
the 


high-order half-byte set to OH; it must be set 
to 3H to produce a valid ASCII numeral. 


The 8086/8088 
arithmetic 
instructions 
post cer- 


tain characteristics 
of the result of the operation 


to six flags. Most of these flags can be tested by 
following the arithmetic instruction with a condi- 
tional jump instruction; 
the INTO (interrupt 
on 


overflow) 
instruction 
also 
may 
be used. 
The 


various instructions affect the flags differently, as 
explained 
in 
the 
instruction 
descriptions. 
However, they follow these general rules: 


• 
CF (carry flag): If an addition 
results in a 
carry out of the high-order bit of the result, 
then CF is set; otherwise CF is cleared. If a 
subtraction results in a borrow into the high- 
order bit of the result, then CF is set; other- 
wise CF is cleared. Note that a signed carry is 
indicated by CF *" OF. CF can be used to 
detect an unsigned overflow. 
Two instruc- 
tions, ADC (add with carry) and SBB (sub- 
tract with borrow), incorporate the carry flag 
in their operations 
and can be used to per- 
form multibyte (e.g., 32-bit, 64-bit) addition 
and subtraction. 


• 
AF (auxiliary 
carry 
flag): 
If an addition 
results in a carry out of the low-order half- 
byte of the result, then AF is set; otherwise 
AF is cleared. If a subtraction 
results in a 
borrow into the low-order half-byte of the 
result, 
then 
AF 
is set; 
otherwise 
AF 
is 
cleared. The auxiliary carry flag is provided 
for 
the 
decimal 
adjust 
instructions 
and 
ordinarily is not used for any other purpose. 


• 
SF 
(sign 
flag): 
Arithmetic 
and 
logical 
instructions 
set the sign flag equal to the 
high-order bit (bit 7 or 15) of the result. For 
signed binary numbers, the sign flag will be 0 
for positive results and 1 for negative results 
(so long as overflow does not occur). A con- 
ditional jump instruction can be used follow- 
ing addition or subtraction 
to alter the flow 
of the program depending on the sign of the 
result. Programs performing unsigned opera- 
tions typically ignore SF since the high-order 
bit of the result is interpreted as a digit rather 
than a sign. 


• 
ZF (zero flag): If the result of an arithmetic 
or logical operation 
is zero, then ZF is set; 
otherwise ZF is cleared. A conditional jump 
instruction 
can be used to alter the flow of 
the program if the result is or is not zero. 
• 
PF (parity flag): If the low-order eight bits of 
an arithmetic 
or logical result contain 
an 
even number of I-bits, then the parity flag is 
set; otherwise it is cleared. PF is provided for 
8080/8085 
compatibility; 
it also can be used 
to check ASCII characters for correct parity. 


• 
OF 
(overflow 
flag): 
If 
the 
result 
of 
an 
operation is too large a positive number, or 
too small a negative number 
to fit in the 
destination operand (excluding the sign bit), 
then OF is set; otherwise OF is cleared. OF 
thus indicates signed arithmetic overflow; it 
can be tested with a conditional jump or the 
INTO 
(interrupt 
on overflow) 
instruction. 
OF 
may 
be 
ignored 
when 
performing 
unsigned arithmetic. 


The sum of the two operands, which may be bytes 
or words, replaces the destination operand. Both 
operands 
may 
be 
signed 
or 
unsigned 
binary 
numbers (see AAA and DAA). ADD updates AF, 
CF, OF, PF, SF and ZF. 


ADC (Add with Carry) sums the operands, which 
may be bytes or words, adds one if CF is set and 
replaces the destination 
operand with the result. 


Both operands may be signed or unsigned binary 
numbers (see AAA and DAA). ADC updates AF, 
CF, OF, PF, SF and ZF. Since ADC incorporates 
a carry from a previous operation, 
it can be used 
to write routines to add numbers longer than 16 
bits. 


INC 
(Increment) 
adds 
one to the destination 
operand. The operand may be a byte or a word 
and is treated as an unsigned binary number (see 
AAA and DAA). INC updates AF, OF, PF, SF 
and ZF; it does not affect CF. 


AAA (ASCII Adjust for Addition) 
changes the 
contents 
of register 
AL 
to a valid 
unpacked 
decimal 
number; 
the 
high-order 
half-byte 
is 
zeroed. AAA updates AF and CF; the content of 
OF, PF, SF and ZF is undefined following execu- 
tionof AAA. 


DAA (Decimal Adjust for Addition) corrects the 
result 
of previously 
a<;\ding two valid packed 
decimal operands (the destination 
operand must 
have been register AL). DAA changes the content 
of AL to a pair of valid packed decimal digits. It 
updates AF, CF, PF, SF and ZF; the content of 
OF is undefined following execution of DAA. 


The 
source 
operand 
is $ubtracted 
from 
the 


destination 
operand, 
and the result replaces the 


destination operand. The operands may be bytes 
or words. 
Both 
operands 
may 
be signed 
or 


unsigned binary numbers 
(see AAS and DAS). 


SUB updates AF, CF, OF, PF, SF and ZF. 


SBB (Subtract with Borrow) subtracts the source 
from the destination, 
subtracts one if CF is set, 


and returns the result to the destination operand.· 
Both operands 
may be bytes or words. 
Both 


operands 
may 
be signed 
or 
unsigned 
binary 


numbers (see AAS and DAS). SBB updates AF, 
CF, OF, PF, SF and ZF. Since it incorporates 
a 


borrow from a previous operation, 
SBB may be 


used to write routines 
that 
subtract 
numbers 


longer than 16 bits. 


DEC (Decrement) subtracts one from the destina- 
tion, 
which may be a byte or a word. 
DEC 


updates AF, OF, PF, .SF,and 
ZF; it does. not 


affect CF. 


NEO (Negate) subtracts the destination operand, 
which may be a byte or a word, 
from 0 and 


returns the result to the destination. 
This forms 


the two's complement of the number, effectively 
reversing the sign of an integer; If the operand is 
zero, its sign is not changed. Attempting to negate 
a byte containing 
-128 or a word containing 


- 32,768 causes no change to the operand and sets 
OF. NEO updates AF, CF, OF, PF, SF and ZF. 
CF is always set except when the operand is zero, 
in which case it is cleared. 


CMP (Compare) 
subtracts 
the source from the 


destination, 
which may be bytes or words, but 


does not return 
the result. 
The operands 
are 


unchanged, but the flags are updated and can be 
tested by a subsequent conditional jump instruc- 
tion. CMP updates AF, CF, OF, PF, SF and ZF. 
The comparison reflected in the flags is that of the 
destination to the source. If a CMP instruction is 
followed by a JO (jump.if greater) instruction, for 
example, 
the jump 
is taken 
if the destination 


operand is greater than the source operand. 


AAS (ASCII Adjust for Subtraction) 
corrects the 


result of a previous 'subtraction 
of two valid 


unpacked 
decimal 
operands 
(the 
destination 


operand must have been specified as register AL). 
AAS 
changes 
the 
content 
of AL 
to a valid 


unpacked 
decimal number; 
the high-order 
half- 


byte is zeroed. AAS updates AF and CF; the con- 
tent of OF, PF, SF and ZF is undefined following 
execution of AAS. 


DAS (Decimal Adjust for Subtraction) 
corrects 


the result of a previous subtraction 
of two valid 


packed 
decimal 
operands 
(the 
destination 


operand must have been speCified as register AL). 
DAS changes the content of AL to a pair of valid 
packed decimal digits. DAS updates AF, CF, PF, 
SF and 
ZF; 
the content 
of OF is undefined 


following execution of DAS. 


MUL (Multiply) 
performs 
an unsigned 
multi- 


plication 
of 
the 
source 
operand 
and 
the 


accumulator. 
If the source is a byte, then it is 


multiplied by register AL, and the double-length 


result is returned 
in AH and AL. If the source 
operand is a word, then it is multiplied by register 
AX, and the double-length 
result is returned 
in 
registers DX and AX. The operands are treated as 
unsigned binary numbers (see AAM). If the upper 
half of the result (AH for byte source, DX for 
word source) is nonzero, 
CF and OF are set; 
otherwise they are cleared. When CF and OF are 
set, they indicate that AH or DX contains signifi- 
cant digits of the result. The content of AF, PF, 
SF and ZF is undefined 
following execution of 


MUL. 


IMUL 
(Integer 
Multiply) 
performs 
a 
signed 
multiplication 
of the source 
operand 
and the 


accumulator. 
If the source is a byte, then it is 
multiplied by register AL, and the double-length 
result is returned in AH and AL. If the source is a 
word, then it is multiplied by register AX, and the 
double-length 
result is returned 
in registers DX 


and AX. If the upper half of the result (AH for 
byte source, DX for word source) is not the sign 
extension of the lower half of the result, CF and 
OF are set; otherwise they are cleared. When CF 
and OF are set, they indicate that AH or DX con- 
tains significant digits of the result. The content 
of AF, PF, SF and ZF is undefined 
following 


execution of IMUL. 


AAM (ASCII Adjust for Multiply) corrects the 
result of a previous multiplication 
of two valid 


unpacked 
decimal 
operands. 
A 
valid 
2-digit 


unpacked 
decimal number 
is derived from the 


content of AH and AL and is returned to AH and 
AL. The high-order 
half-bytes of the multiplied 


operands must have been OH for AAM to pro- 
duce a correct result. AAM updates PF, SF and 
ZF; the content of AF, CF and OF is undefined 
following executi(;lllof AAM. 


DIV (divide) performs an unsigned division of the 
accumulator 
(and its extension) 
by the source 


operand. 
If the source operand 
is a byte, it is 


divided into the double-length 
dividend assumed 


to be in registers AL and AH. The single-length 
quotient is returned in AL, and the single-length 
remainder 
is returned 
in AH. 
If 
the 
source 


operand is a word, it is divided into the double- 
length dividend 
in registers AX and DX. The 


single-length quotient is returned in AX, and the 
single-length remainder is returned in DX. If the 
quotient 
exceeds the capacity of its destination 


register (FFH for byte source, FFFFFH for word 
source), as when division by zero is attempted, 
a 


type 0 interrupt is generated, and the quotient and 
remainder are undefined. 
Nonintegral 
quotients 


are truncated to integers. The content of AF, CF, 
OF, PF, SF and ZF is undefined following execu- 
tion of DIV. 


IDIV (Integer Divide) performs a signed division 
of the accumulator 
(and its extension) 
by the 


source operand. If the source operand is a byte, it 
is 
divided 
into 
the 
double-length 
dividend 


assumed to be in registers AL and AH; the single- 
length quotient is returned in AL, and the single- 
lengtjl remainder is returned in AH. For byte in- 
teger division, the maximum positive quotient is 
+127 (7FH) and the minimum negative quotient is 
-127 (8IH). If the source operand is a word, it is 
divided 
into 
the 
double-length 
dividend 
in 


registers AX and DX; the single-length quotient is 
returned in AX, and the single-length remainder 
is returned in DX. For word integer division, the 
maximu'm positive quotient 
is +32,767 (7FFFH) 


and the minimum 
negative quotient 
is -32,767 


(8001H). If the quotient 
is positive and exceeds 


the maximum, or is negative and is less than the 
minimum, 
the 
quotient 
and 
remainder 
are 


undefined, and a type 0 interrupt is generated. In 
particular, 
this 
occurs 
if 
division 
by 
0 
is 


attemnted. 
Nonintegral 
quotients 
are truncated 


(toward 0) to integers, and the remainder has the 
same sign as the dividend. 
The content of AF, 


CF, OF, PF, Sf and ZF is undefined 
following 


IDIV. 


AAD (ASCII Adjust for Division) modifies the 
numerator 
in AL 
before 
dividing 
two 
valid 


unpacked decimal operands so that the quotient 
produced by the division will be a valid unpacked 
decimal number. AH must be zero for the subse- 


quent DIV to produce the correct result. The quo- 
tient is returned 
in AL, and the remainder 
is 
returned 
in AH; both high-order 
half-bytes are 


zeroed. AAD updates PF, SF and ZF; the content 
of AF, CF and OF is undefined following execu- 
tion of AAD. 


CBW (Convert Byte to Word) extends the sign of 
the byte in register AL throughout 
register AH. 
CBW does not affect any flags. CBW can be used 
to produce a double-length (word) dividend from 
a byte prior to performing byte division. 


CWD (Convert Word to Doubleword) extends the 
sign of the 
word 
in register 
AX 
throughout 


register OX. CWO does not affect 
any flags. 


CWO can be used to produce 
a double-length 


(doubleword) dividend from a word prior to per- 
forming word division. 


The 
8086 and 
8088 provide 
three 
groups 
of 


instructions 
(table 2-11) for manipulating 
bits 
within both bytes and words: logical, shifts and 
rotates. 


LOGICALS 
NOT 
"Not" byte or word 
AND 
"And" byte or word 
OR 
"Inclusive or" byte or word 


XOR 
"Exclusive or" byte or word 


TEST 
"Test" byte or word 


SHIFTS 
SHLISAL 
Shift logical/arithmetic 
left 
byte or word 
SHR 
Shift logical right byte or word 
SAR 
Shift arithmetic right byte or 
word 


ROTATES 
ROL 
Rotate left byte or word 
ROR 
Rotate right byte or word 
RCL 
Rotate through carry left byte 
orword 
RCR 
Rotate through carry 'right byte 
or word 


The 
logical 
instructions 
include 
the 
boolean 


operators 
"not," 
"and," 
"inclusive 
or," 
and 


"exclusive or," 
plus a TEST instruction that sets 


the flags, but does not alter either of its operands. 


AND, OR, XOR and TEST affect the flags as 
follows: The overflow (OF) and carry (CF) flags 
are always cleared by logical instructions, and the 
content of the auxiliary carry (AF) flag is always 
undefined 
following 
execution 
of 
a 
logical 
instruction. 
The sign (SF), zero (ZF) and parity 


(PF) flags are always posted to reflect the result of 
the operation 
and can be tested by conditional 


jump 
instructions. 
The interpretation 
of these 


flags is the same as for arithmetic instructions. SF 
is set if the result is negative (high-order bit is I), 
and is cleared if the result is positive (high-order 
bit is 0). ZF is set if the result is zero, cleared 
otherwise. PF is set if the result contains an even 
number of I-bits (has even parity) and is cleared if 
the number of I-bits is odd (the result has odd 
parity). Note that NOT has no effect on the flags. 


NOT inverts the bits (forms the one's comple- 
ment) of the byte or word operand. 


AND performs 
the logical "and" 
of the two 


operands (byte or word) and returns the result to 
the destination operand. A bit in the result is set if 
both corresponding 
bits of the original operands 


are set; otherwise the bit is cleared. 


OR performs the logical "inclusive or" of the two 
operands (byte or word) and returns the result to 
the destination operand. A bit in the result is set if 
either or both corresponding 
bits in the original 
operands 
are 
set; 
otherwise 
the 
result 
bit 
is 
cleared. 


XOR (Exclusive Or) performs the logical "exclu- 
sive or" 
of the two operands 
and returns 
the 


result to the destination 
operand. 
A bit in the 


result 
is set if the corresponding 
bits of the 
original operands contain opposite values (one is 
set, the other is cleared); otherwise the result bit is 
cleared. 


TEST performs 
the logical "and" 
of the two 
operands (byte or word), updates the flags, but 
does not return the result, i.e., neither operand is 
changed. If a TEST instruction 
is followed by a 
JNZ (jump if not zero) instruction, 
the jump will 
be taken if there are any corresponding 
I-bits in 
both operands. 


, 


The bits in bytes and 
words 
may be shifted 
arithmetically 
or logically. Up to 255 shifts may 
be performed, according to the value of the count 
operand coded in the instruction. 
The count may 
be specified as the con~tant I, or as register CL, 
allowing the shift count to be a variable supplied 
at execution time. Arithmetic shifts may be used 
to multiply and divide binary numbers by powers 
of two (see note in description of SAR). Logical 
shifts can be used to isolate bits in bytes or words. 


Shift instructions affect the flags as follows. AF is 
always undefined following a shift operation. PF, 
SF and ZF are updated normally, as in the logical 
instructions. 
CF always contains the value of the 
last bit shifted out of the destination 
operand. 
The content of OF is always undefined following 
a multibit shift. In a single-bit shift, OF is set if 
the value of the high-order (sign) bit was changed 
by the operation; if the sign bit retains its original 
value, OF is cleared. 


SHL/SAL destination, count 


SHL 
and 
SAL 
(Shift 
Logical 
Left 
and 
Shift 
Arithmetic Left) .perform the same operation and 
are physically the same instruction: 
The destina- 
tion byte or word is shifted left by the number of 
bits specified in the count 
operand. 
Zeros are 
shifted in on the right. If the sign bit retains its 
original value, then OF is cleared. 


SHR (Shift Logical Right) shifts the bits in the 
destination operand (byte or word) to the right by 


the number of bits specified in the count operand. 
Zeros are shifted in on the left. If the sign bit 
retains its original value, then OF is cleared. 


SAR (Shift Arithmetic Right) shifts the bits in the 
destination operand (byte or word) to the right by 
the number of bits specified in the count operand. 
Bits equal to the original high-order (sign) bit are 
shifted in on the left, preserving the sign of the 
original value. Note that SAR does not produce 
the 
same 
result 
as 
the 
dividend 
of 
an 
"equivalent" 
IDIV instruction 
if the destination 
operand is negative and I-bits are shifted out. For 
example, shifting -5 right by one bit yields -3, 
while integer division of -5 by 2 yields -2. The 
difference in the instructions 
is that ID.IV trun- 
cates all numbers toward zero, while SAR trun- 
cates positive numbers toward zero and negative 
numbers toward negative infinity. 


Bits in bytes and words also may be rotated. Bits 
rotated 
out of an operand 
are not lost as in a 
shift, but are "circled" 
back into the other "end" 
of the operand. 
As in the shift instructions, 
the 
number of bits to be rotated 
is taken from the 
count operand, 
which may specify either a con- 
stant of I, or the CL register. The carry flag may 
act as an extension of the operand in two of the 
rotate instructions, allowing a bit to be isolated in 
CF and then tested by a JC (jump if carry) or JNC 
(jump if not carry) instruction. 


Rotates affect only the carry and overflow flags. 
CF always contains 
the value of the last bit 
rotated out. On multibit rotates, the value of OF 
is always undefined. 
In single-bit rotates, 
OF is 
set if the operation changes the high-order (sign) 
bit of the destination 
operand. 
If the sign bit 
retains its original value, OF is cleared. 


ROL (Rotate Left) rotates the destination byte or 
word left by the number of bits specified in the 
count operand. 


ROR (Rotate 
Right) 
operates 
similar 
to ROL 
except that the bits in the destination byte or word 
are rotated right instead of left. 


RCL (Rotate through Carry Left) rotates the bits 
in the byte or word destination operand to the left 
by the number 
of bits specified 
in the count 
operand. The carry flag (CF) is treated as "part 
of" the destination 
operand; 
that is, its value is 
rotated into the low-order bit of the destination, 
and itself is replaced by the high-order bit of the 
destination. 


RCR 
(Rotate 
through 
Carry 
Right) 
operates 
exactly like RCL except that the bits are rotated 
right instead of left. 


Five basic string operations, 
called primitives, 
allow strings of bytes or words to be operated on, 
one element (byte or word) at a time. Strings of 
up to 64k bytes may be manipulated 
with these 
instructions. 
Instructions 
are available to move, 
compare and scan for a value, as well as for mov- 
ing string elements to and from the accumulator 
(see table 2-12). These basic operations 
may be 
preceded by a special one-byte prefix that causes 
the instruction 
to be repeated by the hardware, 
allowing long strings to be processed much faster 
than would be possible with a software loop. The 
repetitions can be terminated by a variety of con- 
ditions, and a repeated operation 
may be inter- 
rupted and resumed. 


The string instructions 
operate quite similarly in 
many respects; 
the common 
characteristics 
are 
covered here and in table 2-13 and figure 2-33 
rather than in the descriptions 
of the individual 
instructions. 
A string 
instruction 
may have a 
source operand, 
a destination 
operand, 
or both. 
The hardware assumes that a source string resides 
in the current data segment; a segment prefix byte 
may be used to override 
this assumption. 
A 
destination string must be in the current, extra seg- 
ment. The assembler checks the attributes of the 


operands 
to determine 
if the elements 
of the 
strings are bytes or words. The assembler does 
not, however, use the operand names to address 
the strings. 
Rather, 
the content 
of register SI 
(source index) is used as an offset to address the 
current element of the source string, and the con- 
tent of register DI (destination 
index) is taken as 
the offset of the current 
destination 
string ele- 
ment. These registers must be initialized to point 
to the source/destination 
strings before executing 
the string instruction; 
the LDS, LES and LEA 
instructions are useful in this regard. 


REP 
Repeat 


REPE/REPZ 
Repeat while equal/zero 


REPNE/REPNZ 
Repeat while not 


'eguall not zero 


MOVS 
Move byte or word string 


MOVSB/MOVSW 
Move byte or word string 


CMPS 
Compare byte or word 
string 


SCAS 
Scan byte or word string 


LODS 
Load byte or word string 


ST08 
Store byte or word string 


Table 2-13. String Instruction 
Register and 
Flag Use 


Index (offset) for source string 


Index (offset) for destination 
string 


Scan value 
Destination for LODS 
Source for STOS 


0= auto-increment 
SI, 01 
1 = auto-decrement 
SI, 01 


{ 


SI/DI.CX 
. 


AND 
OF WOULD 
TYPICALLY 
BE 
INITIALIZED 
HERE' 


STRING 
OF 
DELTA 


BYTE 
0 
1 
BYTE 
1 
-1 
WORD 
0 
2 
WORD 
1 
-2 


PREFIX 
Z 


REPE 
1 
REPZ 
1 
REPNE 
0 
REPNZ 
0 
, 


r-----l 


I 
INST~~~\ION 
I 
L 
J 


The string instructions 
automatically 
update 
SI 


and/or 
01 in anticipation 
of processing the next 


string element. The setting of OF (the direction 
flag) determines 
whether the index registers are 


auto-incremented 
(OF = 0) or auto-decremented 


(OF = 1). If byte strings are being processed, SI 
and/or DI is adjusted by1; the adjustment is 2 for 
word strings. 


If a Repeat prefix has been coded, then register 
CX (count register) is decremented by 1after each 
repetition of the string instruction; 
therefore, CX 


must be initialized to the number of repetitions 
desired before the string instruction is executed. If 
CX is 0, the string instruction is not executed, and 
control goes to the following instruction. 


Section 2.10 contains examples that illustrate the 
use of all the string instructions. 


Repeat, Repeat While Equal, Repeat While Zero, 
Repeat While Not Equal and Repeat While Not 
Zero are five mnemonics 
for two forms of the 


prefix byte that controls 
repetition 
of a subse- 


quent string instruction. The different mnemonics 
are provided 
to improve 
program 
clarity. The 


repeat prefixes do not affect the flags. 


REP 
is used in conjunction 
with the MOYS 


(Move String) and STOS (Store String) instruc- 
tions and is interpreted as "repeat while not end- 
of-string" 
(CX not 0). REPE and REPZ operate 


identically and are physically the same prefix byte 
as REP. 
These instructions 
are used with the 


CMPS (Compare String) and SCAS (Scan String) 
instructions 
and 
require 
ZF 
(posted 
by these 


instructions) 
to be set before initiating the next 


repetition. 
REPNE 
and 
REPNZ 
are 
two 


mnemonics 
for 
the 
same 
prefix 
byte. 
These 


instructions 
function 
the same as REPE 
and 


REPZ except that the zero flag must be cleared or 
the repetition 
is terminated. 
Note that ZF does 
not need to be initialized before executing the 
repeated string instruction. 


Repeated string sequences are interruptable; 
the 


processor will recognize the interrupt before pro- 
cessing the next string element. System interrupt 
processing 
is not 
affected 
in any way. Upon 


return from the interrupt, 
the repeated operation 


is resumed from the point of interruption. 
Note, 


however, that execution does not resume properly 


if a second or third prefix (i.e., segment override 
or LOCK) has been specified in addition to any of 
the repeat prefixes. The processor "remembers" 
only one prefix in effect at the time of the inter- 
rupt, 
the prefix that 
immediately 
precedes the 


string instruction. 
After returning from the inter- 


rupt, processing resumes at this point, 
but any 


additional prefixes specified are not in effect. If 
more than one prefix must be used with a string 
instruction, 
interrupts 
may be disabled 
for the 


duration of the repeated execution. However, this 
will not prevent a non-maskable 
interrupt 
from 


being recognized. Also, the time that the system is 
unable to respond to interrupts may be unaccept- 
able if long strings are being processed. 


MOYS (Move String) transfers a byte or a word 
from the source string (addressed by SI) to the 
destination string (addressed by 01) and updates 
SI and 01 to point to the next string element. 
When used in conjunction with REP, MOYS per- 
forms a memory-to-memory 
block transfer. 


These are alternate 
mnemonics 
for 
the move 
string instruction. 
These mnemonics 
are coded 


without 
operands; 
they 
explicitly 
tell 
the 


assembler that a byte string (MOYSB) or a word 
string (MOYSW) is to be moved (when MOYS is 
coded, the assembler determines the string type 
from 
the 
attributes 
of 
the 
operands). 
These 


mnemonics are useful when the assembler cannot 
determine the attributes of a string, e.g., a section 
of code is being moved. 


CMPS (Compare String) subtracts the destination 
byte or word (addressed by 01) from the source 
byte or word (addressed by SI). CMPS affects the 
flags but does not alter either operand, updates SI 
and 01 to point to the next string element and 
updates AF, CF, OF, PF, SF and ZF to reflect the 
relationship 
of the destination 
element 
to the 
source element. For example, if a 1G (Jump if 
Greater) instruction 
follows CMPS, the jump is 


taken if the destination element is greater than the 
source element. If CMPS is prefixed with REPE 
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strings are equal (ZF = 1)." If CMPS is preceded 
by REPNE 
or REPNZ, 
the operation 
is inter- 
preted as "compare 
while not end-of-string 
(CX 
not zero) and strings are not equal (ZF = 0)." 
Thus, CMPS can be used to find matching or dif- 
fering string elements. 


SCAS 
(Scan 
String) 
subtracts 
the destination 


string element (byte or word) addressed 
by Dr 


from the content of AL (byte string) or AX (word 
string) and updates the flags, but does not alter 
the destination 
string or the accumulator. 
SCAS 


also updates Dl to point to the next string element 
and AF, CF, OF, PF, SF and ZF to reflect the 
relationship 
of the scan value in ALl AX to the 


string element. If SCAS is prefixed with REPE or 
REPZ, the operation is interpreted as "scan while 
not end-of-string (CX not 0) and string-element = 
scan-value (ZF = 1)." This form may be used to 
scan for departure from a given value. If SCAS is 
prefixed with REPNE or REPNZ, 
the operation 


is interpreted 
as "scan 
while not end-of-string 
(CX not 0) and string-element 
is not equal to 


scan-value (ZF = 0)." This form may be used to 
locate a value in a string. 


LODS (Load String) transfers the byte or word 
string element addressed by Sl to register AL or 
AX, and updates Sl to point to the next element 
in the string. This instruction 
is not ordinarily 
repeated since the accumulator 
would be over- 


written by each repetition, 
and only the last ele- 


ment would be retained. However, LODS is very 
useful in software loops as part of a more com- 
plex 
string 
function 
built 
up 
from 
string 


primitives and other instructions. 


STOS (Store String) transfers a byte or word from 
register AL or AX to the string element addressed 
by Dl and updates Dl to point to the next location 
in the string. As a repeated operation, STOS pro- 
vides a convenient way to initialize a string to a 
constant value (e.g., to blank out a print line). 


The sequence of execution of instructions 
in an 


8086/8088 
program is determined by the content 


of the code segment register (CS) and the instruc- 
tion pointer (IP). The CS register contains 
the 


base address of the current code segment, the 64k 
portion of memory from which instructions 
are 


presently being fetched. The lP is used as an off- 
set from the beginning of the code segment; the 
combination 
of CS and lP points to the memory 


location from which the next instruction is to be 
fetched. (Recall that under most operating condi- 
tions, 
the next instruction 
to be executed 
has 
already been fetched from memory and is waiting 
in the CPU 
instruction 
queue.) 
The program 


transfer 
instructions 
operate 
on the instruction 


pointer and on the CS register; changing the con- 
tent of these causes normal sequential execution 
to be altered. When a program 
transfer occurs, 


the queue no longer contains the correct instruc- 
tion, and the BlU obtains 
the next instruction 


from memory using the new lP and CS values, 
passes the instruction directly to the EU, and then 
begins refilling the queue from the new location. 


Four groups of program transfers are available in 
the 
8086/8088 
(see table 
2-14): unconditional 


transfers, conditional 
transfers, 
iteration control 


instructions 
and 
interrupt-related 
instructions. 


Only the interrupt-related 
instructions affect any 


CPU flags. As will be seen, however, the execu- 
tion of many of the program transfer instructions 
is affected by the states of the flags. 


The 
unconditional 
transfer 
instructions 
may 


transfer control to a target instruction within the 
current code segment (intrasegment 
transfer) 
or 


to 
a 
different 
code 
segment 
(intersegment 


transfer). 
(The 
ASM-86 
assembler 
terms 
an 
intrasegment 
target NEAR and an intersegment 


target FAR.) The transfer is made uncondition- 
ally any time the instruction is executed. 


CALL activates an out-of-line procedure, 
saving 


information on the stack to permit a RET (return) 
instruction 
in the procedure 
to transfer 
control 
back to the instruction following the CALL. The 


UNCONDITIONAL TRANSFERS 


CALL 
Call procedure 
RET 
Return from procedure 
JMP 
Jump 


CONDITIONAL TRANSFERS 


JA/JNBE 
Jump 
if above/ 
not below 
nor equal 
JAE/JNB 
Jump 
if above or 
equal/not 
below 
JB/JNAE 
Jump 
if below/not 
above 
nor equal 
JBE/JNA 
Jump 
if below or 
equal/not 
above 
JC 
Jump 
if carry 
JE/JZ 
Jump 
if equal/zero 
JG/JNLE 
Jump 
if greater/not 
less 
nor equal 
JGE/JNL 
Jump 
if greater 
or 
equal/ not less 
JLlJNGE 
Jump 
if less/not 
greater 
nor equal 
JLE/JNG 
Jump 
if less or equal/ not 
greater 
JNC 
Jump 
if not carry 
JNE/JNZ 
Jump 
if not equal/not 
zero 
JNO 
Jump 
if not overflow 
JNP/JPO 
Jump 
if not parity/parity 
odd 
JNS 
Jump 
if not sign 
JO 
Jump 
if overflow 
JP/JPE 
J ump if parity / parity 
even 
JS 
Jump 
if sign 


ITERATION CONTROLS 


LOOP 
Loop 
LOOPE/LOOPZ 
Loop if equal/zero 
LOOPNE/LOOPNZ 
Loop if not equal/ not 
zero 
JCXZ 
Jump 
if register 
CX = 0 


INTERRUPTS 


INT 
Interrupt 
INTO 
Interrupt 
if overflow 
IRET 
Interrupt 
return 


assembler 
generates 
a different 
type 
of 
CALL 
instruction 
depending 
on whether 
the program- 
mer has defined 
the procedure 
name 
as NEAR 
or 
FAR. 
For control 
to return 
properly, 
the type of 
CALL 
instruction 
must 
match 
the type 
of RET 
instruction 
that 
exits 
from 
the 
procedure. 
(The 
potential 
for a mismatch 
exists 
if the procedure 
and 
the 
CALL 
are 
contained 
in 
separately 
assembled 
programs.) 
Different 
forms 
of 
the 
CALL 
instruction 
al10w the address 
of the target 
procedure 
to 
be obtained 
from 
the 
instruction 
itself 
(direct 
CALL) 
or from 
a memory 
location 
or register 
referenced 
by the instruction 
(indirect 
CALL). 
In 
the 
fol1owing 
descriptions, 
bear 
in 
mind 
that 
the processor 
automatical1y 
adjusts 
IP 
to point 
to the 
next 
instruction 
to be executed 
before 
saving it on the stack. 


For an intrasegment 
direct 
CALL, 
SP (the stack 
pointer) 
is decremented 
by two and 
IP is pushed 
onto 
the stack. 
The 
relative 
displacement 
(up to 
±32k) 
of the 
target 
procedure 
from 
the 
CALL 
instruction 
is 
then 
added 
to 
the 
instruction 
pointer. 
This 
form 
of 
the 
CALL 
instruction 
is 
"self-relative" 
and is appropriate 
for position- 
in- 
dependent 
(dynamical1y 
relocatable) 
routines 
in 
which 
the CALL 
and 
its target 
are in the same 
segment 
and are moved 
together. 


An 
intrasegment 
indirect 
CALL 
may 
be made 
through 
memory 
or 
through 
a register. 
SP 
is 
decremented 
by two 
and 
IP is pushed 
onto 
the 
stack. 
The 
offset 
of 
the 
target 
procedure 
is 
obtained 
from the memory 
word or 16-bit general 
register 
referenced 
in the instruction 
and replaces 
IP. 


For 
an 
intersegment 
direct 
CALL, 
SP 
is 
decremented 
by two, 
and 
CS is pushed 
onto 
the 
stack. 
CSis 
replaced 
by the segment 
word 
con- 
tained 
in the instruction. 
SP again 
is decremented 
by 
two. 
IP 
is 
pushed 
onto 
the 
stack 
and 
is 
replaced 
by 
the 
offset 
word 
contained 
in 
the 
instruction. 


For 
an inter segment 
indirect 
CALL 
(which 
only 
may 
be 
made 
through 
memory), 
SP 
is 
decremented 
by two, 
and 
CS is pushed 
onto 
the 
stack. 
CS is then 
replaced 
by the content 
of the 
second 
word 
of the doubleword 
memory 
pointer 
referenced 
by 
the 
instruction. 
SP 
again 
is 
decremented 
by two, 
and 
IP is pushed 
onto 
the 
stack 
and 
is replaced 
by the content 
of the first 
word of the doubleword 
pointer 
referenced 
by the 
instruction. 


RET (Return) transfers control from a procedure 
back to the instruction 
following the CALL that 
activated the procedure. The assembler generates 
an intrasegment 
RET 
if the programmer 
has 
defined the procedure NEAR, or an intersegment 
RET if the procedure has been defined as FAR. 
RET pops 
the word 
at the top 
of the stack 
(pointed to by register SP) into the instruction 
pointer 
and increments 
SP by two. If RET is 
intersegment, the word at the new top of stack is 
popped 
into the CS register, 
and SP is again 
incremented by two. If an optional pop value has 
been specified, RET adds that value to SP. This 
feature may be used to discard parameters pushed 
onto the stack before the execution of the CALL 
instruction. 


JMP 
unconditionally 
transfers 
control 
to the 
target location. Unlike a CALL instruction, 
JMP 
does not save any information 
on the stack, and 
no return to the instruction following the JMP is 
expected. Like CALL, the address of the target 
operand 
may be obtained 
from the instruction 
itself (direct JMP) or from memory or a register 
referenced by the instruction (indirect JMP). 


An intrasegment direct JMP changes the instruc- 
tion pointer by adding the relative displacement 
of the target from the JMP instruction. 
If the 
assembler can determine that the target is within 
127 bytes of the JMP, it automatically generates a 
two-byte form of this instruction called a SHORT 
JMP; otherwise, it generates a NEAR JMP that 
can address a target within ±32k. Intrasegment 
direct JMPS are self-relative and are appropriate 
in position-independent 
(dynamically relocatable) 
routines in which the JMP and its target are in the 
same segment and are moved together. 


An intrasegment 
indirect 
JMP 
may be made 
either 
through 
memory 
or 
through 
a 
16-bit 
general register. In the first case, the content of 
the word referenced 
by the instruction 
replaces 
the instruction 
pointer. 
In the second case, the 
new IP value is taken from the register named in 
the instruction. 


An intersegment direct JMP replaces IP and CS 
with values contained in the instruction. 


An intersegment indirect JMP may be made only 
through 
memory. 
The 
first 
word 
of 
the 
doubleword pointer referenced by the instruction 
replaces IP, and the second word replaces CS. 


The conditional 
transfer 
instructions 
are jumps 
that mayor 
may not transfer control depending 
on the state of the CPU 
flags at the time the 
instruction is executed. These 18 instructions (see 
table 2-15) each test a different 
combination 
of 
flags for a condition. 
If the condition is "true," 
then control is transferred 
to the target specified 
in the instruction. 
If the condition 
is "false," 
then control passes to the instruction that follows 
the conditional jump. All conditional 
jumps are 
SHORT, that is, the target must be in the current 
code segment and within -128 to + 127 bytes of 
the first byte of the next instruction 
(JMP OOH 
jumps to the first byte of the next instruction). 
Since the jump is made by adding the relative 
displacement 
of 
the 
target 
to the 
instruction 
pointer, all conditional jumps are self-relative and 
are 
appropriate 
for 
position-independent 
routines. 


The iteration control instructions 
can be used to 
regulate the repetition 
of software loops. These 
instructions use the CX register as a counter. Like 
the conditional 
transfers, 
the iteration 
control 
instructions 
are 
self-relative 
and 
may 
only 


transfer to targets that are within -128 to +127 
bytes 
of 
themselves, 
Le., 
they 
are 
SHORT 
transfers. 


LOOP decrements CX by I and transfers control 
to the target operand if CX is not 0; otherwise the 
instruction following LOOP is executed. 


LOOPE 
and LOOPZ 
(Loop 
While Equal and 
Loop While Zero) are different 
mnemonics 
for 
the same instruction 
(similar to the REPE and 


JA/JNBE 
JAE/JNB 
JB/JNAE 
JBE/JNA 
JC 
JE/ JZ 
JG/JNLE 
JGE/JNL 
JLlJNGE 
JLE/JNG 
JNC 
JNE/JNZ 
JNO 
JNP/JPO 
JNS 
JO 
JP/JPE 
JS 


(CF OR ZF)=O 
CF=O 
CF=1 
(CF OR ZF)=1 
CF=1 
ZF=1 
((SF XOR OF) OR ZF)=O 
(SF XOR OF)=O 
(SF XOR OF)=1 
((SF XOR OF) OR ZF)=1 
CF=O 
ZF=O 
OF=O 
PF=O 
SF=O 
OF=1 
PF=1 
SF=1 


above/not 
below nor equal 
above or equal/not 
below 
below/not 
above nor equal 
below or equal/not 
above 
carry 
equal/zero 
greater/not 
less nor equal 


greater 
or equal/not 
less 
less/ not greater 
nor equal 
less or equal/ not greater 
not carry 
not equal/not 
zero 
not overflow 
not parity / parity odd 
not sign 
overflow 
parity / parity equal 
sign 


Note: 
"above" 
and "below" 
refer to the relationship 
of two unsigned 
values; 


"greater" 
and "less" 
refer to the relationship 
of two signed 
values. 


REPZ 
repeat 
prefixes). 
ex 
is decremented 
by I, 
and control 
is transferred 
to the target 
operand 
if 


ex 
is not 0 and if ZF is set; otherwise 
the instruc- 


tion following 
LOOPE/LOOPZ 
is executed. 


LOOPNE 
and LOOPNZ 
(Loop 
While Not Equal 
and Loop 
While Not Zero) 
are also synonyms 
for 
the same 
instruction. 
ex 
is decremented 
by 
I, 


and control 
is transferred 
to the target 
operand 
if 
ex 
is not 0 and if ZF is clear; 
otherwise 
the next 


sequential 
instruction 
is executed. 


JeXZ 
(Jump 
If ex 
Zero) 
transfers 
control 
to the 


target 
operand 
if ex 
is O. This 
instruction 
is 
useful 
at the beginning 
of a loop 
to bypass 
the 
loop 
if ex 
has a zero value, 
i.e., 
to execute 
the 
loop zero times. 


The interrupt 
instructions 
allow 
interrupt 
service 
routines 
to be activated 
by programs 
as well as by 


external 
hardware 
devices. 
The effect 
of software 


interrupts 
is similar 
to hardware-initiated 
inter- 
rupts. 
However, 
the processor 
does 
not 
execute 
an interrupt 
·acknowledge 
bus cycle 
if the inter- 
rupt 
originates 
in software 
or with an NMI. 
The 
effect 
of the interrupt 
instructions 
on the flags is 


covered 
in the description 
of each instruction. 


INT (Interrupt) 
activates 
the interrupt 
procedure 
specified 
by 
the 
interrupt-type 
operand. 
INT 
decrements 
the stack 
pointer 
by two, 
pushes 
the 
flags onto 
the stack, 
and clears 
the trap 
(TF) and 
interrupt-enable 
(IF) 
flags 
to disable 
single-step 
and mask able interrupts. 
The 
flags are stored 
in 
the format 
used by the PUSHF 
instruction. 
SP is 


decremented 
again 
by two, 
and the es 
register 
is 


pushed 
onto 
the stack. 
The address 
of the inter- 
rupt 
pointer 
is 
calculated 
by 
multiplying 


interrupt-type 
by four; 
the second 
word of the in- 
terrupt 
pointer 
replaces 
es. 
SP 
<\gain 
is 


decremented 
by two, 
and 
IP is pushed 
onto 
the 
stack and is replaced 
by the first word of the inter- 
rupt 
pointer. 
If interrupt-type 
= 3, the assembler 


generates 
a short 
(I byte) form 
of the instruction, 


known 
as the breakpoint 
interrupt. 


Software 
interrupts 
can 
be used 
as "supervisor 
calls," 
i.e., requests 
for service from an operating 


system. 
A different 
interrupt-type 
can be used for 
each 
type 
of 
service 
that 
the 
operating 
system 


could 
supply 
for 
an application 
program. 
Soft- 
ware 
interrupts 
also 
may 
be used 
to check 
out 
interrupt 
service procedures 
written 
for hardware- 


initiated 
interrupts. 


INTO 
(Interrupt 
on Overflow) 
generates 
a soft- 


ware 
interrupt 
if the 
overflow 
flag 
(OF) 
is set; 
otherwise 
control 
proceeds 
to 
the 
following 
instruction 
without 
activating 
an 
interrupt 
pro- 
cedure. 
INTO 
addresses 
the target 
interrupt 
pro- 
cedure 
(its type is 4) through 
the interrupt 
pointer 
at location 
IOH; it clears the TF and IF flags and 
otherwise 
operates 
like INT. 
INTO 
may be writ- 


ten following 
an arithmetic 
or logical operation 
to 
activate 
an 
interrupt 
procedure 
if 
overflow 
occurs. 


IRET (Interrupt 
Return) 
transfers 
control 
back to 
the point 
of interruption 
by popping 
IP, CS and 
the 
flags 
from 
the 
stack. 
IRET 
thus 
affects 
all 
flags 
by 
restoring 
them 
to 
previously 
saved 
values. 
IRET 
is 
used 
to 
exit 
any 
interrupt 
procedure, 
whether 
activated 
by 
hardware 
or 


software. 


These instructions 
(see table 2-16) allow programs 


to control 
various 
CPU 
functions. 
One group 
of 
instructions 
updates 
flags, 
and 
another 
group 
is 
used primarily 
for synchronizing 
the 8086 or 8088 
with 
external 
events. 
A final 
instruction 
causes 


the CPU to do nothing. 
Except 
for the flag opera- 


tions, 
none 
of the processor 
control 
instructions 
affect 
the flags. 


CLC (Clear 
Carry 
flag) zeroes 
the carry 
flag (CF) 
and affects 
no other 
flags. It (and CMC and STC) 
is useful 
in conjunction 
with 
the RCL 
and 
RCR 
instructions. 


FLAG OPERATIONS 


STC 
Set carry flag 
CLC 
Clear carry flag 
CMC 
Complement 
carry flag 
STD 
Set direction 
flag 
CLD 
Clear direction 
flag 
STI 
Set interrupt 
enable 
flag 
CLI 
Clear interrupt 
enable 
flag 


EXTERNAL SYNCHRONIZATION 


HLT 
Halt until interrupt 
or reset 


WAIT 
Wait for TEST pin active 


ESC 
Escape 
to external 
processor 


LOCK 
Lock bus during 
next 


instruction 


NO OPERATION 


NOP 
No operation 


CMC 
(Complement 
Carry 
flag) "toggles" 
CF to 
its opposite 
state and affects 
no other 
flags. 


STC (Set Carry 
flag) sets CF to 1 and affects 
no 
other 
flags. 


CLD (Clear 
Direction 
flag) zeroes 
DF causing 
the 
string 
instructions 
to 
auto-increment 
the 
SI 
and/or 
DI index 
registers. 
CLD 
does 
not 
affect 
any other 
flags. 


STD (Set Direction 
flag) sets DF to 1 causing 
the 
string 
instructions 
to 
auto-decrement 
the 
SI 
and/or 
DI index 
registers. 
STD 
does 
not 
affect 
any other 
flags. 


CLl 
(Clear 
Interrupt-enable 
flag) 
zeroes 
IF. 
When the interrupt-enable 
flag is cleared, 
the 
8086 and 8088 do not recognize an external inter- 
rupt request that appears on the INTR line; in 
other words mask able interrupts 
are disabled. A 
non-maskable 
interrupt 
appearing 
on the NMI 
line, however, is honored, 
as is a software inter- 
rupt. CLl does not affect any other flags. 


STI (Set Interrupt-enable 
flag) sets IF to 1, en- 


abling processor 
recognition 
of mask able inter- 
rupt requests appearing 
on the INTR line. Note 
however, that a pending interrupt 
will not actu- 


ally be recognized until the instruction 
following 
STI has executed. STI does not affect any other 
flags. 


HLT (Halt) causes the 8086/8088 
to enter the halt 
state. The processor 
leaves the halt state upon 
activation of the RESET line, upon receipt of a 
non-mask able interrupt 
request on NMI, or, if 
interrupts 
are 
enabled, 
upon 
receipt 
of 
a 
maskable interrupt 
request on INtR. 
HLT does 
not affect any flags. It may be used as an alter- 
native to an endless software loop in situations 
where a program must wait for an interrupt. 


WAIT causes the CPU to enter the wait state 
while its TEST line is not active. WAIT does not 
affect 
any flags. This instruction 
is described 
more completely in section 2.5. 


ESC (Escape) provides a means for an external 
processor 
to obtain 
an opcode 
and possibly a 
memory 
operand 
from 
the 8086 or 8088. The 
external opcode is a 6-bit immediate constant that 
the assembler encodes in the machine instruction 


it builds (see table 2-26). An external processor 
may monitor 
the system bus and capture 
this 
opcode when the ESC is fetched. If the source 
operand is a register, the processor does nothing. 
If the source operand is a memory variable, the 
processor obtains the operand from memory and 
discards it. An external processor may capture the 
memory 
operand 
when the processor 
reads it 
from memory. 


LOCK 
is a 
one-byte 
prefix 
that 
causes 
the 
8086/8088 
(configured 
in maximum 
mode) 
to 
assert its bus LOCK signal while the following 
instruction 
executes. LOCK does not affect any 
flags. See section 2.5 for more information 
on 
LOCK. 


NOP 
(No Operation) 
causes 
the 
CPU 
to do 
nothing. NOP does not affect any flags. 


Table 2-21 provides detailed operational informa- 
tion 
for 
the 
8086/8088 
instruction 
set. 
The 
information 
is presented from the point of view 
of utility to the assembly language programmer. 
Tables 2-17, 2-18 and 2-19 explain the symbols 
used in table 2-21. Machine language instruction 
encoding and decoding information 
is given in 
Chapter 4. 


Instruction 
timings are presented as the number 
of clock periods required to execute a particular 
form (register-to-register, 
immediate-to-memory, 
etc.) of the instruction. If a system is running with 
a 5 MHz maximum clock, the maximum clock 
period is 200 ns; at 8 MHz, the clock period is 125 
ns. Where memory operands 
are used, "+EA" 
denotes a variable 
number 
of additional 
clock 
periods needed to calculate the operand's 
effec- 
tive address (discussed in section 2.8). Table 2-20 
lists all effective address calculation times. 


destination 
data transfer, 
bit manipulation 


source 
data transfer, 
arithmetic, 
bit manipulation 


short-label 
condotransfer, 
iteration control 


A register or memory location that may contain data 
operated on by the instruction, and which receives (is 
replaced by) the result of the operation. 


A register, memory location or immediate value that is 
used in the operation, but is not altered by the instruc- 
tion. 


Name of memory translation table addressed by register 
BX. 


A label to which control is to be transferred directly, or a 
register 
or 
memory 
location 
whose content 
is the 


address of the location to which control is to be transfer- 
red indirectly. 


A 
label 
to 
which 
control 
is 
to 
be 
conditionally 


transferred; must lie within -128 to +127bytes of the first 
byte of the next instruction. 


An I/O port number;-specified as an immediate value of 
0-255, or register OX (which contains port number in 
range 0-64k). 


Name of a string in memory that is addressed by register 
SI; used only to identify string as byte or word and 
specify segment override, if any. This string is used in 
the operation, but is not altered. 


Name of string in memory that is addressed by register 


01; used only to identify string as byte or word. This 
string receives (is replaced by) the result of the opera- 
tion. 


Specifies number of bits to shift or rotate; written as 
immediate value 1 or register CL (which contains the 
count in the range 0-255). 


Immediate value of 0-255 identifying 
interrupt 
pointer 


number. 


Number of bytes (0-64k, ordinarily an even number) to 
discard from stack. 
• 


Immediate value (0-63)that is encoded in the instruction 
for use by an external processor. 


IDENTIFIER 
EXPLANATION 


(blank) 
not altered 


0 
cleared to 0 


1 
set to 1 


X 
set or cleared according 
to result 


U 
undefined-contains 
no 
reliable value 


R 
restored from previously- 
saved value 


For 
control 
transfer 
instructions, 
the timings 
given include any additional 
clocks required to 
reinitialize 
the instruction 
queue as well as the 
time required to fetch the target instruction. 
For 
instructions 
executing on an 8086, four 
clocks 
should be added for each instruction reference to 
a word 
operand 
located 
at 
an odd 
memory 
address to reflect 
any additional 
operand bus 
cycles required. 
Similarly 
for 
instructions 
exe- 
cuting on an 8088, four clocks should be added to 
each instruction 
reference to a 16-bit memory 
operand; this includes all stack operations. The 
required number of data references is listed in 
table 2-21 for 
each instruction 
to 
aid in this 
calculation. 


Several additional 
factors 
can increase actual 
execution time over the figures shown in table 
2-21. The time provided assumesthat the instruc- 
tion has already been prefetched and that it is 
waiting in the instruction 
queue, an assumption 
that is valid under most, but not all, operating 
conditions. A series of fast executing (fewer than 
two clocks per opcode byte) instructions can drain 
the queue and increase execution time. Execution 
time also is slightly impacted by the interaction of 
the EU and BIU when memory operands must be 
read or 
written. 
If 
the 
EU 
needs access to 
memory, it may have to wait for up to one clock if 
the BIU has already started an instruction 
fetch 
bus cycle. (The EU can detect the need for 
a 
memory 
operand 
and post a bus request far 
enough in advance of its need for this operand to 
avoid waiting a full 4-clock bus cycle). Of course 
the EU does not have to wait if the queue is full, 
becausethe BIU is idle. (This discussion assumes 


No operands are written 


An 8-or 16-bit general register 
A 16-bit general register 


A segment register 


Register AX or AL 


A 
constant 
in 
the 
range 
O-FFFFH 


A constant in the range O-FFH 


An 
8- 
or 
16-bit 
memory 
10cation(1) 
An 8-bit memory location(1) 


A 16-bit memory 10cation(1) 


Name 
of 
256-byte 
translate 
table 
Name of string addressed 
by 
register 81 


Name of string addressed 
by 
register DI 


Register DX 


A label 
within 
-128 to 
+127 
bytes of the end of the instruc- 
tion 


A 
label 
in 
current 
code 
segment 
A 
label 
in 
another 
code 
segment 


A procedure 
in current 
code 
segment 


A procedure 
in another code 
segment 


A word containing the offset of 
the location in the current code 
segment to which control is to 
be transferred(1) 


A doubleword 
containing 
the 
offset and the segment 
base 
address 
of 
the 
location 
in 
another code segment to which 
control is to be transferred(1) 
A 
16-bit 
general 
register 
containing 
the 
offset 
of 
the 
location 
in the current 
code 
segment to which control is to 
be transferred 


A 
string 
instruction 
repeat 
prefix 


(1)Anyaddressing mode-direct, 
register in- 
direct, based, indexed, or based 
indexed-may 
be used (see section 2.8). 


(no operands) 


register 


reg 16 


seg-reg 


accumulator 


immediate 


mem8 
mem16 
source-table 


1aOle .l-.lU. bUeCtive AOOress Calculation 
Time 


EA COMPONENTS 
CLOCKS· 


Displacement 
Onlv 
6 
Base or Index Only 
(BX,BP,SI,DI) 
5 
Displacement 
+ 
9 
Base or Index 
(BX,BP,SI,DI) 
Base 
BP+DI, 
BX+SI 
7 


+ 
Index 
BP+SI, 
BX+DI 
8 


Displacement 
BP+DI+DISP 
11 
+ 
BX+SI+DISP 
Base 


+ 
BP+SI+DISP 
12 
Index 
BX+DI+DISP 


that the tllU can obtam the bus on demand, i.e., 
that no other processors 
are competing 
for the 


bus.) 


With typical instruction 
mixes, the time actually 
required to execute a sequence of instructions will 
typically be within 
5-10% 
of the sum of the 


individual timings given in table 2-21. Cases can 
be constructed, 
however, in which execution time 


may be much higher than the sum of the figures 
provided in the table. The execution time for a 
given sequence of instructions, however, is always 
repeatable, assuming comparable 
external condi- 


tions (interrupts, coprocessor activity, etc.). If the 
execution time for a given series of instructions 
must 
be determined 
exactly, 
the 
instructions 
should be run on an execution vehicle such as the 
SDK-86 or the iSBC 86112™ board. 


AAA 
I~AA (no operands) 
Flags 
ODITSZAPC 


ASCII adjust for addition 
U 
UUXUX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
4 
- 
1 
AAA 


AAD 
I~AD (no operands) 
Flags 
ODITSZAPC 


ASCII adjust for division 
U 
XXUXU 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
60 
- 
2 
AAD 


AAM 
I~AM (no operands) 
Flags 
ODITSZAPC 


ASCII adjust for multiply 
U 
XXUXU 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
83 
- 
1 
AAM 


AAS 
I~AS (no operands) 
Flags 
ODITSZAPC 


ASCII adjust for subtraction 
U 
UUXUX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
4 
- 
1 
AAS 


ADC 
I~DC destination,source 
Flags 
ODITSZAPC 
Add with carry 
X 
XXXXX 


Operands 
. 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


register, register 
3 
- 
2 
ADC AX, SI 
register, memory 
9+EA 
1 
2-4 
ADC DX, BETA [SI] 
memory, register 
16+ EA 
2 
2-4 
ADC ALPHA [BX] [SI], DI 


register, immediate 
4 
- 
3-4 
ADC BX,256 
memory, immediate 
17+EA 
2 
3-6 
ADC GAMMA, 30H 
accumulator, 
immediate 
4 
- 
2-3 
ADC AL,5 


ADD 
I~DD destination,source 
Flags 
ODITSZAPC 
Addition 
X 
XXXXX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


register, register 
3 
- 
2 
ADD CX, DX 
register, memory 
9+EA 
1 
2-4 
ADD DI, [BX].ALPHA 
memory, register 
16+ EA 
2 
2-4 
ADD TEMP, CL 
register, immediate 
4 
- 
3-4 
ADD CL,2 
memory, immediate 
17+ EA 
2 
3-6 
ADD ALPHA,2 
accumulator, 
immediate 
4 
- 
2-3 
ADD AX, 200 


AND 
I~ND destination,source 
Flags 
ODITSZAPC 
Logical and 
0 
XXUXO 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


register, register 
3 
- 
2 
AND AL,BL 
register, memory 
9+EA 
1 
2-4 
AND CX,FLAG~WORD 
memory, register 
16+EA 
2 
I 
2-4 
AND ASCII [DI],AL 
register, immediate 
4 
- 
3-4 
AND CX,OFOH 
memory, immediate 
17+ EA 
2 
3-6 
AND BETA,01H 
accumulator, 
immediate 
4 
- 
2-3 
AND AX,01010000B 


CALL 
I~ALL target 


i 
ODITSZAPC 


Call a procedure 
Flags 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Examples 


near-proc 
19 
1 
3 
CALL NEAR~PROC 
far-proc 
28 
2 
5 
CALL FAR~PROC 
memptr16 
21+ EA 
2 
2-4 
CALL PROC_ TABLE [SI] 


regptr 16 
16 
1 
2 
CALL AX 
memptr 32 
37+EA 
4 
2-4 
CALL [BX].TASK [SI] 


CBW 
I,CBW (no operands) 
Flags 
ODITSZAPC 
Convert byte to word 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


(no operands) 
2 
- 
1 
CBW 


CLC 
ICLC (no operands) 
Flags 
ODITSZAPC 
Clear carry flag 
0 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
2 
- 
1 
CLC 


CLO 
ICLD (no operands) 
Flags 
ODITSZAPC 


Clear direction flag 
0 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
2 
- 
1 
CLD 


CLI 
ICLI (no operands) 
Flags 
ODITSZAPC 


Clear interrupt flag 
0 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
2 
, - 
1 
CLI 


CMC 
ICMC (no operands) 
Flags 
ODITSZAPC 


Complement carry flag 
X 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
2 
, 
- 
1 
CMC 


CMP 
I,CMP destination, source 
Flags 
ODITSZAPC 


Compare destination 
to source 
X 
XXXXX 


, 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 
. 


register, register 
3 
- 
2 
CMP BX, CX 
register, memory 
9+EA 
1 
2-4 
CMP DH, ALPHA 
memory, register 
9+EA 
1 
2-4 
CMP [BP+2], 
SI 
register, immediate 
4 
- 
3-4 
CMP BL,02H 
memory, immediate 
10+ EA 
1 
3-6 
CMP [BX].RADAR [DI], 3420H 
accumulator, 
immediate 
4 
- 
2-3 
CMP AL,00010000B 
, 


CMPS 
I,CMPS dest-string,source-string 
Flags 
ODITSZAPC 


Compare string 
. 
X 
XXXXX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


dest-string, 
source-string 
22 
2 
1 
CMPS BUFF1, BUFF2 


(repeat) dest-string, 
source-string 
9+22/rep 
2/rep 
1 
REPE CMPS ID, KEY 


cwo 
I~WD (no operands) 
Flags 
ODITSZAPC 
Convert word to doubleword 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
5 
- 
1 
CWO 


OAA 
IIDAA (no operands) 
Flags 
ODITSZAPC 
Decimal adjust for addition 
X 
XXXXX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
4 
- 
1 
DAA 


OAS 
I~AS (no operands) 
Flags 
ODITSZAPC 


Decimal adjust for subtraction 
U 
XXXXX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
4 
- 
1 
DAS 


OEC 
IIDEC destination 
Flags 
ODITSZAPC 
Decrement by 1 
X 
XXXX 


- 
Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


reg16 
2 
- 
1 
DEC AX 
reg8 
3 
- 
2 
DEC AL 
memory 
15+ EA 
2 
2-4 
DEC ARRAY [SI] 


OIV 
I~IV source 
Flags 
ODITSZAPC 


Division, unsigned 
U 
UUUUU 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


reg8 
80-90 
- 
2 
DIV CL 


reg16 
144-162 
- 
2 
DIV BX 
mem8 
(86-96) 
1 
2-4 
DIV ALPHA 
+EA 
mem16 
(150-168) 
1 
2-4 
DIV TABLE [SI] 


i 
+EA 


ESC 
I~SC external-opcode,source 
Flags 
ODITSZAP 
C 


Escape 


Op~rands 
Clocks 
Transfers· 
Bytes 
Coding Example 


immediate, memory 
8+EA 
1 
2-4 
ESC 6,ARRA Y [SI) 
immediate, 
register 
2 
- 
2 
ESC 20,AL 


HLT 
IHLT (no operands) 
Flags 
ODITSZAPC 
Halt 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
2 
- 
1 
HLT 


IDIV 
IIDIV 
source 
Flags 
ODITSZAPC 
Integer division 
U 
UUUUU 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


reg8 
101-112 
- 
2 
IDIV BL 
reg16 
165-184 
- 
2 
IDIV CX 
mem8 
(107-118) 
1 
2-4 
IDIV DIVISOR_BYTE 
[$1] 
+EA 
mem16 
(171-190) 
1 
2-4 
IDIV [BX).DIVISOR_WORD 
+EA 


IMUL 
IIMUL source 
Flags 
ODITSZAPC 
Integer multiplication 
X 
UUUUX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


reg8 
80-98 
- 
2 
IMUL CL 
reg16 
128c154 
- 
2 
IMUL BX 
mem8 
(86-104) 
1 
2-4 
IMUL RATE_BYTE 
+EA 
mem16 
(134-160) 
1 
2-4 
IMUL RATE_WORD 
[BP] [DI] 
+EA 


IN 
IIN accumulator, port 
Flags 
ODITSZAPC 
Input byte or word 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


accumulator, 
immed8 
10 
1 
2 
IN AL,OFFEAH 
accumulator, 
OX 
8 
1 
1 
IN AX, DX 


INC 
IINC destination 
Flags 
ODITSZAPC 
Increment by 1 
X 
X X X X 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


reg16 
2 
- 
1 
INC CX 
reg8 
3 
- 
2 
INC BL 
memory 
15+ EA 
2 
2-4 
INC ALPHA [DIJ[BX] 


INT 
liNT interrupt-type 
Flags 
ODITSZAPC 
Interrupt 
o 0 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


immed8 (type = 3) 
52 
5 
1 
INT 3 
immed8 (type "* 3) 
51 
5 
2 
INT 67 


INTRt 
INTR (external maskable interrupt) 
Flags 
ODITSZAPC 
Interrupt if INTR and IF=1 
o 0 


Operands 
Clocks 
Transfers· 
Byt~s 
Coding Example 


(no operands) 
61 
7 
N/A 
N/A 


IINTO (no operands) 


, 


ODITSZAPC 
INTO 
Interrupt if overflow 
Flags 
o 0 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
53or 4 
5 
1 
INTO 


IRET 
IIRET (no operands) 
Flags 
ODITSZAPC 


Interrupt Return 
RRRRRRRRR 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
24 
3 
1 
IRET 


JA/JNBE 
IJA/JNBE 
short-label 
Flags 
ODITSZAPC 


Jump if above/Jump 
if not below nor equal 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JA ABOVE 


,,' 


JAE/JNB 
IJAE/JNB 
short-label 
Flags 
ODITSZAPC 
Jump if above or equal/Jump 
if not below 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JAE ABOVE_EQUAL 


JB/JNAE 
IJB/JNAE 
short-label 
Flags 
ODITSZAPC 
Jump if below/Jump 
if not above nor equal 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JB BELOW 


"For the 8086,add four clocks for each 16-bit word transfer with an odd address. For the 8088,add four clocks for each 16-bit word transfer. 


tlNTR is not an instruction; 
it is included in table 2-21only for timing information. 


JBE/JNA 
IJBE/JNA short-label 
Flags 
ODITSZAPC 
Jump if below or equal/Jump if not above 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JNA NOT_ABOVE 


JC 
IJC short-label 
Flags 
ODITSZAPC 
Jump if carry 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JC CARRY_SET 


JCXZ 
IJCXZ short-label 
Flags 
ODITSZAPC 
Jump if CX is zero 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
18or 6 
- 
2 
JCXZ COUNT_DONE 


JE/JZ 
IJE/ JZ short-label 
Flags 
ODITSZAPC 
Jump if equal/Jump if zero 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
, 
16or 4 
- 
2 
JZ ZERO 


JG/JNLE 
IJG/JNLE short-label 
Flags 
ODITSZAPC 
Jump if greater/Jump if not less nor equal 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JG GREATER 


JGE/JNL 
IJGE/JNL short-label 
Flags 
ODITSZAPC 
Jump if greater or equal/Jump if not less 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JGE GREATER_EQUAL 


JL/JNGE 
IJL/JNGE short-label 
Flags 
ODITSZAPC 
Jump if less/Jump if not greater nor equal 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JL LESS 


JlE/JNG 
I JLE/JNG 
short-label 
Flags 
ODITSZAPC 
Jump if less or equal/Jump 
if not greater 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


short-label 
16or 4 
- 
2 
JNG NOT_GREATER 


JMP 
I JMP target 
Flags 
ODITSZAPC 
Jump 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


short-label 
15 
- 
2 
JMP SHORT 
near-label 
15 
- 
3 
JMP WITHIN_SEGMENT 
far-label 
15 
- 
5 
JMP FAR~LABEL 
memptr16 
18+ EA 
1 
2-4 
JMP [BX].TARGET 
regptr16 
11 
- 
2 
JMP CX 
memptr32 
24+ EA 
2 
2-4 
JMP OTHER.SEG [SI] 
,. 


JNC 
I JNC short-label 
Flags 
ODITSZAPC 
Jump if not carry 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


short-label 
16or 4 
- 
2 
JNC NOT_CARRY 


JNE/JNZ 
I JNE/JNZ 
short-label 
Flags 
ODITSZAPC 
Jump if not equal/Jump 
if not zero 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


short-label 
16or 4 
- 
2 
JNE NOT_EQUAL 


JNO 
I JNO short-label 
Flags 
ODITSZAPC 
Jump if not overflow 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


short-label 
I 
16or 4 
- 
2 
JNO NO_OVERFLOW 


JNP/JPO 
/JNP/JPO 
short-label 
Flags 
ODITSZAPC 
Jump if not parity/Jump 
if parity odd 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


short-label 
, 
16or 4 
- 
2 
JPO ODD_PARITY 


JNS 
I JNS short-label 
Flags 
ODITSZAPC 
Jump if not sign 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


short-label 
16or 4 
- 
2 
JNS POSITIVE 


JO 
IJO short-label 
Flags 
OOITSZAPC 
Jump if overflow 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JO SIGNEO_OVRFLW 


JP/JPE 
IJP/JPE short-label 
Flags 
OOITSZAPC 


Jump if parity I Jump if parity even 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JPE EVEN_PARITY 


JS 
IJS short-label 
Flags 
OOITSZAPC 
Jump if sign 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


short-label 
16or 4 
- 
2 
JS NEGATIVE 


LAHF 
ILAHF (no operands) 
Flags 
OOITSZAPC 
Load AH from flags 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
4 
- 
1 
LAHF 


LOS 
ILOS destination,source 
Flags 
OOITSZAPC 
Load pointer using OS 


Operands 
Clocks 
Transfers 
Bytes 
Coding Example 


reg16, mem32 
16+ EA 
2 
2-4 
LOS SI,OATA.SEG [01] 


LEA 
I LEA destination,source 
Flags 
OOITSZAPC 
Load effective address 


Operands 
- 
Clocks 
Transfers· 
Bytes 
Coding Example 


reg16, mem16 
2+EA 
- 
2-4 
LEA BX, [BP] [01] 


LES 
ILES destination,source 
Flags 
OOITSZAPC 


Load pointer using ES 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


reg16, mem32 
16+EA 
2 
2-4 
LES 01, [BX].TEXT_BUFF 


LOCK 
ILOCK (no operands) 
Flags 
ODITSZAPC 
Lock bus 


Operands 
Clocks 
Transfers' 
Bytes 
Coding Example 


(no operands) 
2 
- 
1 
LOCK XCHG FLAG,AL 


LODS 
ILODS source-string 
Flags 
ODITSZAPC 
Load string 


Operands 
Clocks 
Transfers' 
Bytes 
Coding Example 


source-string 
12 
1 
1 
LODS CUSTOMER_NAME 


(repeat) source-string 
9+13/rep 
1/rep 
1 
REP LODS NAME 


LOOP 
IILOOP short-label 
Flags 
ODITSZAPC 
Loop 


Operands 
Clocks 
Transfers' 
Bytes 
Coding Example 


short-label 
i 
17/5 
- 
2 
LOOP AGAIN 


LOOPE/LOOPZ 
ILOOPE/LOOPZ 
short-label 
Flags 
ODITSZAPC 


Loop if equal/ Loop if zero 


Operands 
Clocks 
Transfers' 
Bytes 
Coding Example 


short-label 
. 


18or 6 
2 
LOOPE AGAIN 
- 


LOOPNE/LOOPNZ 
I~OO~NE/LOOPNZ 
sho.rt-Iabel 
Flags 
ODITSZAPC 


Loop If not equal/ Loop If not zero 


-. 
Operands 
Clocks 
Transfers' 
Bytes 
Coding Example 


short-label 
19or 5 
- 
2 
LOOPNE AGAIN 


NMlt 
I NMI (external nonmaskable interrupt) 
Flags 
OSITSZAPC 
- 
Interrupt if NMI = 1 
o 0 
. 


Operands 
Clocks 
Transfers' 
Bytes 
Coding Example 


(no operands) 
I 
50' 
5 
N/A 
N/A 


"For the 8086,add four clocks for each 16-bit word transfer with an odd address. For the 8086,add four clocks for each 16-bit word transfer. 
tNMI is not an instruction; 
it is included in table 2-21only for timing information. 


MOV 
IMOV destination,source 
Flags 
ODITSZAPC 


Move 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


memory, accumulator 
10 
1 
3 
MOV ARRAY [SI], AL 
accumulator, 
memory 
10 
1 
3 
MOV AX, TEMP_RESULT 
register, register 
2 
- 
2 
MOV AX,CX 


register, memory 
8+EA 
1 
2-4 
MOV BP, STACK_TOP 
memory, register 
9+EA 
1 
2-4 
MOV COUNT [01], CX 
register, immediate 
4 
- 
2-3 
MOV CL,2 
memory, immediate 
10+EA 
1 
3-6 
MOV MASK [BX] [SI], 2C.H 
seg-reg, reg16 
2 
- 
2 
MOV ES, CX 
seg-reg, mem16 
8+EA 
1 
2-4 
MOV OS, SEGMENT_BASE 
reg16, seg-reg 
2 
- 
2 
MOV BP, SS 
memory, seg-reg 
9+EA 
1 
2-4 
MOV [BX].SEG_SAVE, 
CS 


MOVS 
IMOVS dest-string,source-string 
Flags 
ODITSZAPC 
Move string 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


dest-string, 
source-string 
18 
2 
1 
MOVS LINE EDIT_DATA 


(repeat) dest-string, 
source-string 
9+17/rep 
2/rep 
1 
REP MOVSSCREEN,BUFFER 


MOVSB/MOVSW 
IMOVSB/MOVSW 
(no operands) 
Flags 
ODITSZAPC 


Move string (byte/word) 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
18 
2 
1 
MOVSB 


(repeat) (no operands) 
9+17/rep 
2/rep 
1 
REP MOVSW 


MUL 
IMUL source 
Flags 
ODITSZAPC 


Multiplication, 
unsigned 
X 
UUUUX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


reg8 
70-77 
- 
2 
MUL BL 
reg16 
118-133 
- 
2 
MUL CX 
mem8 
(76-83) 
1 
2-4 
MUL MONTH [SI] 
+EA 
mem16 
(124-139) 
1 
2-4 
MUL BAUD_RATE 
+EA 


NEG 
INEG destination 
Flags 
OOITSZAPC 
Negate 
X 
X X X X1' 


Operands 
Clocks 
Transfers' 
Bytes 
Coding Example 


register 
3 
- 
2 
NEG AL 
memory 
16+EA 
2 
2-4 
NEG MULTIPLIER 


NOP 
INOP (no operands) 
Flags 
OOITSZAPC 
No Operation 


Operands 
Clocks 
Transfers' 
Bytes 
Coding Example 


(no operands) 
3 
- 
1 
NOP 


NOT 
INOT destination 
Flags 
OOITSZAPC 


Logical not 


Operands 
Clocks 
Transfers' 
Bytes 
Coding Example 


register 
3 
- 
2 
NOT AX 
memory 
16+ EA 
2 
2-4 
NOT CHARACTER 


OR 
lOR destination,source 
Flags 
OOITSZAPC 


Logical inclusive or 
0 
XXUXO 


Operands 
Clocks 
Transfers' 
Bytes 
Coding Example 


register, register 
3 
- 
2 
OR AL, BL 
register, memory 
9+EA 
1 
2-4 
OR OX, PORT_10 [01) 


memory, register 
16+ EA 
2 
2-4 
OR FLAG_BYTE, 
CL 
accumulator, 
immediate 
4 
- 
2-3 
OR 
AL,01101100B 
register, immediate 
4 
- 
3-4 
OR CX,01H 
memory, immediate 
17+ EA 
2 
3-6 
OR [BX].CMO_WORO,OCFH 


OUT 
I~UT port,accumulator 
Flags 
OOITSZAPC 
Output byte or word 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


immedB, accumulator 
10 
1 
2 
OUT 44, AX 
OX, accumulator 
B 
1 
1 
OUT OX, AL 


POP 
I~OP destination 
Flags 
OOITSZAPC 


Pop word off stack 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


register 
B 
1 
1 
POP OX 
seg-reg (CS illegal) 
B 
1 
1 
POP OS 
memory 
17+EA 
2 
2-4 
POP PARAMETER 


POPF 
liOPF 
(no operands) 
Flags 
ODITSZAPC 
Pop flags off stack 
RRRRRRRRR 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
8 
1 
1 
POPF 


PUSH 
IiUSH source 
Flags 
ODITSZAPC 


Push word onto stack 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


register 
11 
1 
1 
PUSH SI 


seg-reg (CS legal) 
10 
1 
1 
PUSH ES 
memory 
16+ EA 
2 
2-4 
PUSH RETURN_CODE 
[SI] 


PUSHF 
IiUSH F (no operands) 
Flags 
ODITSZAPC 
Push flags onto stack 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
10 
1 
1 
PUSHF 


RCL 
I~CL destination,count 
Flags 
ODITSZAPC 
Rotate left through carry 
X 
X 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


register, 1 
2 
- 
2 
RCL CX, 1 
register, CL 
8+ 4/ bit 
- 
2 
RCL AL, CL 
memory, 1 
15+ EA 
2 
2-4 
RCL ALPHA, 1 
memory, CL 
20+EA+ 
2 
2-4 
RCL [BP].PARM, CL 
4/bit 


RCR 
I~CR designation,count 
Flags 
ODITSZAPC 
Rotate right through carry 
X 
X 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


register, 1 
2 
- 
2 
RCR BX,1 
register, CL 
8+4/bit 
- 
2 
RCR BL, CL 
memory, 1 
15+ EA 
2 
, 
2-4 
RCR [BX].STATUS,1 
memory, CL 
20+EA+ 
2 
2-4 
RCR ARRAY [Dl), CL 
4/bit 


REP 
I~EP (no operands) 
Flags 
ODITSZAPC 
Repeat string operation 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
2 
- 
1 
REP MOVS DEST, SRCE 


REPE/REPZ 
IREPE/REPZ (no operands) 
Flags 
ODITSZAPC 
Repeatstringoperationwhileequal/whilezero 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
2 
- 
1 
REPE CMPS DATA, KEY 


REPNE/REPNZ 
IREPNE/REPNZ 
(no operands) 
Flags 
ODITSZAPC 
Repeatstringoperationwhilenotequal/notzero 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
2 
- 
1 
REPNE SCAS INPUT_LINE 


RET 
IRET optional-pop-value 
Flags 
ODITSZAPC 
Return from procedure 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(intra-segment, 
no pop) 
8 
1 
1 
RET 
(intra-segment, 
pop) 
12 
1 
3 
RET 4 
(inter-segment, 
no pop) 
18 
2 
1 
RET 
(inter-segment, 
pop) 
17 
2 
3 
RET 2 


ROL 
IROL destination,count 
Flags 
ODITSZAPC 
Rotate left 
X 
X 


Operands 
Clocks 
Transfers 
Bytes 
Coding Examples 


register, 1 
2 
- 
2 
ROL BX,1 
register, CL 
8+4/bit 
- 
2 
ROL DI, CL 
memory, 1 
15+ EA 
2 
2-4 
ROL FLAG_BYTE 
[DI],1 
memory, CL 
20+EA+ 
2 
2-4 
ROL 
ALPHA, 
CL 
4/bit 


ROR 
I~OR destination ,count 
Flags 
ODITSZAPC 
Rotate right 
X 
X 


Operand 
Clocks 
Transfers· 
Bytes 
Coding Example 


register, 1 
2 
- 
2 
ROR AL,1 
register, CL 
8+4/bit 
- 
2 
ROR BX, CL 
memory, 1 
15+ EA 
2 
2-4 
ROR PORT_STATUS, 
1 
memory, CL 
20+EA+ 
2 
2-4 
ROR CMD_WORD, 
CL 
4/bit 


SAHF 
ISAHF (no operands) 
Flags 
ODITSZAPC 
Store AH into flags 
RRRRR 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
4 
- 
1 
SAHF 


SAL/SHL 
ISAL/SHL destination,count 
Flags 
OOITSZAPC 


Shift arithmetic 
left/Shift 
logical left 
X 
X 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Examples 


register,1 
2 
- 
2 
SAL AL,1 
register, CL 
8+4/bit 
- 
2 
SHL 01, CL 
memory,1 
15+ EA 
2 
2-4 
SHL [BX].OVERORAW,1 


memory, CL 
20+EA+ 
2 
2-4 
SAL STORE_COUNT, 
CL 
4/bit 


SAR 
ISAR destination, source 
Flags 
OOITSZAPC 


Shift arithmetic right 
X 
XXUXX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


register, 1 
2 
- 
2 
SAR OX,1 
register, CL 
8+4/bit 
- 
2 
SAR 01, CL 
memory, 1 
15+ EA 
2 
2-4 
SAR N_BLOCKS,1 
memory, CL 
20+EA+ 
2 
2-4 
SAR N_BLOCKS, 
CL 
4/bit 


SBB 
ISBB destination,source 
Flags 
OOITSZAPC 


Subtract with borrow 
X 
XXXXX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


register, register 
3 
- 
2 
SBB BX, CX 
register, memory 
9+EA 
1 
2-4 
SBB 01, [BXj.PAYMENT 
memory, register 
16+ EA 
2 
2-4 
SBB I;3ALANCE,AX 
accumulator, 
immediate 
4 
- 
2-3 
SBB AX,2 


register, immediate 
4 
- 
3-4 
SBB CL,1 
memory, immediate 
17+ EA 
2 
3-6 
SBB COUNT [SI], 10 


, 
ISCAS dest-string 
OOITSZAPC 
SeAS 
Flags 
Scan string 
X 
XXXXX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


dest-string 
15 
1 
1 
SCAS INPUT_LINE 
(repeat) dest-string 
9+15/rep 
1/rep 
1 
REPNE SCAS BUFFER 


I 


SEGMENTt 
I~EGMENT override prefix 
Flags 
OOITSZAPC 


Override to specified segment 


Operands 
Clocks 
Transfers· 
Bytes 
Coding Example 


(no operands) 
2 
- 
1 
MOV SS:PARAMETER, AX 


·For the 8086,add four clocks for each 16-bit word transfer with an odd address. For the 8088,add four clocks for each 16-bit word transfer. 


tASM-86 incorporates 
the segment override prefix into the operand specification 
and not as a separate instruction. 
SEGMENT is included in lable 


2-21only for liming information. 


sHR 
I SHR destination, 
count 
Flags 
ODITSZAPC 
Shift 
logical 
right 
X 
X 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


register, 
1 
2 
- 
2 
SHR 
SI,1 
register, 
CL 
8+4/bit 
- 
2 
SHR 
SI, CL 
memory, 
1 
15+ EA 
2 
2-4 
SHR 
ID_BYTE 
[SI] [BX], 1 


memory, 
CL 
20+EA+ 
2 
2-4 
SHR 
INPUT_WORD, 
CL 


4/bit 


SINGLE sTEPt 
I SINGLE 
STEP (Trap flag interrupt) 
Flags 
ODITSZAPC 


Interrupt 
if TF = 1 
o 0 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


(no operands) 
50 
5 
N/A 
N/A 


sTC 
I STC (no operands) 
Flags 
ODITSZAPC 


Set carry 
flag 
1 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


(no operands) 
2 
- 
1 
STC 


sTO 
I~TD (no operands) 
Flags 
ODITSZAPC 


Set direction 
flag 
1 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


(no operands) 
2 
- 
1 
STD 


sTI 
I~TI (no operands) 
Flags 
ODITSZAPC 


Set interrupt 
enable 
flag 
1 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


(no operands) 
2 
- 
1 
STI 


- 


sTos 
I STOS dest-string 
Flags 
ODITSZAPC 


Store 
byte or word 
string 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


dest-string 
11 
1 
1 
STOS 
PRINT_LINE 


(repeat) 
dest-string 
9+10/rep 
1/rep 
1 
REP 
STOS DISPLAY 


'For 
the 8086, add four 
clocks 
for each 
16-bit 
word 
transfer 
with 
an odd 
address. 
For the 8088, add four 
clocks 
for each 
16-bit 
word 
transfer. 


tSINGLE 
STEP 
is not an instruction; 
it is included 
in table 
2-21 only 
for timing 
information. 


SUB 
I SUB destination,source 
Flags 
ODITSZAPC 
Subtraction 
X 
XXXXX 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


register, register 
3 
- 
2 
SUB CX, BX 
register, memory 
9+EA 
1 
2-4 
SUB DX, MATH_TOTAL 
[SI) 
memory, register 
16+ EA 
2 
2-4 
SUB [BP+2], 
CL 
accumulator, 
immediate 
4 
- 
2-3 
SUB AL,10 
register, immediate 
4 
- 
3-4 
SUB SI,5280 
memory, immediate 
17+EA 
2 
3-6 
SUB [BP].BALANCE,1000 


TEST 
I:rEST destination,source 
Flags 
ODITSZAPC 
Test or non-destructive 
logical and 
0 
XXUXO 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


register, register 
3 
- 
2 
TEST SI, DI 
register, memory 
9+EA 
1 
2-4 
TEST SI, END_COUNT 
accumulator, 
immediate 
4 
- 
2-3 
TEST AL,00100000B 
register, immediate 
5 
- 
3-4 
TEST BX, OCC4H 
memory, immediate 
11+ EA 
- 
3-6 
TEST RETURN_CODE, 
01H 


, 


I~AIT 
(no operands) 
ODITSZAPC 
WAIT 
Wait while TEST pin not asserted 
Flags 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


(no operands) 
3 + 5n 
- 
1 
WAIT 
, 


XCHG 
I~CHG destination, source 
Flags 
ODITSZAPC 
Exchange 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


accumulator, 
reg16 
3 
- 
1 
XCHG AX, BX 
memory, register 
17+EA 
2 
2-4 
XCHG SEMAPHORE, AX 
register, register 
4 
- 
2 
XCHG AL, BL 


XLAT 
I~LAT source-table 
Flags 
ODITSZAPC 
Translate 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


source-table 
11 
1 
1 
XLAT ASCII_TAB 


XOR 
I XOR destination, source 
Flags 
ODITSZAPC 
Logical exclusive or 
0 
XXUXO 


Operands 
Clocks 
Transfers· 
Bytes 
Coding 
Example 


register, register 
3 
- 
2 
XOR CX, BX 


register, memory 
9+EA 
1 
2-4 
XOR CL, MASK_BYTE 


memory, register 
16+ EA 
2 
2-4 
XOR ALPHA [SI), DX 
accumulator, 
immediate 
4 
- 
2-3 
XOR AL,01000010B 


register, immediate 
4 
- 
3-4 
XOR SI,00C2H 
memory, immediate 
17+EA 
2 
3-6 
XOR RETURN_CODE,OD2H 


The 8086 and 8088 provide many different ways 
to access instruction operands. 
Operands may be 


contained 
in 
registers, 
within 
the 
instruction 


itself, in memory or in I/O ports. In addition, the 
addresses of memory and I/O port operands can 
be calculated 
in several different 
ways. These 
addressing 
modes greatly extend the flexibility 
and convenience of the instruction 
set. This sec- 


tion 
briefly 
describes 
register 
and 
immediate 


operands and then covers the 8086/8088 
memory 


and I/O addressing modes in detail. 


Instructions 
that specify only register operands 
are 
generally 
the 
most 
compact 
and 
fastest 
executing of all instruction forms. This is because 
the register "addresses" 
are encoded in instruc- 


tions in just a few bits, and because these opera- 
tions are performed entirely within the CPU (no 
bus cycles are run). Registers may serve as source 
operands, destination operands, or both. 


Immediate operands are constant data contained 
in an instruction. 
The data may be either 8 or 16 


bits 
in 
length. 
Immediate 
operands 
can 
be 


accessed 
quickly 
because 
they 
are 
available 


directly from the instruction queue; like a register 
operand, no bus cycles need to be rUn to obtain an 
immediate operand. The limitations of immediate 
operands are that they may only serve as source 
operands and that they are constant values. 


Whereas the EU has direct access to register and 
immediate operands, 
memory operands 
must be 


transferred 
to or from the CPU over the bus. 


When the EU needs to read or write a memory 
operand, it must pass an offset value to the BIU. 
The BIU adds the offset to the (shifted) content of 
a segment register producing 
a 20-bit physical 


address and then executes the bus cycle(s) needed 
to access the operand. 


The offset that the EU calculates for a memory 
operand is called the operand's 
effective address 


or EA. 
It is an unsigned 
l6-bit 
number 
that 


expresses the operand's 
distance in bytes from the 


beginning of the segment in which it resides. The 
EU can calculate the effective address in several 
different 
ways. 
Information 
encoded 
in 
the 


second byte of the instruction tells the EU how to 
calculate the effective address of each memory 
operand. 
A compiler 
or assembler 
derives this 


information 
from 
the statement 
or instruction 
written by the programmer. 
Assembly language 


programmers have access to all addressing modes. 


Figure 
2-34 
shows 
that 
the 
execution 
unit 


calculates the EA by summing a displacement, the 
content of a base register and the content of an 
index register. The fact that any combination 
of 


these three components may be present in a given 
instruction gives rise to the variety of 8086/8088 
memory addressing modes. 
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The 
displacement 
element 
is an 
8- or 
l6-bit 


number that is contained in the instruction. 
The 


displacement generally is derived from the posi- 
tion of the operand name (a variable or label) in 
the program. It also is possible for a programmer 
to modify this value or to specify the displace- 
ment explicitly. 


A programmer may specify that either BX or BP 
is to serve as a base register whose content is to be 
used in the EA computation. 
Similarly, either SI 


or DI may be specified 
as an index register. 


Whereas the displacement value is a constant, the 
contents 
of the base and 
index registers 
may 
change during execution. This makes it possible 
for one instruction 
to access different 
memory 
locations as determined by the current values in 
the base and/or index registers. 


It takes time for the EU to calculate a memory 
operand's effective address. In general, the more 
elements in the calculation, 
the longer it takes. 


Table 2-20 shows how much time is required to 
compute an effective address for any combination 
of displacement, base register and index register. 


Direct addressing (see figure 2-35) is the simplest 
memory addressing 
mode. 
No registers are in- 
volved; the EA is taken directly from the displace- 
ment field of the instruction. 
Direct addressing 


typically 
is 
used 
to 
access 
simple 
variables 


(scalars). 


The effective address of a' memory operand may 
be taken directly from one of the base or index 
registers as shown in figure 2-36. One instruction 
can operate on many different memory locations 
if the value in the base or index register is updated 


appropriately. 
The LEA (load effective address) 
and 
arithmetic 
instructions 
might 
be used to 
change the register value. 


Note that any 16-bit general register may be used 
for register indirect addressing with the JMP or 
CALL instructions. 


In based addressing 
(figure 2-37), the effective 
address is the sum of a displacement value and the 
content of register BX or register BP. Recall that 
specifying BP.as a base register directs the BIU to 
obtain the operand 
from the current stack seg- 


Eli::rr 


. 
~ 
EA 


ment (unless a segment override prefix is present). 
This makes based addressing with BP a very con- 
venient way to access stack data (see section 2.10 
for examples). 


Based addressing also provides a straightforward 
way to address structures which may be located at 
different places in memory (see figure 2-38). A 
base register can be pointed 
at the base of the 
structure and elements of the structure addressed 
by their displacements 
from the base. Different 
copies of the same structure can be accessed by 
simply changing the base register. 
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Figure 2-38. Accessing a Structure 
With Based 
Addressing 


In indexed addressing, 
the effective address 
is 
calculated from the sum of a displacement 
plus 
the content 
of an index register (SI or DI) as 
sho,wn in figure 2-39. Indexed addressing often is 


E!f3-r 


J 
EA 


used to access elements in an array (see figure 
2-40). The displacement 
locates the beginning of 
the array, 
and the value of the index register 
selects one element (the first element is selected if 
the index register 
contains 
0). Since all array 
elements are the same length, simple arithmetic 
on the index register will select any element. 


Based indexed addressing 
generates an effective 
address that is the sum of a base register, 
an 
index 
register 
and 
a displacement 
(see figure 
2-41). Based indexed addressing is a very flexible 
mode because two address components 
can be 
varied at execution time. 
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Figure 2-40. Accessing an Array With Indexed 
Addressing 


Based indexed addressing provides a convenient 
way for a procedure to address an array allocated 
on a stack (see figure 2-42). Register BP can con- 
tain the offset of a reference point on the stack, 
typically the top of the stack after the procedure 
has saved>registers and allocated 
local storage. 
The offset of the beginning of the array from the 
reference point can be expressed by a displace- 
ment value, and an index register can be used to 
access individual array elements. 


Arrays contained in structures and matrices (two- 
dimension 
arrays) 
also could be accessed with 
based indexed addressing. 
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String instructions do not use the normal memory 
addressing 
modes 
to 
access 
their 
operands. 
Instead, the index registers are used implicitly as 
shown in figure 2-43. When a string inst~uctiort is 
executed, SI is assumed to point to the first byte 
or word of the source string, and DI is assumed to 
point to the first byte or w'ord of the destination 
string. In a repeated string operation, 
the CPUs 


automatically adjust SI and DI to obtain subse- 
quent bytes or words. 


If an 110 port is memory mapped, 
any of the 


memory operand addressing modes may be used 
to access the port. For example, a ,group of ter- 
minals can be accessed as an "array." 
String 


instructions also can be used to transfer data to 
memory-mapped ports with an appropriate 
hard- 


ware interface. Section 2.10 contains examples of 
addressing memory-mapped 
110 ports. 
Two different addressing modes can be used to 
access ports located in the 110 space; these are 
illustrated in figure 2-44. In direct port address- 
ing, 
the 
port 
number 
is an 
8-bit 
immediate 
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similar to register indirect addressing of memory 
operands. The port number is taken from register 
DX and can range from 0 to 65,535. By pre- 
viously adjusting the content of register DX, one 
instruction can access any port in the 110 space. 
A group of adjacent ports can be accessed using a 
simple software loop that adjusts the value in DX. 


A comprehensive integrated set of tools supports 
8086/8088 
software development. These tools are 


programs that run on Intellec® 800 or Series II 
Microcomputer 
Development 
Systems under the 


ISIS-II operating system, the same hardware and 
operating system used to develop software for the 
8080 and the 8085. Since the 8086 and 8088 are 
software-compatible 
with one another, 
the same 


tools are used for both 
processors 
to provide 


programmers 
with 
a 
uniform 
development 


environment. 


T 
~ 
ox 


A program 
that will ultimately 
execute on an 
8086- or 8088-based system is developed in steps 
(see figure 2-45). The overall program 
is com- 
posed of functional 
units called modules. 
For 
purposes of this discussion, a module is a section 
of code that is separately 
created, 
edited, 
and 
compiled or assembled. 
A very small program 
might consist of a single module; a large program 
could be comprised of \00 or more modules. The 
8086/8088 
LINK-86 
utility 
binds 
modules 
together 
into 
a single program. 
(The module 
structure of a program is critical to its successful 
development 
and maintenance; 
see section 2.10 
for guidelines.) 


8086 and 8088 modules can be written in either 
PLlM-86 or ASM-86 (see table 2-22). PLlM-86 is 
a 
high-level 
language 
suitable 
for 
most 
microprocessor 
applications. 
It is easy to use, 
even by programmers 
who have little experience 
with microprocessors. 
Because it reduces software 
development time, PL/M-86 
is ideal for most of 
the programming 
in any application, 
especially 
applications that must get to market quickly. 


ASM-86 is the 
8086/8088 
assembly 
language. 
ASM-86 provides the programmer who is familiar 
with the CPU architecture, 
access to all processor 
features. For critical code segments within pro- 
grams that make sophisticated 
use of the hard- 
ware, have extremely demanding performance 
or 
memory constraints, ASM-86 is the best choice. 
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• Fast Development 
• Fastest Execution Speed 


• Less Programmer Training 
• Smallest Memory Requirements 


• Detailed Hardware Knowledge 
Not Required 
• Access To All Processor Facilities 


The languages are completely compatible, 
and a 


judicious 
combination 
of the two often makes 


good sense. Prototype 
software can be developed 


rapidly 
with 
PL/M-86. 
When 
the 
system 
is 


operating 
correctly, 
it can be analyzed 
to see 


which sections can best profit from being written 
in ASM-86. 
Since the logic of these sections 
already has been debugged, selective rewriting can 
be done quickly and with low risk. 


Each 
PLlM-86 
or ASM-86 
module 
(called a 


source moduel) is keyed into the Intellec® system 
using the ISIS-II text editor and is stored as a 
diskette file. This source file is then input to the 
appropriate 
language 
translator 
(ASM-86 


assembler or PL/M-86 
compiler). The language 


translator 
creates a diskette file from the source 


file, which is called a relocatable object module. 
The translator also lists the program and flags any 
errors 
detected 
during 
the 
translation. 
The 


relocatable object module contains the 8086/8088 
machine instructions 
that the translator 
created 


from the statements 
in the source module. The 


term 
"relocatable" 
refers 
to the fact that 
all 


references to memory locations in the module are 
relative, 
rather 
than 
being 
absolute 
memory 
addresses. The module generally is not executable 
until the relative references are changed to the 
actual memory locations where the module will 
reside in the execution system's memory. The pro- 
cess 
of 
changing 
the 
relative 
references 
to 


absolute memory locations is called locating. 


There are very good reasons 
for not locating 


modules when they are translated. 
First, the exe- 


cution system's 
physical memory configuration 


(where RAM and 
ROM/PROM 
segments 
are 


actually located in the megabyte memory space) 
may not be known at the time the modules are 
written. Second, it is desirable to be able to use a 
common module (e.g., a square root routine) in 
more than one system. If absolute addresses were 
assigned at translation 
time, the common module 


would either have to occupy the same physical 


addresses in every system, or separate versions 
with different addresses would have to be main- 
tained for each system. When locating is deferred, 
a single version of a common routine can be used 
by any number of systems. Finally, the locations 
of 
modules 
typically 
change 
as a 
system 
is 


developed, maintained and enhanced. Separating 
the location process from the translation 
process 


means that as modifications are made, unchanged 
modules 
only 
need 
to 
be 
relocated, 
not 


retransla ted. 


Relocatable 
object modules may be placed into 


special files called libraries, 
using the LIB-86 


library 
manager 
program. 
Libraries 
provide 
a 


convenient means of collecting groups of related 
modules so that they can be accessed automati- 
cally by the LINK-86 program. 


When enough 
relocatable 
object modules have 


been created to test the system, or part of it, the 
modules are linked and located. 
Linking com- 
bines all the separate modules into a single pro- 
gram. 
Locating 
changes 
the 
relative 
memory 


references in the program to the actual memory 
locations where the program will be loaded in the 
execution system. The link and locate process also 
is referred to as R & L, for relocation and linkage. 


Two 
other 
programs 
round 
out 
the software 


development 
tools available 
for the 8086 and 


8088. OH-86 converts an absolute object file into 
a hexadecimal format used by some PROM pro- 
grammers and system loaders (for example, the 
SDK-86 and iSBC 957™ loaders). CONV-86 can 
do most 
of the conversion 
work 
required 
to 


translate 
8080/8085 
assembly 
language 
source 


modules into ASM-86 source modules. 


The 8086/8088 
software 
development 
facilities 


are covered in more detail in the remainder of this 
section. However, these are only introductions 
to 


avauaOle In tne tOllowlng publlcauons 
avaIlable 
from Intel's Literature Department: 


ISIS-II: 


ISIS-II System User's Guide, Order No. 9800306 


MCS-86 Assembly 
Language Reference Manual, 
Order No. 9800640 


MCS-86 
Assembler 
Operating 
Instructions 
for 
ISIS-II Users, Order No. 9800641 


PL/M-86 
Programming 
Manual, 
Order 
No. 
9800466 


ISIS-II PL/M-86 
Compiler Operator's Manual, 
Order No. 9800478 


LINK-86, LOC-86, LIB-86, OH-86: 


MCS-86 
Software 
Development 
Utilities 
Operating Instructions 
for ISIS-II 
Users, Order 
No. 9800639 


MCS-86 
Assembly 
Language 
Converter 
Operating Instructions 
for ISIS-II 
Users, Order 
No. 9800642 


PLlM-86 
is 
a 
general-purpose, 
high-level 
language 
for programming 
the 8086 and 8088 
microprocessors. 
It is an extension of PL/M-80, 


the most widely-used, 
high-level programming 
language for microprocessors. 
(PL/M-80 
source 
programs can be processed by the PL/M-86 com- 
piler; the resulting object program 
is generally 
reduced by 15-30010in size.) PLlM-86 
is suitable 
for all types of microprocessor 
software 
from 
operating systems to application programs. 


PL/M-86's 
purpose is simple: to reduce the time 
and cost of developing and maintaining software 
for the 8086 and 8088. It accomplishes this by 
creating a programming environment that, for the 
most part, is distinct from the architecture of the 
CPUs. 
Registers, 
segments, 
addressing 
modes, 
stacks, 
etc., 
are effectively 
"invisible" 
to the 


appear 
to 
respond 
to 
simple 
commands 
and 
familiar algebraic expressions. The responsibility 
for translating 
these source statements 
into the 
machine instructions 
ultimately required to exe- 
cute on the 8086/8088 
is assumed by the PLlM-86 
compiler. By "hiding" 
the details of the machine 
architecture, 
PL/M -86 encourages programmers 


to concentrate 
on solving the problem at hand. 


Furthermore, 
because 
PL/M-86 
is closer 
to 
natural 
language, 
it 
is 
easier 
to 
"think 
in 
PLlM-86" 
than 
it is to 
"think 
in assembly 


language." 
This speeds up the expression 
of a 


program solution, and, equally important, 
makes 


that solution easier for someone other than the 
original 
programmer 
to understand. 
PL/M-86 
also contains 
all the constructs 
necessary 
for 
structured programming. 


A programmer 
builds a PLlM-86 
program 
by 
writing 
statements 
and 
comments 
(see figure 
2-46). 
There 
are 
several 
different 
types 
of 
statements in PLlM-86; 
they always end with a 


semicolon. 
Blanks 
can be used 
freely before, 


within, 
and after 
statements 
to improve 
read- 
ability. A statement also may span more than one 
line. 


The characters 
"1*" 
start a comment, 
and the 


characters 
"*1" 
end it; any characters 
may be 


used in between. Comments do not affect the exe- 
cution of a PLlM-86 program, 
but all good pro- 
grams are thoughtfully 
commented. 
Comments 


are notes that document and clarify the program's 
operation; they may be written virtually anywhere 
in a PLlM-86 program. 


Most PLlM-86 
programs 
begin by defining the 


data items (variables) with which they are going to 
work. An individual 
PL/M-86 
data element is 


called 
a 
scalar. 
Every 
scalar 
variable 
has 
a 


programmer-supplied 
name up to 31 characters 


long, and a type. PL/M-86 supports five types of 
scalars: byte, word, 
integer, 
real, and pointer. 


Table 
2-23 
lists 
the 
characteristics 
of 
these 
PLlM-86 data types. 


'"TRAFFIC 
DATA RECORDER 
CONTROL 
PROGRAM" 
"VERSION 
2.2, RELEASE 
5, 23APR79." 


"THIS RELEASE 
FIXES THREE BUGS" 


"DOCUMENTED 
IN PROBLEM 
REPORT #16."' 


'"COMPUTE 
TOTAL PAYMENT DUE"' 
TOTAL = PRINCIPAL 
+ INTEREST; 


IF TERMINAL$READY 
THEN CALL FILL$BUFFER; 
ELSE CALL WAIT (50); 
'"WAIT 50 MS FOR RESPONSE"' 


TYPE 
BYTES 
RANGE 
USAGE 


BYTE 
1 
o to 255 
Unsigned 
Integer, 
Character 


WORD 
2 
o to 65,535 
Unsigned 
Integer 


INTEGER 
2 
-32,768 to 
Signed 
Integer 
+32,767 


REAL 
4 
1 x 10-38 to 
Floating Point 
3.37 x 10+38 


POINTER 
2'4 
N'A 
Address 
Manipulation 


Variables 
are 
defined 
by writing 
a DECLARE 


statement 
of this form: 


Options 
of the DECLARE 
statement 
can be used 


to specify 
an initial 
value 
for 
the scalar 
and 
to 


define a series of items in a shorthand 
form. 


Besides 
scalar 
variables, 
scalar 
constants 
may be 
used 
in 
PLlM-86 
programs 
(see 
figure 
2-47). 


Constants 
may be written 
"as is" or may be given 


names to improve 
program 
clarity. 


Scalars 
can be aggregated 
into named 
collections 
of data such as arrays 
and structures. 
An array 
is 


a 
collection 
of 
scalars 
of 
the 
same 
type 
(all 


integer, 
all 
real, 
etc.). 
Arrays 
are 
useful 
for 


representing 
data that has a repetitive 
nature. 
For 


example, 
monthly 
rainfall 
samples 
could 
be 


represented 
as an array 
of 
12 elements, 
one 
for 


each month: 


Each 
element 
in 
an 
array 
is 
accessible 
by 
a 
number 
called 
a subscript 
which 
is the element's 


relative 
location 
in the 
array. 
In PL/M-86, 
the 
first element 
in an array 
has a subscript 
of 0; it is 


considered 
the "Oth" 
element. 
Thus, 
RAINFALL 
(11) refers 
to December's 
sample. 
The 
subscript 


need not be a constant; 
variables 
and expressions 


also may be used as subscripts. 


Strings 
of character 
data 
are typically 
defined 
as 


byte 
arrays. 
Characters 
can 
be 
accessed 
with 
subscripts 
or with powerful 
string-handling 
func- 
tions built into PL/M-86. 


10 
OAH 
120 
00001010B 
10.0 
1.0E1 


'A' 


'"DECIMAL 
NUMBER"' 
'"HEXADECIMAL 
NUMBER"' 
'"OCTAL 
NUMBER"' 
'"BINARY 
NUMBER"' 
'"FLOATING 
POINT NUMBER"' 
'"FLOATING 
POINT NUMBER*' 
'"CHARACTER"' 


'"CONSTANTS 
MAY BE GIVEN NAMES"' 
DECLARE 
STATUS$PORT 
LITERALLY 
'OFFEH'; 
DECLARE 
THRESHOLD 
LITERALLY 
'98.6'; 


A structure is a collection of related data elements 
that do not necessarily have the same type. The 
elements are related by virtue of "belonging" 
to 


the entity represented by the structure. 
Here is a 


simple structure declaration: 


(SPAN 


YR$BUILT 


The year the bridge was built could be accessed by 
writing BRIDGE. YR$BUIL T; the structure ele- 
ment name is "qualified" 
by the dot and the 
structure 
name. This allows structures 
with the 


same element names 
to be distinguished 
from 


each other (e.g., HIGHWAY.YR$BUILT). 


Arrays and structures can be combined into more 
complex data aggregates: 


• 
array elements may be structures rather than 
scalars, 


• 
a structure element may be an array, 


• 
structures 
in arrays may themselves contain 


arrays. 


Figure 
2-48 
provides 
sample 
PLlM-86 
data 


declarations. 


Data that has been defined can be operated on 
with PLlM-86 
executable 
statements. 
The fun- 


damental executable statement 
is the assignment 
statement, written in this form: 


This means "evaluate 
the expression and assign 
(move) the result to the variable." 


There are three basic classes of expressions 
in 


PLlM-86; 
arithmetic, 
relational 
and logical (see 


table 2-24 and figure 2-49). All expressions are 
combinations 
of 
operands 
and 
operators, 
although 
an expression 
can consist of a single 


operand. 
Operands 
are variables and constants; 


operators 
vary according 
to the type of expres- 
sion. Evaluation of an expression always yields a 
single result; different classes of expressions yield 
different types of results. 


EXPRESSION 
OPERATORS 
RESULT 


ARITHMETIC 
+, -,", 
I, MOD 
NUMBER 


RELATIONAL 
>,<,=,>=,<= 
"TRUE"-FFH 
"FALSE"-OH 


LOGICAL 
AND, OR, XOR, NOT 
8/16-BIT STRING 


,····SCALARS····' 
DECLARE SWITCH 
DECLARE COUNT 
INDEX 
DECLARE (NET, GROSS, 


BYTE; 
WORD, 
INTEGER; 
TOTAL) 
REAL; 


'·1 SCALAR*' 
'·1 SCALAR*' 
'·3 SCALARS·' 


, •••• ARRAyS····, 
DECLARE MONTH (12) 
BYTE; 


DECLARE TERMINAL_LINE 
(80) 


'····STRUCTURE····' 
DECLARE EMPLOYEE STRUCTURE 
(ID_NUMBER 
DEPARTMENT 
RATE 


WORD, 
BYTE 
REAL); 


, •••• ARRAY OF STRUCTURES····' 
DECLARE INVENTORY_ITEM 
(100) 
STRUCTURE 
(PART_NUMBER 
WORD, 
ON_HAND 
WORD, 
RE_ORDER 
BYTE); 


, •••• ARRAY WITHIN STRUCTURE····' 
DECLARE COUNTY_DATA 
STRUCTURE 
(NAME (20) 
BYTE, 


TEN_ YR_RAINFALL(10) 
BYTE, 


PER CAPITA_INCOME 
REAL); 


,. ARITH METIC· , 


A = 2; B = 3; 
B = B+ 1; 
C=(A·B)-2; 
C = ((A· B) + 3) MOD 3; 


'·RELATIONAL 
., 
A=2; B=3 
C= B> A; 
C= B<>A; 
C = B = (A+1); 


'·LOGICAL·' 
A = 0011$0001B; 
B = 1000$0001B; 
C= NOT B; 
C = A AND B; 
C = A OR B; 
C= BXORA; 
C = (A AND B) OR OFOH; 


'·B CONTAINS 4*' 
'·C CONTAINS 6·' 
'·C CONTAINS 2·' 


'·C CONTAINS OFFH·' 
'·C CONTAINS OFFH·' 
'·C CONTAINS OFFH*' 


'·C CONTAINS 0111$1110B·' 
'·C CONTAINS 0000$0001B·' 
'*C CONTAINS 1011$0001B·' 
'*C CONTAINS 1011$0000B·' 
'·C CONTAINS 1111$0001B·' 


Simple PL/M-86 
programs 
can be written with 
just DECLARE and assignment statements. Such 
programs, 
however, 
execute 
exactly 
the same 
sequence of statements 
every time they are run 
and would not prove very useful. PL/M-86 
pro- 
vides statements that change the flow of control 
through a program. 
These statements allow sec- 
tions of the program 
to be executed selectively, 
repeated, skipped entirely, etc. 


The IF statement (figure 2-50) selects one or the 
other of two statements for execution depending 
on the result of a relational 
expression. 
The IF 
statement is written: 


Statement 1 is executed if the expression is "true"; 
statement2 is not executed in this case. If the rela- 
tion is "false," 
statement 1 is skipped and state- 
ment2 is executed. In determining the "truth" 
of 
an expression, the IF statement only examines the 
low-order bit of the result (1="true"). 
Therefore, 
arithmetic 
and logical expressions 
also may be 
used in an IF statement. 


A = 3; B = 5; 
IFA< 
B 
THEN MINIMUM 
= 1; 


ELSE MINIMUM 
= 2; 


MORE_DATA 
= OFFH; 
IF NOT MORE 
DATA 
THEN DONE = 1; 
ELSE DONE = 0; 


/'EXECUTED'/ 
/'SKIPPED'/ 


/'SKIPPED'/ 
/'EXECUTED'/ 


/'NESTED 
IF STATEMENTS'/ 
CLOCK_ON 
= 1; HOUR=24; ALARM=OFF; 
IF CLOCK_ON 
THEN IF HOUR = 24 
THEN IF ALARM = OFF 
THEN HOUR = 0; /'EXECUTED'/ 


A DO block begins with a DO statement and ends 
with 
an 
END 
statement. 
All 
intervening 
statements are part of the block. A DO block can 
appear anywhere in a program that an executable 
statement can appear. There are four kinds of DO 
statements in PLlM-86: 
simple DO, DO CASE, 
interative DO, and DO WHILE. 


A simple DO statement (figure 2-51) causes all the 
statements in the block to be treated as though 
they were a single statement. Simple DOs enable a 
single IF statement 
to cause multiple statements 
to be executed (the alternative would be to repeat 
the 
IF 
statement 
for 
every 
statement 
to 
be 
executed). 


/'SIMPLE 
DO'/ 
A=5; B=9; 
IF(A+2)< 
BTHEN 
DO; 
X=X-1; 
Y(X)=O; 
END; 
ELSE 
DO; 
X=X+1; 
Y(X)=1; 
END; 


/'EXECUTED'/ 
/'EXECUTED'/ 


/'SKIPPED'/ 
/'SKIPPED" 


"DOCASE"/ 
A=2; 
DO CASE (A); 
X=X+1; 
X=X+2; 
X=X+3; 
X=X+4; 
END; 
Figure 2-51. PLlM-86 
Simple DO 
and DO CASE 


'"SKIPPED"/ 
'"SKIPPED"' 
'"EXECUTED"/ 
'"SKIPPED"' 


DO CASE (figure 2-51) causes one statement in 
the DO block to be selected and executed depend- 
ing on 
the 
result 
of 
the 
expression 
(usually 
arithmetic) 
written 
immediately 
following 
DO 
CASE: 


If the expression yields 0, the first statement in the 
DO block is executed; if the expression yields 1, 
the second statement is executed, etc. A statement 
in the DO block may be null (consist of only a 
semicolon) to cause no action for selected cases. 
DO CASE provides a rapid and easily-understood 
way to respond to data like "transaction 
codes" 


where a different 
action is required for each of 


many values a code might assume (an alternative 
would be an IF statement for every value the code 
could assume). 


An iterative DO block (figures 2-52 and 2-53) is 
executed from 0 to an infinite number of times 
based on the relationship of an index variable to 
an expression 
that 
terminates 
execution. 
The 


general form is: 


The "BY step-expr" 
is optional, 
and the step is 
assumed to be 1 if not supplied (the typical case). 
When control 
first reaches the DO statement, 


start-expr is evaluated and is assigned to index. 
Then index is compared 
to stop-expr; 
if index 


exceeds stop-expr, 
control goes to the statement 


following the DO block, otherwise the block is 
executed. At the end of the block, the result of 
step-expr is added to index, and it is compared to 


stop-expr again, etc. (The iterative DO is quite 
flexible-this 
is 
a 
simplified 
explanation.) 


Iterative DOs are handy for "stepping 
through" 


an array. For example, an array of 10 elements 
could be zeroed by: 


DO I =OT09; 


ARRA Y(I) = 0; 


END; 


In a DO WHILE 
(figures 2-52 and 2-54), the 
statements are executed repeatedly as long as the 
expression 
following 
WHILE 
evaluates 
to 


"true." 
DO WHILE 
often 
can be applied 
in 
situations where an interative DO will not work, 
or is clumsy, such as where repetition 
must be 
controlled 
by 
a 
non-integer 
value. 
Like 
an 


iterative DO, DO WHILE may be executed from 
o times to an infinite number of times. 


'"ITERATIVE DO"' 
DO 1=OT05; 
ARRAY (I) = I; 
TOTAL = TOTAL +1; 
END; 


'"1=6ATTHIS 
POINT*' 


'"EXECUTED 6 TIMES"' 
'"EXECUTED 6 TIMES"' 


'"DO WHILE"' 
MORE = 0; SPACE_OK 
=1; 


DO WHILE (MORE AND SPACE_OK); 
ITEMS = ITEMS + 1; 
'"SKIPPED"' 
N 
TRACKS = 
N_ TRACKS + 10; 
'"SKIPPED*' 
IF N_ TRACKS >= 999 
'"SKIPPED*' 
THEN SPACE_OK 
= 0; 
END; 


'"DOWHILE"' 
CODE = 'A'; 
DO WHILE (CODE = 'A'); 


TEMP = TEMP * STEP; 
'*EXECUTION STOPS* I 
IFTEMP>98.6 
I*AFTERTEMP*I 
THEN CODE = '8'; 
I*EXCEEDS 98.6"1 
N_STEPS = N_STEPS 
+ 1; 


END; 


causes 
an 
unconditional 
transfer 
(branch) 
to 
another statement in the program. The statement 
receiving control would be written 


where 
"target" 
is 
a 
label 
identifying 
the 


statement. 


activates a procedure 
defined earlier in the pro- 


gram. 
The variables 
listed in "parm-list" 
are 


passed 
to 
the 
procedure, 
the 
procedure 
is 


executed, and then control returns to the state- 
ment following the CALL. Thus, unlike a GOTO, 
a CALL 
brings control 
back 
to the point 
of 


departure. 


Procedures 
are 
"subprograms" 
that 
make 
it 


possible to simplify the design of complex pro- 
grams and to share a single copy of a routine 
among programs. A procedure usually is designed 
to perform one function; i.e., to solve one part of 
the total problem with which the program is deal- 
ing. 
For 
example, 
a 
program 
to 
calculate 


paychecks could be broken down into separate 
procedures for calculating gross pay, income tax, 
Social Security and net pay. The organization 
of 


the "main" 
program then could be understood at 


a glance: 


CALL GROSS_PAY; 
CALL INCOME_TAX; 
CALL SOCIAL_SECURITY; 
CALL NET_PAY; 


Furthermore, 
the income tax procedure could be 
divided into separate procedures 
for calculating 


state and federal taxes. Procedures, 
then, provide 


a mechanism by which a large, complex problem 
can be attacked 
with a "divide 
and conquer" 
strategy. 


A procedure usually is defined early in a program, 
but it is only executed when it is referred to by 
name in a later PL/M-86 
statement. A procedure 


can accept a list of variables, called parameters, 
that it will use in performing 
its function. These 


parameters may assume different values each time 
the procedure is executed. 


PL/M-86 
provides 
two classes of procedures, 
typed and untyped. A typed procedure returns a 
value to the statement 
that activates it and, in 


addition, may accept parameters 
from that state- 
ment. A typed procedure is activated whenever its 
name appears in a statement; the value it returns 
effectively takes the place of the procedure name 
in the statement. Typed procedures can be used in 
all kinds of PL/M-86 
expressions. Untyped pro- 


cedures may accept parameters, 
but do not return 


a value. 
Untyped 
procedures 
are activated 
by 


CALL statements. Figure 2-55 shows how simple 
typed and untyped procedures 
may be declared 
and then activated. 


The statements forming the body o( a procedure 
need not exist within the module that activates the 
procedure. The activating module can declare the 
procedure EXTERNAL, 
and the LINK-86 utility 
will connect the two modules. 


PLlM-86 
procedures 
can be written 
to handle 


interrupts. 
Procedures 
also 
may 
be 
declared 


REENTRANT, 
making them concurrently usable 


by different 
tasks 
in 
a 
multitasking 
system. 


PLlM-86 also has about 50 procedures built into 
the language, including facilities for: 


• 
converting variables from one type to another 


• 
shifting and rotating bits 


• 
performing input and output 


• 
manipulating strings 


• 
activating the CPU LOCK signal. 


I'DECLARATION 
OF A TYPED PROCEDURE THAT 
ACCEPTS TWO REAL PARAMETERS AND RETURNS A REAL VALUE'I 
AVG: PROCEDURE (X,Y) REAL; 
DECLARE (X,Y) REAL; 
RETURN (X+Y)/2.0; 
ENDAVG; 


I' ACTIVATING 
A TYPED PROCEDURE'I 
LOW =2.0; 
HIGH =3.0; 
TOTAL = TOTAL + AVG (LOW,HIGH); 
1'2.5IS 
ADDEDTO TOTAL'I 


I'DECLARATION 
OF AN UNTYPED PROCEDURE 
THAT ACCEPTS ONE PARAMETER'I 
TEST: PROCEDURE (X); 
DECLARE X BYTE; 
IF X =OH THEN 
COUNT = COUNT + 1; 


END TEST; 


1 'ACTIVATING 
AN UNTYPED PROCEDURE'I 
CALL TEST (ALPHA); 
I'COUNT 
IS INCREMENTED 
IF ALPHA = 0'1 


Programmers 
who are familiar 
with the CPU 


architecture can obtain complete access to all pro- 
cessor facilities with ASM-86. Since the execution 
unit on both the 8086 and the 8088 is identical, 
both processors use the same assembly language. 
Examples 
of processor 
features 
not accessible 


through PLlM-86 that can be utilized in ASM-86 
programs include: software interrupts, the WAIT 
and ESC instructions 
and explicit control of the 


segment registers. 


An ASM-86 program 
often 
can be written 
to 


execute faster and/or 
to use less memory than the 


same 
program 
written 
in 
PLlM-86. 
This 
is 
because the compiler has a limited "knowledge" 
of 
the 
entire 
program 
and 
must 
generate 
a 


generalized set of machine instructions 
that will 


work in all situations, but may not be optimal in a 
particular situation. For example, assume that the 
elements of an array are to be summed and the 
result 
placed 
in a variable 
in memory. 
The 


machine instructions 
generated by the PL/M-86 


compiler would move the next array element to a 
register and then add the register to the sum 
variable in memory. 
An ASM-86 programmer, 


knowing that a register will be "safe" 
while the 


array is summed, could instead add all the array 
elements to a register and then move the register 
to the sum variable, saving one instruction execu- 
tion per array element. 


It is easier to write assembly language programs in 
ASM-86 than it is in many assembly languages. 
ASM-86 
contains 
powerful 
data 
structuring 
facilities that are usually found only in high-level 


languages. ASM-86 also simplifies the program- 
mer's "view" 
of the 8086/8088 machine instruc- 
tion set. For example, although there are 28 dif- 
ferent types of MOV machine instructions, 
the 


programmer 
always writes a single form of the 


instruction: 


The assembler 
generates 
the correct 
machine- 
instruction 
form based on the attributes 
of the 


source and destination 
operands 
(attributes 
are 


covered later in this section). Finally, the ASM-86 
assembler performs extensive checks on the con- 
sistency of operand definition versus operand use 
in instructions, 
catching many common types of 


clerical errors. 


Compared to many assemblers, ASM-86 accepts a 
relaxed statement 
format (see figure 2-56). This 


helps to reduce clerical errors and allows pro- 
grammers 
to format 
their programs 
for better 


readability. Variable and label names may be up 
to 31 characters 
long and are not restricted to 


alphabetic and numeric characters. 
In particular, 


the underscore (_) 
may be used to improve the 


readability of long names. Blanks may be inserted 
freely between identifiers (there are no "column" 
requirements), 
and 
statements 
also 
may span 


multiple lines. 


All ASM-86 statements are classified as instruc- 
tions or directives. A clear distinction 
must be 


made 
here 
between 
ASM-86 
instructions 
and 


MOV 
AX, [BX + 3] 
MOV AX, 
[BX + 3] 
MOV 
AX, 
& 
[BX+3] 


ZERO 
EQU 
0 
CUR_PROJ 
EQU 
PROJECT [BX] [SI] 
THE_STACK_STARTS_HERE 
SEGMENT 
TIGHT_LOOP: 
JMP TIGHT_LOOP 
MOV 
ES: DATA_STRING 
[SI], AL 
WAIT: 
LOCK XCHG 
AX,SEMAPHORE 


; TYPICAL ASM-86 INSTRUCTION 
; BLANKS NOT SIGNIFICANT 


; CONTINUED STATEMENTS 


; SIMPLE ASM-86 DIRECTIVE 
; MORE COMPLEX DIRECTIVE 
; LONG IDENTIFIER 
; LABELLED STATEMENT 
; SEGMENT OVERRIDE PREFIX 
; LABEL & LOCK PREFIX 


8086/8088 
machine instructions. 
The assembler 
generates 
machine 
instructions 
from 
ASM-86 
instructions 
written 
by 
a 
programmer. 
Each 
ASM-86 
instruction 
produces 
one 
machine 
instruction, 
but 
the 
form 
of 
the 
generated 
machine instruction 
will vary according 
to the 
operands written in the ASM-86 instruction. 
For 
example, writing 


produces 
a 
byte-immediate-to-register 
MOV, 
while writing 


produces 
a word-register-to-memory 
MOV. To 
the programmer, 
though, there is simply a MOV 
source-to-destination 
instruction. 


where parentheses 
denote 
optional 
fields (the 
parentheses are not actually written by program- 
mers). The label field names the storage location 
containing the machine instruction so that it can 
be referred to symbolically as the target of a JMP 
instruction elsewhere in the program. 
Writing a 
prefix causes ASM-86 to generate 
one of the 
special prefix bytes (segment override, bus lock or 
repeat) 
immediately 
preceding 
the 
machine 
instruction. 
The mnemonic identifies the type of 
instruction (MOV for move, ADD for add, etc.) 
that is to be generated. Zero, one or two operands 
may be written 
next, 
separated 
by commas, 
according to the requirements 
of the instruction. 
Finally, writing a semicolon signifies that what 
follows is a comment. 
Comments 
do not affect 
the execution of a program, 
but they can greatly 


improve its clarity; all good ASM-86 programs 
are thoughtfully commented. 


Writing a directive gives ASM-86 information 
to 
use in generating instructions, 
but does not itself 
produce 
a machine 
instruction. 
About 
20 dif- 
ferent directives are available in ASM-86. Direc- 
tives are written like this: 


Some directives require 
a name to be present, 
while others prohibit a name. ASM-86 recognizes 
the directive from the mnemonic keyword written 
in the next field. Any operands 
required by the 
directive are written next, separated by commas. 
A comment may be written as the last field of a 
directive. 


Some of the 
more 
commonly 
used directives 
define procedures 
(PROC), 
allocate storage for 
variables (DB, DW, DD) give a descriptive name 
to a number or an expression (EQU), define the 
bounds 
of segments (SEGMENT 
and ENDS), 
and force instructions 
and data to be aligned at 
word boundaries (EVEN). 


Binary, decimal, octal and hexadecimal numeric 
constants 
(see figure 2-57) may be written 
in 
ASM-86 statements; 
the assembler can perform 
basic arithmetic operations 
on these as well. All 
numbers must, however, be integers and must be 
representable 
in 
16 bits including 
a sign bit. 


Negative 
numbers 
are 
assembled 
in standard 
two's complement notation. 


Character constants are enclosed in single quotes 
and may be up to 255 characters long when used 


MOV 
MOV 
ADD 
OCTAL_8 
OCTAL_9 
ALL_ONES 
MINUS_5 
MINUS_6 


STRING [SI], 'A' 
STRING [SI], 41H 
AX,OC4H 
EQU 
100 
EQU 
10Q 
EQU 
111111118 
EQU 
-5 
EQU 
-60 


; CHARACTER 
; EQUIVALENT 
IN HEX 
; HEX CONSTANT MUST START WITH NUMERAL 
; OCTAL 
; OCTAL ALTERNATE 
; BINARY 
; DECIMAL 
; DECIMAL ALTERNATE 


to initialize storage. 
When used as immediate 
operands, character constants may be one or two 
bytes long to match the length of the destination 
operand. 


Most ASM-86 programs 
begin by defining 
the 
variables with which they will work. Three direc- 
tives, DB, DW and DD, are used to allocate and 
name 
data 
storage 
locations 
in ASM-86 
(see 
figure 2-58). The directives are used to define 
storage 
in 
three 
different 
units: 
DB 
means 
"define byte," 
DW means "define 
word," 
and 
DD means "define 
doubleword." 
The operands 
of these directives tell the assembler how many 
storage units to allocate and what initial values, if 
any, with which to fill the locations. 


A_SEG 
ALPHA 
BETA 
GAMMA 
DELTA 
EPSILON 
A_SEG 


B_SEG 
IOTA 
KAPPA 
LAMBDA 
MU 
B_SEG 


SEGMENT 
DB 
ow 
DO 
DB 
ow 
ENDS 


; NOT 
INITIALIZED 
; NOT 
INITIALIZED 
; NOT 
INITIALIZED 
; NOT 
INITIALIZED 
; CONTAINS 
05H 


SEGMENT 
AT 55H ; SPECIFYING 
BASE 
ADDRESS 
DB 
'HELLO' 
; CONTAINS 
48 45 4C 4C 4F H 
ow 
'AB' 
; CONTAINS 
42 41 H 
DO 
B_SEG 
; CONTAINS 
0000 5500 H 
DB 
100 DUP 0 
; CONTAINS 
(100 X) OOH 
ENDS 


ATTRIBUTES 
OPERATORS 


VARIABLE 
SEGMENT 
OFFSET 
TYPE 
LENGTH 
SIZE 


ALPHA 
A_SEG 
0 
1 
1 
1 
BETA 
A_SEG 
1 
2 
1 
2 
GAMMA 
A_SEG 
3 
4 
1 
4 
DELTA 
A_SEG 
7 
1 
1 
1 
EPSILON 
A_SEG 
8 
2 
1 
2 
IOTA 
B_SEG 
0 
1 
5 
5 
KAPPA 
B_SEG 
5 
2 
1 
2 
LAMBDA 
B_SEG 
7 
4 
1 
4 
MU 
B_SEG 
11 
1 
100 
100 


For every variable in an ASM-86 program, 
the 
assembler 
keeps track 
of three attributes: 
seg- 
ment, offset and type. Segment identifies the seg- 
ment that contains the variable (segment control 
is covered shortly). Offset is the distance in bytes 
of the variable from the beginning of its contain- 


ing segment. Type identifies the variable's alloca- 
tion unit (I = byte, 2 = word, 4 = doubleword). 
When a variable is referenced in an instruction, 
ASM-86 uses these attributes 
to determine what 
form 
of 
the 
instruction 
to 
generate. 
If 
the 
variable's attributes 
conflict with its usage in an 
instruction, 
ASM-86 prodUces an error message. 
For example, attempting to add a variable defined 
as a word to a byte register is an error. There are 
cases where the assembler must be explicitly told 
an operand's 
type. For example, writing MOVE 
[BX],5 will produce an error message because the 
assembler does riot know if [BX] refers to a byte, 
a word or a doubleword. 
The following operators 
can be used to provide this information: 
BYTE 
PTR, WORD PTR and DWORD 
PTR. 
In the 
p,revious example, a word could be moved to the 
location 
referenced 
by [BX] by writing MOVE 
WORD PTR [BX],5. 


ASM-86 also provides 
two built-in 
operators, 
LENGTH 
and 
SIZE, 
that 
can be written 
in 
ASM-86 
instructions 
along 
with 
attribute 
information. 
LENGTH 
causes the assembler to 
return the number of storage units (bytes, words 
or doublewords) 
occupied 
by an array. 
SIZE 
causes ASM-86 to return 
the total 
number 
of 
bytes occupied by a variable or an array. These 
operators and attributes make it possible to write 
generalized instruction sequences that need not be 
changed (only reassembled) if the attributes of the 
variables change (e.g., a byte array is changed to a 
word array). See figure 2-59 for an example of 
using the attributes and attribute operators. 


ASM-86 provides a means of symbolically defin- 
ing individual bits and strings of bits within a byte 
or a word. Such a definition 
is called a record, 
and each named bit string (which may consist of a 
single bit) in a record is called a field. Records 
promote efficient use of storage while at the same 
time improving 
the readability 
of the program 
and reducing 
the likelihood 
of clerical errors. 


Defining 
a record 
does 
not 
allocate 
storage; 
rather, 
a record 
is a template 
that 
tells the 
assembler the name and location of each bit field 
within the byte or word. When a field name is 
written later in an instruction, 
ASM-86 uses the 
record to generate an immediate mask for instruc- 
tions like TEST, AND, OR, etc., or an immediate 
count for shifts and rotates. See figure 2-60 for an 
example of using a record. 


; SUM THE CONTENTS OF TABLE INTO AX 
TABLE 
DW 
50 
DUP(?) 


; NOTE SAME INSTRUCTIONS 
WOULD WORK FOR 


; TABLE 
DB 
25 
DUP(?) 


; TABLE 
DW 
118 
DUP(?), 
ETC. 


SUB 
MOV 
MOV 


ADD_NEXT: 
SUB 
SI, TYPE TABLE 
ADD 
AX, TABLE (SI] 
LOOP 
ADD_NEXT 


; AX CONTAINS SUM 


AX,AX 
; CLEAR SUM 
CX, LENGTH TABLE; 
LOOP TERMINATOR 
SI, SIZE TABLE 
;POINT SUBSCRIPT 
; TO END OF TABLE 
; BACK UP ONE ELEMENT 
; ADD ELEMENT 
; UNTILCX =0 


EMP_BYTE 
DB 
? 
; 1 BYTE, UNINITIALIZED 


; BIT DEFINITIONS: 
7-2 
: YEARS EMPLOYED 
1 
: SEX (1 = FEMALE) 
o 
: STATUS (1 = EXEMPT) 
EMP_BITSRECORD 
;RECORD DEFINED HERE 
& 
YRS_EMP: 
6, 


& 
SEX: 1, 
& 
STATUS: 
1 


; SELECT NONEXEMPT 
FEMALES EMPLOYED10+YEARS 


MOV 
AL, EMP_BYTE 
; KEEP ORIGINAL INTACT 
TEST 
AL, MASK SEX 
; FEMALE? 
JZ 
REJECT 
; NO, QUITE 
TEST 
AL, MASK STATUS 
; NONEXEMPT? 
JNZ 
REJECT 
;NO, QUIT 
SHR 
AL, CL 
; ISOLATE YEARS 
CM P 
AL, 11 
; >=10 YEARS? 
JL 
REJECT 
; NO, QUIT 
; PROCESS SELECTED EMPLOYEE 


; RECORD USED HERE 
; GET SHIFT COUNT 


An ASM-86 structure is a map, or template, that 
gives names and attributes (length, type, etc.) to a 
collection of fields. Each field in a structure 
is 
defined 
using 
DB, 
DW 
and 
DD 
directives; 
however, no storage is allocated to the structure. 
Instead, the structure becomes associated with a 
particular 
area of memory when a field name is 
referenced 
in an instruction 
along with a base 


value. The base value "locates" 
the structure; it 


may be a variable name or a base register (BX or 
BP). 
The 
structure 
may 
be 
associated 
with 


another area of memory by specifying a different 
base value. Figure 2-61 shows how a simple struc- 
ture may be defined and used. Note that a struc- 
ture field may itself be a structure, allowing much 
more 
complex 
organizations 
to 
be laid 
out. 


Structures 
are particularly 
useful 
in situations 


where the same storage format is at multiple loca- 
tions, 
where 
the 
location 
of 
a collection 
of 


variables 
is not 
known 
at assembly-time, 
and 


where the location 
of a collection 
of variables 
changes during execution. 
Applications 
include 


multiple buffers for a single file, list processing 
and stack addressing. 


Figure 2-62 provides sample ASM-86 coding for 
each of the 
8086/8088 
addressing 
modes. 
The 


assembler interprets a bracketed reference to BX, 
BP, SI or DI as a base or index register to be used 
to construct 
the effective address of a memory 


operand. 
An unbracketed 
reference 
means the 


register itself is the operand. 


The following 
cases illustrate 
typical 
ASM-86 


coding for accessing arrays and structures, 
and 


show 
which 
addressing 
mode 
the 
assembler 
specifies in the machine instruction it generates: 


• 
If ALPHA is an array, then ALPHA 
[SI] is 


the element 
indexed 
by SI, and ALPHA 


[SI + 1] is the following byte (indexed). 


• 
If ALPHA is the base address of a structure 
and BETA is a field in the structure, 
then 


ALPHA.BETA 
selects 
the 
BETA 
field 


(direct). 


• 
If register BX contains the base address of a 
structure and BETA is a field in the struc- 
ture, then [BX].BET A refers to the BETA 
field (based). 


EMPLOYEE 
STRUC 
SSN 
DB 
9 
DUP(?) 
RATE 
DB 
1 
DUP(?) 
DEPT 
OW 
1 
DUP(?) 
YR_HIRED 
DB 
1 
DUP(?) 
EMPLOYEE 
ENDS 


MASTER 
DB 
12 
DUP(?) 
TXN 
DB 
12 
DUP(?) 


; CHANGE RATE IN MASTER TO VALUE IN TXN. 


MOV 
AL, TXN.RATE 
MOV 
MASTER~RATE, AL 


; ASSUME BX POINTS TO AN AREA CONTAINING 
DATA IN THE SAME FORMAT ASTHE 
EMPLOYEE 
STRUCTURE. 
ZERO THE SECOND DIGIT 
OFSSN 
MOV 
SI,1; 
INDEX VALUE OF 2ND DIGIT 
MOV 
[BX).SSN[SI],O 


ADD 
ADD 
ADD 
ADD 
ADD 
ADD 
ADD 
ADD 
ADD 
ADD 
ADD 
ADD 
IN 
OUT 


AX,BX 
AL,5 
CX,ALPHA 
ALPHA,6 
ALPHA,DX 
BL, [BX] 
[SI], BH 
[PP].ALPHA, 
AH 
CX, ALPHA [SI] 
ALPHA [DI+2], 10 
[BX].ALPHA 
[SI], AL 
SI, [BP+4] [DI] 
AL,30 
DX,AX 


; REGISTER +- REGISTER 
; REGISTER +-IMMEDIATE 
; REGISTER +- MEMORY (DIRECT) 
; MEMORY (DIRECT) +-IMMEDIATE 
; MEMORY (DIRECT) +- REGISTER 
; REGISTER +- MEMORY (REGISTER INDIRECT) 
; MEMORY (REGISTER INDIRECT) +-IMMEDIATE 
; MEMORY (BASED) +- REGISTER 
; REGISTER +- MEMORY (INDEXED) 
; MEMORY (INDEXED) +-IMMEDIATE 
; MEMORY (BASED INDEXED) +- REGISTER 
; REGISTER +- MEMORY (BASED INDEXED) 
; DIRECT PORT 
; INDIRECT PORT 


• 
If register BX contains 
the address of an 


array, then [BX] [SI] refers to the element 
indexed by SI (based indexed). 


• 
If register BX points to a structure 
whose 


ALPHA 
field 
is 
an 
array, 
then 
[BX] 


.ALPHA 
[SI] selects the element indexed by 


SI (based indexed). 


• 
If register BX points to a structure 
whose 


ALPHA 
field 
is itself 
a 
structure, 
then 
[BX].ALPHA.BETA 
refers 
to the 
BETA 
field of the ALPHA substructure (based). 


• 
If register BX points to a structure and the 
ALPHA field of the structure is an array and 
each element of ALPHA is a structure, then 
[BX].ALPHA[SI 
+ 3].BETA 
refers to the 
field 
BETA 
in 
the 
element 
of 
ALPHA 


indexed by [SI + 3] (based indexed). 


Note that DI may be used in place of SI in these 
cases and that BP may be substituted 
for BX. 


Without 
a segment override prefix, expressions 
containing BP refer to the current stack segment, 
and expressions containing 
BX refer to the cur- 
rent data segment. 


An ASM-86 program is organized into a series of 
named segments. These are "logical" 
segments; 


they 
are 
eventually 
mapped 
into 
8086/8088 


memory segments, 
but this usually is not done 
until the program is located. A SEGMENT direc- 
tive starts a segment, and an ENDS directive ends 
the 
segment 
(see figure 
2-63). 
All 
data 
and 


instructions 
written 
between 
SEGMENT 
and 


ENDS are part of the named segment. In small 
programs, 
variables often are defined in one or 


two segment(s), stack space is allocated in another 
segment, and instructions are written in a third or 
fourth segment. It is perfectly possible, however, 
to write a complete program 
in one segment; if 


this is done, all the segment registers will contain 
the same 
base 
address; 
that 
is, the 
memory 
segments 
will completely 
overlap. 
Large 
pro- 


grams may be divided into dozens of segments. 


The 
first 
instructions 
in 
a 
program 
usually 


establish 
the correspondence 
between 
segment 


names and segment registers, and then load each 
segment register with the base address of its cor- 
responding segment. The ASSUME directive tells 
the assembler what addresses will be in the seg- 
ment registers at execution time. The assembler 
checks each memory instruction 
operand, 
deter- 


mines which segment it is in and which segment 
register contains the address of that segment. If 
the assumed register is the register expected by the 
hardware 
for 
that 
instruction 
type, 
then 
the 


assembler generates the machine instruction nor- 
mally. If, however, the hardware expects one seg- 
ment register to be used, and the operand isnot in 
the segment pointed to by that register, then the 
assembler 
automatically 
precedes 
the machine 


instruction 
with a segment override prefix byte. 


(If 
the 
segment 
cannot 
be 
overridden, 
the 


assembler produces an error message.) An exam- 
ple may clarify this. If register BP is used in an 
instruction, 
the 8086 and 8088 CPUs expect, as a 


default, that the memory operand will be located 
in the segment pointed to by SS-in 
the current 


DATA_SEG 
SEGMENT 
; DATA DEFINITIONS GO HERE 
DATA_SEG 
ENDS 


STACK_SEG 
SEGMENT 
; ALLOCATE 
100WORDS FOR A STACK AND 
LABEL THE INITIAL TOS FOR LOADING SP. 
OW 100 DUP(?) 
STACK TOP LABEL WORD 
STACK_SEG 
ENDS 


CODE_SEG 
SEGMENT 


; GIVE ASSEMBLER 
INITIAL REGISTER-TO-SEGMENT 


; 
CORRESPONDENCE. 
NOTE THAT IN THIS 
; 
PROGRAM THE EXTRA SEGMENT INITIALLY 
; 
OVERLAPS THE DATA SEGMENT ENTIRELY. 


ASSUME 
CS: CODE_SEG, 
& 
OS: DATA_SEG, 


& 
ES: DATA_SEG, 


.& 
SS: STACK_SEG 


START: 
; THIS IS THE BEGINNING OF THE PROGRAM. 
; LOC-86 WILL PLACE A JMP TO THIS 
; LOCATION AT ADDRESS FFFFOH. 


; LOAD THE SEGMENT REGISTERS. CS DOES NOT 
HAVE TO BE LOADED BECAUSE SYSTEM 
RESET SETS IT TO FFFFH, AND THE 
LONG JMP INSTRUCTION 
ATTHAT 
ADDRESS 
UPDATES IT TO TH E ADDRESS OF CODE_SEG. 
SEGMENT REGISTERS ARE LOADED FROM AX 
BECAUSE THERE IS NO IMMEDIATE-TO- 
SEGMENT_REGISTER 
FORM OF THE MOV 
INSTRUCTION. 


MOV 
AX,DATA_SEG 
MOV 
OS, AX 
MOV 
ES,AX 
MOV 
AX, STACK_SEG 
MOV 
SS,AX 


; SET STACK POINTER TO INITIAL TOS. 
MOV 
SP, OFFSET STACK_TOP 


; SEGMENTS ARE NOW ADDRESSABLE. 
; MAIN PROGRAM CODE GOES HERE. 
CODE_SEG 
ENDS 


; NEXT STATEMENT 
ENDS ASSEMBLY AND TELLS 
LOC-86 THE PROGRAMS STARTING ADDRESS. 


END 
START 


stack segment. 
A programmer 
may, 
however, 
choose to use BP to address a variable in the cur- 
rent data segment-the 
segment pointed 
to by 
DS. The ASSUME directive enables the assembler 
to 
detect 
this 
situation 
and 
to 
automatically 
generate the needed override prefix. 


It also is possible for a programmer 
to explicitly 
code segment override prefixes rather than relying 
on the assembler. This may result in a somewhat 
better-documented 
program 
since attention 
is 


called 
to 
the 
override. 
The 
disadvantage 
of 


explicit segment overrides is that the assembler 
does not check whether 
the operand 
is in fact 


addressable 
through 
the 
overriding 
segment 


register. 


ASM-86, in conjunction 
with the relocation and 
linkage 
facilities, 
provides 
much 
more 


sophisticated 
segment handling capabilities than 
have been described 
in this introduction. 
For 


example, different logical segments may be com- 
bined 
into 
the 
same 
physical 
segment, 
and 


segments may be assigned the same physicalloca- 
tions (allowing a "common" 
area to be accessed 
by different 
programs 
using different 
variable 


and label names). 


Procedures may be written in ASM-86 as well as 
in PL/M-86. 
In fact, procedures 
written in one 
language are callable from the other, 
provided 


that a few simple conventions are observed in the 
ASM-86 program. 
The purpose of ASM-86 pro- 


cedures is the same as in PL/M-86: 
to simplify the 
design of complex programs and to make a single 
copy of a commonly-used routine accessible from 
anywhere in the program. 


An ASM-86 program activates a procedure with a 
CALL instruction. The procedure terminates with 
a RET instruction, 
which transfers control to the 


instruction following the CALL. Parameters may 
be passed in registers or pushed onto the stack 
before calling the procedure. The RET instruction 
can discard stack parameters 
before returning to 


the caller. 


Unlike PL/M-86 procedures, ASM-86 procedures 
are executable where they are coded, as well as by 
a CALL 
instruction. 
Therefore, 
ASM-86 pro- 


cedures often are defined following the main pro- 
gram 
logic, 
rather 
than 
preceding 
it 
as 
in 


PL/M-86. 
Figure 
2-64 shows how procedures 


may be defined and called in ASM-86. Section 
2-10 contains examples of procedures that accept 
parameters on the stack. 


Fundamentally, 
LINK-86 
combines 
separate 


relocatable object modules into a single program. 
This 
process 
consists 
primarily 
of 
combining 


(logical) segments of the same name into single 
segments, 
adjusting 
relative 
addresses 
when 
segments are combined, 
and resolving external 


references. 


A programmer can use a procedure that is actual- 
ly contained 
in another 
module by naming the 


procedure 
in an ASM-86 EXTRN 
directive, 
or 


declaring 
the procedure 
to be EXTERNAL 
in 


PL/M-86. 
The procedure 
is defined or declared 


PUBLIC in the module where it actually resides, 
meaning that it can be used by other modules. 
When 
LINK-86 
encounters 
such 
an 
external 


reference, it searches through the other modules 
in its input, trying to find the matching PUBLIC 
declaration. 
If it finds the referenced 
object, it 


links it to the reference, "satisfying" 
the external 


reference. 
If 
it cannot 
satisfy 
the 
reference, 


LINK-86 prints a diagnostic 
message. LINK-86 


also checks PL/M-86 
procedure 
calls and func- 


tion 
references 
to insure 
that 
the 
parameters 


passed to a procedure are the type expected by the 
procedure. 


LINK-86 gives the programmer, 
particularly 
the 


ASM-86 
programmer, 
great 
control 
over 
segments (segments may be combined end to end, 
renamed, 
assigned 
the 
same 
locations, 
etc.). 


LINK-86 also produces a map that summarizes 
the link process and lists any unusual conditions 
encountered. 
While the output 
of LINK-86 is 


generally input to LOe-86, 
it also may again be 


input to LINK-86 to permit modules to be linked 
in incremental groups. 


LOC-86 
accepts 
the 
single 
relocatable 
object 


module 
produced 
by LINK-86 
and 
binds 
the 


memory 
references 
in 
the 
module 
to 
actual 


memory 
addresses. 
Its 
output 
is an 
absolute 


object module ready for loading into the memory 
of an execution vehicle. LOC-86 also inserts a 


FREQUENCY 
DB 
256 DUP (0) 


USART_DATA 
EQU 
OFFOH 
; DATA PORT ADDRESS 
USART_STAT 
EQU 
OFF2H 
; STATUS PORT ADDRESS 


NEXT: 
CALL 
CHAR_IN 
CALL 
COUNT_IT 
JMP 
NEXT 


CHAR_IN 
PROC 


; THIS PROCEDURE DOES NOT TAKE PARAMETERS. 


IT SAMPLES THE USART STATUS PORT 
UNTIL A CHARACTER 
IS READY, AND 
THEN READS THE CHARACTER 
INTO AL 
MOV 
DX, USART_STAT 
AGAIN: 
IN 
AL, DX 
; READ STATUS 
AND 
AL,2 
; CHARACTER PRESENT? 
JZ 
AGAIN 
; NO, TRY AGAIN 
MOV 
DX, USART_DATA 
IN 
AL, DX 
; YES, READ CHARACTER 
RET 
CHAR_IN 
ENDP 


COUNT_IT 
PROC 


; THIS PROCEDURE EXPECTS A CHARACTER IN AL. 


IT INCREMENTS A COUNTER IN A FREQUENCY 
TABLE BASED ON THE BINARY VALUE OF 
THE CHARACTER. 
XOR 
MOV 
INC 
RET 
ENDP 


AH, AH 
; CLEAR HIGH BYTE 
SI, AL 
; INDEX INTO TABLE 
FREQUENCY [S]; BUMPTHECOUNTER 


direct intersegment 
JMP instruction 
at location 


FFFFOH. The target of the JMP instruction is the 
logical beginning of the program. When the 8086 
or 8088 is reset, this instruction 
is automatically 
executed to restart the system. LOC-86 produces 
a memory map of the absolute object module and 
a table 
showing 
the address 
of every symbol 


defined in the program. 


LIB-86 is a valuable adjunct 
to the R & L pro- 


grams. It is used to maintain relocatable 
object 


modules in special files called libraries. Libraries 


are a convenient 
way to make 
collections 
of 


modules available to LINK-86. When a module 
being linked refers to "external" 
data or instruc- 


tions, LINK-86 can automatically 
search a series 
of libraries, 
find 
the referenced 
module, 
and 


include it in the program being created. 


OH-86 converts an absolute object module into 
Intel's standard hexadecimal format. This format 
is used by some PROM programmers 
and system 


loaders, 
such as the iSBC 957™ and SDK-86 


loaders. 


Users 
who 
have 
developed 
substantial, 
fully- 
tested 
assembly 
language 
programs 
for 
the 
SOSO/SOS5microprocessors 
may 
want 
to 
use 
CON V-S6 to automatically convert large amounts 
of this code into ASM-S6 source code (see figure 
2-65). CONV-S6 accepts an ASM-SO source pro- 
gram as input and produces an ASM-S6 source 
program 
as 
output, 
plus 
a 
print 
file 
that 
documents the conversion and lists any diagnostic 
messages. 


Some programs cannot be completely converted 
by CONV-S6. Exceptions include: 


• 
self-modifying code, 


• 
software timing loops, 


• 
SOS5RIM and SIM instructions, 


• 
interrupt code, and 


• 
macros. 


By using the diagnostic 
messages produced 
by 
CONV-S6, the converted ASM-S6 source file can 
be manually edited to clean up any sections not 
converted. 
A converted 
program 
is typically 
10-20% larger than the ASM-SO version and does 
not take full advantage of the SOS6/S0SSarchitec- 
ture. However, 
the development 
time saved by 
using CONY -S6 can make it an attractive 
alter- 
native 
to 
rewriting 
working 
programs 
from 
scratch. 


Sample Programs 


Figures 2-66 and 2-67 show how a simple program 
might be written in PL/M-S6 
and ASM-S6. The 
program 
simulates 
a pair 
of rolling dice and 
executes on an Intel SDK-S6 System Design Kit. 
The SDK-S6 is an SOS6-based computer 
with 
memory, parallel and serial I/O ports, a keypad 
and a display. The SDK-S6 is implemented on a 
single PC board which includes a large prototype 
area for system expansion and experimentation. 
A ROM-based monitor program provides a user 
interface to the system; commands 
are entered 
through 
the keypad and monitor 
responses are 
written on the display. With the addition 
of a 
cable and software 
interface 
(called SDK-CS6), 
the SDK-S6 may be connected 
to an Intellec® 


Microcomputer 
Development 
System. 
In 
this 
mode, the user enters monitor 
commands 
from 
the Intellec keyboard and receives replies on the 
Intellec CRT display. 


r-, 


- 
- 
••• 
( 
EDIT 
) 


I 
'-1.J 


I 
rEDiTE57 


~ 


J 
ASM-86 I 
•• 
SOURCE 
~~~~ 


The dice program runs on an SDK-86 that is con- 
nected to an Intellec® Microcomputer 
Develop- 
ment System. 
The program 
displays 
two con- 
tinuously changing digits in the upper left corner 
of the Intellec display. 
The digits are random 
numbers in the range 1-6. A roll is started 
by 
entering a monitor 
GO command. 
Pressing the 
INTR key on the SDK-86 keypad stops the roll. 


There are two procedures in the PL/M-S6 version 
of the dice program. 
The first is called CO for 
console output. This is an untyped PUBLIC pro- 
cedure that is supplied on an SDK-CS6 diskette. 
CO is written 
in PL/M-86 
and 
outputs 
one 
character 
to the Intellec console. 
It is declared 
EXTERNAL in the dice program because it exists 
in 
another 
module. 
LINK-86 
searches 
the 
SDK-C86 library for CO and includes it in the 
single relocatable object module it builds. 


RANDOM 
is an internal typed procedure; 
It IS 
contained in the dice module and returns a word 
value that is a random number between I and 6. 
RANDOM 
does not use any parameters 
and is 
activated 
in the parameter 
list passed 
to CO. 
When CO is called like this, first RANDOM is ac- 
tivated, then 30 is added to the number it returns 
and the sum is passed to CO. 


PL/M-86 
COMPILER 
DICE 


ISIS-II 
PL/M-86 
V1.2 
COMPILATION 
OF 
MODULE 
DICE 


OBJECT 
MODULE 
PLACED 
IN 
:F1:DICE.OBJ 


COMPILER 
INVOKED 
BY: 
PLM86 
:F1 :DICE.P86 
XREF 


8 
1 
9 
2 


10 
2 


11 
1 
12 
2 


13 
2 


14 
2 


15 
2 
16 
2 
17 
2 


18 
19 


20 


21 
2 
22 
2 
23 
2 


24 
2 
25 
2 
26 
2 


27 


1* 
GIVE 
NAMES 
TO 
CONSTANTS 
*1 
DECLARE 
CLEAR$CRT1 
LITERALLY 
'OlBH'; 
DECLARE 
CLEAR$CRT2 
LITERALLY 
'045H'; 


DECLARE 
HOME$CURSOR1 
LITERALLY 
'OlBH'; 
DECLARE 
HOME$CURSOR2 
LITERALLY 
'048H'; 


DECLARE 
SPACE 
LITERALLY 
'020H'; 


1* 
INTELLEC 
*1 
1* 
CRT 
*1 
1* 
CONTROL 
*1 
1* 
CODES 
*1 
I*ASCII 
BLANK*I 


1* 
PROGRAM 
VARIABLES 
*1 
DECLARE 
(RANDOM$NUMBER,SAVE) 
WORD; 


1* 
CONSOLE 
OUTPUT 
PROCEDURE 
*1 
CO: 
PROCEDURE(X) 
EXTERNAL; 


DECLARE 
X 
BYTE; 


END 
CO; 


1* 
RANDOM 
NUMBER 
GENERATOR 
PROCEDURE 
*1 
1* 
ALGORITHM 
FOR 
16-BIT 
RANDOM 
NUMBER 
FROM: 
*1 


1* 
"A 
GUIDE 
TO 
PL/M 
PROGRAMMING 
FOR 
*1 
1* 
MICROCOMPUTER 
APPLICATIONS," 
*1 
1* 
DANIEL 
D. 
MCCRACKEN, 
*1 
1* 
ADDISON-WESLEY, 
1978 
*1 
RANDOM: 
PROCEDURE 
WORD; 
RANDOM$NUMBER 
= 
SAVE; 
I*START 
WITH 
OLD 
NUMBER*I 
RANDOM$NUMBER 
= 
2053 
* 
RANDOM$NUMBER 
+ 
13849; 
SAVE 
= 
RANDOM$NUMBER; 
I*SAVE 
FOR 
NEXT 
TIME*I 


I*FORCE 
16-BIT 
NUMBER 
INTO 
RANGE 
1-6*1 


RANDOM$NUMBER 
= 
RANDOM$NUMBER 
MOD 
6 
+ 
1; 
RETURN 
RANDOM$NUMBER; 
END 
RANDOM; 


1* 
MAIN 
ROUTINE 
*1 
1* CLEAR 
THE 
SCREEN*I 


CALL 
CO(CLEAR$CRT1); 
CALL 
CO(CLEAR$CRT2); 


1* 
ROLL 
THE 
DICE 
UNTIL 
INTERRUPTED 
*1 


DO 
WHIlE 
1; 
I""DO 
FOREVER""I 


I*NOTE 
THAT 
ADDING 
30 
TO 
THE 
DIE 
VALUE 
I" 
CONVERTS 
IT 
TO 
ASCII. 
CALL 
CO(RANDOM 
+ 
030H); 
CALL 
CO(SPACE); 
CALL 
CO(RANDOM 
+ 
030H); 
I" 
HOME 
THE 
CURSOR 
"I 
CALL 
CO(HOME$CURSOR1); 
CALL 
CO(HOME$CURSOR2); 
END; 


*1 
"I 
1* 1ST 
DIE"I 
I*BLANK*I 
1"2ND 
DIE"I 


CLEARCRT1 
LITERALLY 


18 


CLEARCRT2 
LITERALLY 


19 


CO 
PROCEDURE 
EXTERNAL(O) 
STACK=OOOOH 
18 
19 
21 
22 
23 
24 
25 


71 
DICE 
PROCEDURE 
STACK=0004H 


HOMECURSOR1 
LITERALLY 


24 


HOMECURSOR2 
LITERALLY 


25 


44 
RANDOM 
PROCEDURE 
WORD 
STACK=0002H 


21 
23 


MODULE 
INFORMATION: 


CODE 
AREA 
SIZE 
CONSTANT 
AREA 
SIZE 
VARIABLE 
AREA 
SIZE 
MAXIMUM 
STACK 
SIZE 
51 
LINES 
REIlD 


o 
PROGRAM 
ERROR(S) 


0075H 
OOOOH 
0004H 
0004H 


117D 


OD 
4D 
4D 


ISIS-II 
MCS-86 
MACRO 
ASSEMBLER 
V2.0 
ASSEMBLY 
OF 
MODULE 
DICE 
OBJECT 
MODULE 
PLACED 
IN 
:F1:DICE.OBJ 
ASSEMBLER 
INVOKED 
BY: 
ASM86 
:Fl:DICE.A86 
XREF 


0000 
1BOO 
0002 
4500 
0004 
lBOO 
0006 
4800 
0008 
2000 
OOCA 
???? 


; SEGME~T 
GROUP 
UEFINITIONS 
NEEDED 
FOR 
PL/M-86 
COMPATIBILITY 
CGROUP 
GROUP 
CODE 


DGROUP 
GROUP 
DATA,STACK 


INFORM 
ASSEMBLER 
OF 
SEGMENT 
REGISTER 
CONTENTS. 
ASSUME 
CS:CGROUP,DS:DGROUP,SS:DGROUP,ES:NOTHING 


; ALLOCATE 
DATA 
UATA 
SEGMENT 
PUBLIC 
'DATA' 
NOTE 
THAT 
THE 
FOLLOWING 
ARE 
PASSED 
ON 
THE 
STACK 
TO 
THE 
PL/M-86 


PROCEDURE 
'CO'. 
BY 
CONVENTION, 
A 
BYTE 
PARAMETER 
IS 
PASSED 
IN 


THE 
LOW-ORDER 
8-BITS 
OF 
A WORD 
ON 
THE 
STACK. 
HENCE, 
THESE 
ARE 


; 
DEFINED 
AS 
WORD 
VALUES, 
THOUGH 
THEY 
OCCUPY 
1 BYTE 
ONLY. 


CLEAR 
CRT1 
DW 
OlBH 
INTELLEC 


CLEAR-CRT2 
DW 
045H 
CRT 
HOME 
CURSORl 
DW 
OlBH 
CONTROL 


HOME-CURSOR2 
DW 
048H 
CODES 
SPACE 
DW 
020H 
ASCII 
BLANK 
SAVE 
DW? 
HOLDS 
LAST 
16-BIT 
RANDOM 
NUMBER 
DATA 
ENDS 


; ALLOCATE 
STACK 
SPACE 


STACK 
SEGMENT 
STACK 
'STACK' 


DW 
20 
DUP 
(?) 


; LABEL 
INITIAL 
TOS: 
FOR 
LATER 
USE. 


STACK 
TOP 
LABEL 
WORD 
STACK- 
ENDS 


RANDOM 
NUMBER 
GENERATOR 
PROCEDURE 
ALGORITHM 
FOR 
16-BIT 
RANDOM 
NUMBER 
FROM: 


"A 
GUIDE 
TO 
PL/M 
PROGRAMMING 
FOR 
MICROCOMPUTER 
APPLICATIONS," 
DANIEL 
D. 
MCCRACKEN 
; 
ADDISON-WESLEY, 
1978 
RANDOM 
PROC 
MOV 
AX,SAVE 
; NEW 
NUMBER 


NAME 
TYPE 
VALUE 
ATTRIBUTES, 
XREFS 


??SEG 
SEGMENT 
SIZE=OOOOH 
PARA 
PUBLIC 
CGROUP. 
GROUP 
CODE 
7U 
11 
CLEAR 
CRT 1. 
V WORD 
OOOOH 
DATA 
19/1 77 
CLEAR-CRT2. 
V WORD 
0002H 
DATA 
20# 
79 
CG. 
L NEAR 
OOOOH 
EXTRN 
411 78 
80 
86 
88 
92 
95 
97 
CODE. 
SEGMENT 
SIZE=005AH 
PARA 
PUBLIC 
'CODE' 
711 37 
100 
DATA. 
SEGMENT 
SIZE=OOOCH 
PARA 
PUBLIC 
'DATA' 
811 14 25 
DGROUP. 
GROUP 
DATA 
STACK 
811 11 
11 
69 
74 
flOME CURSOR 
1. V WORD 
0004H 
DATA 
21/1 94 
HOME-CURSOR2. 
V WORD 
0006H 
DATA 
2211 96 
RANDTIM. 
L 
NEAR 
OOOOH 
CODE 
4611 60 
83 
89 
ROLL. 
L 
NEAR 
0031H 
CODE 
83/1 99 
SAVE. 
V WORD 
OOOAH 
DATA 
24/1 47 
51 
SPACE 
V WORD 
0008H 
DATA 
2311 87 
STACK 
SEGMENT 
SIZE=0028H 
PARA 
STACK 
'STACK' 


STACK 
TOP 
V 
WORD 
0028H 
STACK 
3211 74 
START- 
. 
L 
NEAR 
0019H 
CODE 
69/1 104 


The ASM-86 version of the dice program operates 
like the PL/M-86 version. Since the program uses 
the PL/M-86 
CO procedure 
for writing data to 


the Intellec console, it adheres to certain conven- 
tions established by the PL/M-86 
compiler. The 
program's 
logical 
segments 
(called 
CODE, 
DATA and STACK-the 
program does not use 


an extra segment) are organized into two groups 
called CGROUP and DGROUP. All the members 
of a group of logical segments are located in the 
same 
64k 
byte 
physical 
memory 
segment. 


Physically, 
the program's 
DATA and STACK 


segments can be viewed as "subsegments" 
of 
DGROUP. 


PL/M-86 
procedures 
expect parameters 
to be 
passed on the stack, so the program pushes each 
character before calling CO. Note that the stack 
will be "cleaned up" by the PL/M-86 procedure 
before 
returning 
(Le., 
the 
parameter 
will be 
removed from the stack by CO). 


2.10 
Programming 
Guidelines 
and Examples 


This section addresses 
8086/8088 
programming 


from 
two 
different 
perspectives. 
A series of 


general 
guidelines 
is 
presented 
first. 
These 


guidelines apply to all types of systems and are 
intended to make software easier to write, and 
particularly, easier to maintain and enhance. The 
second part contains 
a number of specific pro- 


gramming 
examples. 
Written 
primarily 
in 


ASM-86, 
these 
examples 
illustrate 
how 
the 


instruction set and addressing modes may be uti- 
lized in various, commonly encountered program- 
ming situations. 


These guidelines encourage 
the development 
of 


8086/8088 
software that is adaptable 
to change. 


Some of the guidelines refer to specific processor 
features and others suggest approaches to general 
software 
design issues. PL/M-86 
programmers 
need not be concerned with the discussions that 
deal with specific hardware 
topics; they should, 
however, 
give careful 
attention 
to the system 
design subjects. Systems 
that 
are designed 
in 
accordance 
with 
these 
recommendations 
should 
be less costly to modify 
or extend. 
In 
addition, 
they should 
be better-positioned 
to 


take advantage 
of new hardware 
and software 
products 
that are constantly 
being introduced 
by Intel. 


Segments should be considered 
as independent 


logical units whose physical locations in memory 
happen 
to be defined by the contents of the seg- 


ment registers. Programs 
should be independent 


of the actual contents of the segment registers and 
of the physical locations of segments in memory. 
For 
example, 
a 
program 
should 
not 
take 


advantage of the "knowledge" 
that two segments 


are physically adjacent to each other in memory. 
The single exception 
to this fully-independent 


treatment of segments is that a program may set 
up more than one segment register to point to the 
same 
segment 
in memory, 
thereby 
obtaining 


addressability 
through 
more than 
one segment 


register. For example, if both DS and ES point to 
the same segment, a string located in that segment 
may be used as a source operand 
in one string 


instruction and as a destination 
string in another 
instruction 
(recall that a destination 
string must 


be located in the extra segment). 


Any data aggregate or construct such as an array, 
a structure, a string or a stack should be restricted 
to 64k bytes in length and should be wholly con- 
tained in one segment (i.e., should not cross a seg- 
ment boundary). 


Segment registers should only contain values sup- 
plied by the relocation and linkage facilities. Seg- 
ment register values may be moved to and from 
memory, pushed onto the stack and popped from 
the stack. Segment registers should never be used 
to hold temporary 
variables nor should they be 


altered in any other way. 


As an additional 
guideline, code should not be 
written within six bytes of the end of physical 
memory (or the end of the code segment if this 
segment is dynamically 
relocatable). 
Failure to 


observe this guideline could result in an attempted 
opcode 
prefetch 
from 
non-existent 
memory, 


hanging the CPU if READY is not returned. 


It is possible to write a program that deliberately 
changes some of its own machine 
instructions 


during execution. While this technique may save a 
few bytes or machine cycles, it does so at the 
expense of program 
clarity. This is particularly 
true if the program 
is being examined 
at the 
machine instruction 
level; the machine instruc- 
tions shown in the assembly listing may not match 
those found in memory or monitored 
from the 
bus. It also precludes executing the code from 
ROM. Also, because of the prefetch queue within 
the 8086 and 8088, code that 
is self-modified 
within six bytes of the current point of execution 
cannot 
be guaranteed 
to execute as intended. 
(This code may already have been fetched.) Fin- 
ally, 
a 
self-modifying 
program 
may 
prove 
incompatible 
with 
future 
Intel 
products 
that 
assume 
that 
the 
content 
of 
a code 
segment 
remains constant during execution. 


A corrollary 
to this requirement 
is that variable 
data should not be placed in a code segment. Con- 
stant data may be written in a code segment, but 
this is not recommended 
for two reasons. First, 
programs 
are simpler to understand 
if they are 
uniformly subdivided into segments of code, data 
and stack. Second, placing data in a code segment 
can restrict the segment's position independence. 
This is because, 
in general, 
the segment 
base 
address of a data item may be changed, but the 
offset (displacement) 
of the data item may not. 


This 
means 
that 
the 
entire 
segment 
must 
be 
moved as a unit to avoid changing the offset of 
the constant 
data. 
If the constant 
data 
were 
located in a data segment or an extra segment, 
individual 
procedures 
within the code segment 
could be moved independently. 


Since 
I/O 
devices 
vary 
so 
widely 
in 
their 
capabilities and their interface designs, I/O soft- 
ware is inevitably device dependent. 
Substituting 
a hard 
disk 
for 
a floppy 
disk, 
for example, 
necessitates 
software 
changes 
even though 
the 
disks are functionally identical. I/O software can, 
however, be designed to minimize the effect of 
device changes on programs. 


Figure 2-68 illustrates a design concept that struc- 
tures an I/O system into a hierarchy of separately 
compiled/assembled 
modules. 
This 
approach 
isolates 
application 
modules 
that 
use 
the 
input/output 
devices 
from 
all 
physical 
characteristics 
of the hardware 
with which they 
ultimately communicate. 
An application 
module 


that reads a disk file, for example, should have no 
knowledge of where the file is located on the disk, 
what size the disk sectors are, etc. This allows 
these characteristics 
to change without affecting 
the 
application 
module. 
To 
an 
application 
module, the I/O system appears to be a series of 
file-oriented commands (e.g., Open, Close, Read, 
Write). An application 
module would typically 
issue 
a 
command 
by 
calling 
a 
file 
system 
procedure. 


The file system processes I/O command requests, 
perhaps checking for gross errors, and calls a pro- 
cedure in the I/O supervisor. The I/O supervisor 
is a bridge between the functional I/O request of 
the application module and the physical I/O per- 
formed by the lowest-level modules in the hier- 
archy. There should be separate modules in the 
supervisor for different types of devices and some 
device-dependent code may be unavoidable at this 
level. The I/O supervisor would typically perform 
overhead 
activities 
such 
as 
maintaining 
disk 
directories. 


The modules that actually communicate 
with the 
I/O devices (or their controllers) are at the lowest 
level in the hierarchy. These modules contain the 
bulk of the system's device-dependent 
code that 
will have to be modified in the event that a device 
is changed. 


The 8089 Input/Output 
Processor 
is specifically 
designed 
to 
encourage 
the 
development 
of 
modular, 
hierarchical 
I/O 
systems. 
The 
8089 
allows knowledge of device characteristics 
to be 
"hidden" 
from not only application 
programs, 
but also from the operating system that controls 
the CPU. The CPU's 
I/O supervisor can simply 
prepare a message in memory that describes the 
nature of the operation to be performed, and then 
activate the 8089. The 8089 independently 
per- 
forms all physical I/O and notifies the CPU when 
the operation has been completed. 


Operating systems also should be organized in a 
hierarchy 
similar 
to the concept 
illustrated 
in 
figure 2-69. Application 
modules should "see" 
only the upper level of the operating system. This 
level might provide services like sending messages 
between 
application 
modules, 
providing 
time 
delays, etc. An intermediate level might consist of 
housekeeping 
routines 
that dispatch 
tasks, alter 
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pnorItIes, 
manage 
memory, 
etc. At the lowest 
level would 
be 
the 
modules 
that 
implement 
primitive operations such as adding and removing 
tasks or messages from lists, servicing timer inter- 
rupts, etc. 


Interrupt 
Service 
Procedures 


Procedures that service external interrupts should 
be considered differently than those that service 
internal 
interrupts. 
A service procedure 
that is 


activated by an internal interrupt, may, and often 
should, 
be made 
reentrant. 
External 
interrupt 


procedures, 
on the other hand, should be viewed 


as temporary tasks. In this sense, a task is a single 
sequential thread of execution; it should not be 
reentered. The processor's response to an external 
interrupt may be viewed as the following sequence 
of events: 


• 
the running (active) task is suspended, 


• 
a new task, the interrupt service procedure, is 
created and becomes the running task, 


• 
the interrupt task ends, and is deleted, 
• 
the 
suspended 
task 
is 
reactived 
and 
becomes the running 
task from the point 


where it was suspended. 


An external interrupt 
procedure 
should only be 


interruptable 
by a request that activates a dif- 


ferent interrupt procedure. 
When the number of 


interrupt 
sources is not too large, this can be 


accomplished by assigning a different type code 
and 
corresponding 
service 
procedure 
to 
each 


source. 
In systems 
where 
a large 
number 
of 


similar sources can generate closely spaced inter- 
rupts 
(e.g., 
500 
communication 
lines), 
an 


approach similar to that illustrated in figure 2-70, 
may be used to insure that the interrupt 
service 


procedure 
is not reentered, 
and yet, interrupts 


arriving 
in bursts 
are 
not 
missed. 
The 
basic 


technique is to divide the code required to service 
an interrupt into two parts. The interrupt service 
procedure itself is kept as short as possible; it per- 
forms the absolute minimum amount of process- 
ing necessary to service the device. It then builds a 
message that contains enough information 
to per- 


mit another task, the interrupt message processor, 
to complete 
the interrupt 
service. It adds 
the 


message to a queue (which might be implemented 
as a linked list), and terminates 
so that 
it is 


available to service the next interrupt. 
The inter- 


rupt message processor, 
which is not reentrant, 


obtains a message from the queue, finishes pro- 
cessing the interrupt associated with that message, 
obtains the next message (if there is one), etc. 
When a burst of interrupts occurs, the queue will 
lengthen, but interrupts will not be missed so long 
as there is time for the interrupt service procedure 
to be activated and run between requests. 


MULTIPLE INTERRUPT SOURCES 
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Parameters 
are frequently 
passed to procedures 
on a stack. Results produced 
by the procedure, 
however, 
should be returned 
in other memory 
locations or in registers. In other words, the called 
procedure 
should "clean 
up" 
the stack by dis- 
carding 
the parameters 
before 
returning. 
The. 
RET 
instruction 
can 
perform 
this 
function. 
PL/M-86 
procedures 
always 
follow 
this 
convention. 


Programs should make no assumptions about the 
contents of the undefined bits in the flag-images 
stored 
in memory 
by the PUSHF 
and SAHF 
instructions. These bits always should be masked 
out of any comparisons 
or tests that use these 
flag-images. The undefined bits of the word flag- 
image can be cleared by ANDing the word with 
FD5H. The undefined bits of the byte flag-image 
can be cleared by ANDing the byte with D5H. 


These 
examples 
demonstrate 
the 
8086/8088 
instruction set and addressing modes in common 
programming situations. The following topics are 
addressed: 


procedures (parameters, reentrancy) 


various 
forms 
of 
JMP 
and 
CALL 
instructions 


bit manipulation 
with the ASM-86 RECORD 
facility 


dynamic code relocation 


memory mapped I/O 


breakpoints 


interrupt handling 


string operations 


These examples are written primarily in ASM-86 
and will be of most interest to assembly language 
programmers. 
The PL/M-86 
compiler generates 
code 
that 
handles 
many 
of 
these 
situations 
automatically for PL/M-86 programs. For exam- 
ple, 
the compiler 
takes 
care 
of the 
stack 
in 
PL/M-86 
procedures, 
allowing the programmer 
to concentrate 
on solving the application 
prob- 
lem. PL/M-86 programmers, 
however, may wan,t 


to 
examine 
the 
memory 
mapped 
I/O 
and 
interrupt 
handling 
examples, since the concepts 
illustrated 
are generally applicable; 
one of the 
interrupt procedures is written in PL/M-86. 


The examples are intended to show one way to use 
the instruction set, addressing modes and features 
of ASM-86. They do not demonstrate 
the "best" 
way to solve any particular, problem. The flexibil- 
ity of the 8086 and 8088, application 
differences 
plus variations in programming 
style usually add 
up to a number of ways to implement a program- 
ming solution. 


The code in figure 2-71 illustrates 
several tech- 
niques that are typically used in writing ASM-86 
procedures. 
In this example a calling program 
invokes a procedure 
(called EXAMPLE) 
twice, 
passing if a different byte array each time. Two 
parameters are passed on the stack; the first con- 
tains the number of elements in the array, and the 
second 
contains 
the 
address 
(offset 
in 
DAT A_SEG) 
of the first array element. 
This 
same technique can be used to pass a variable- 
length parameter list to a procedure (the "array" 
could be any series of parameters 
or parameter 
addresses). Thus, although the procedure always 
receives two parameters, 
these can be used to 
indirectly 
access any 
number 
of 
variables 
in 
memory. 


Any results returned 
by a procedure 
should be 
placed in registers or in memory, but not on the 
stack. AX or AL is often used to hold a single 
word or byte result. Alternatively, the calling pro- 
gram can pass the address (or addresses) of a 
result area to the procedure as a parameter. 
It is 
good practice for ASM-86 programs to follow the 
calling conventions 
used by PL/M-86; 
these are 
documented 
in MCS-86 Assembler 
Operating 
Instructions 
For 
ISIS-II 
Users, 
Order 
No. 
9800641. 


EXAMPLE 
is defined 
as a FAR 
procedure, 
meaning it is in a different segment than the call- 
ing program. 
The calling program 
must use an 
intersegment 
CALL 
to activate 
the procedure. 
Note that this type of CALL saves CS and IP on 
the stack. If EXAMPLE 
were defined as NEAR 
(in the same segment as the caller) then an intra- 
segment CALL would be used, and only IP would 
be saved on the stack. It is the responsibility 
of 
the calling program to know how the procedure is 
defined and to issue the correct type of CALL. 


STACK_SEG 
SEGMENT 
DW 
20 DUP (?) 
; ALLOCATE 20-WORD STACK 


STACK_TOP 
LABEL 
WORD 
; LABEL INITIAL TOS 
STACK_SEG 
ENDS 


DATA_SEG 
SEGMENT 
ARRAY_1 
DB 
10 DUP (?) 
; 10-ELEMENT BYTE ARRAY 


ARRAY_2 
DB 
5 DUP (?) 
; 5-ELEMENT BYTE ARRAY 


DATA_SEG 
ENDS 


PROC_SEG 
SEGMENT 
ASSU ME CS:PROC_SEG 
,DS:DATA_SEG 
,SS:STACK_SEG 
,ES:NOTHING 


; MUST BE ACTIVATED BY 
INTERSEGMENT 
CALL 


; PROCEDURE PROLOG 
PUSH 
MOV 
PUSH 
PUSH 
PUSHF 
SUB 
SP,6 


; END OF PROLOG 


; PROCEDURE BODY 
MOV 
CX,[BP+8] 
;GETELEMENTCOUNT 
MOV 
BX, [BP+6] 
; GET OFFSET OF 1ST ELEMENT 


; PROCEDURE CODE GOES HERE 
; FIRST PARAMETER CAN BE ADDRESSED: 
; 
[BX] 
; LOCAL STORAGE CAN BE ADDRESSED: 
; 
[BP-8], 
[BP-10], 
[BP-12] 


; END OF PROCEDURE BODY 


; PROCEDURE EPILOG 
ADD 
POPF 
POP 
BX 
POP 
CX 
POP 
BP 
; END OF EPILOG 


; PROCEDURE RETURN 
RET 


BP 
BP,SP 
CX 
BX 


; SAVE BP 
; ESTABLISH BASE POINTER 
; SAVE CALLER'S 
REGISTERS 


; 
AND FLAGS 


; ALLOCATE3WORDS 
LOCAL STORAGE 


; DE-ALLOCATE 
LOCAL STORAGE 


; RESTORE CALLER'S 
REGISTERS 
AND 
FLAGS 


CALLER_SEG 
SEGMENT 
; GIVE ASSEMBLER 
SEGMENT/REGISTER 
CORRESPONDENCE 
ASSUME 
CS:CALLER_SEG, 
& 
DS:DATA_SEG, 
& 
SS:STACK_SEG, 
& 
ES:NOTHING 
; NO EXTRA SEGMENT IN THIS PROGRAM 


; INITIALIZE SEGMENT REGISTERS 
START: 
MOV 
AX,DATA_SEG 
MOV 
DS,AX 
MOV 
AX,STACK_SEG 
MOV 
SS,AX 
MOV 
SP,OFFSET STACK_TOP 
; POINT SP TO TOS 


; ASSUME ARRAY _1 IS INITIALIZED 


; CALL "EXAMPLE", 
PASSING ARRAY_1, 
THAT IS, THE NUMBER OF ELEMENTS 
IN THE ARRAY, AND THE LOCATION OF THE FIRST ELEMENT. 
MOV 
AX,SIZE ARRAY_1 
PUSH 
AX 
MOV 
AX,OFFSET ARRAY_1' 


PUSH 
AX 
CALL 
EXAMPLE 


; ASSUME ARRAY _21S 
INITIALIZED 


MOV 
PUSH 
MOV 
PUSH 
CALL 


AX,SIZE ARRAY _2 
AX 
AX,OFFSET ARRAY_2 
AX 
EXAMPLE 
ENDS 


END 
START 


Figure 2-72 shows the stack before 
the caller 
pushes the parameters onto it. Figure 2-73 shows 
the stack as the procedure 
receives it after the 
CALL has been executed. 


EXAMPLE 
is divided into four sections. 
The 
"prolog" 
sets up register BP so it can be used to 
address data on the stack (recall that specifying 
BP as a base register 
in an instruction 
auto- 
matically refers to the stack segment unless a seg- 
ment override prefix is coded). The next step in 
the prolog is to save the "state of the machine" as 


it existed when the procedure was activated. This 
is done by pushing any registers used by the pro- 
cedure ('only CX and BP in this case) onto the 
stack. If the procedure changes the flags, and the 
caller expects the flags to be unchanged following 
execution 
of the procedure, 
they also may be 
saved on the stack. The last instruction in the pro- 
log allocates three words on the stack for the pro- 
cedure to use as local temporary 
storage. Figure 
2-74 shows the stack at the end of the prolog. 
Note that PL/M-86 
procedures 
assume that all 
registers except SP and BP can be used without 
saving and restoring. 


PARAMETER 1 


PARAMETER 2 


OLD CS 


OLDIP 


OLD BP 


OLD CX 


OLD BX 


OLD FLAGS 


LOCAL 1 


LOCAL2 


LOCAL 3 


BP+8_ 
BP+6_ 


BP-8_ 
BP-l0_ 


BP-12_ 


PARAMETER 1 


PARAMETER 2 


OLDCS 


OLDIP 


The procedure "body" 
does the actual processing 
(none in the example). 
The parameters 
on the 
stack are addressed relative to BP. Note that if 
EXAMPLE 
were a NEAR procedure, 
CS would 


not be on the stack and the parameters would be 
two bytes "closer" 
to BP. BP also is used to 


address the local variables on the stack. Local 
constants 
are 
best 
stored 
in a data 
or extra 
segment. 


The procedure "epilog" 
reverses the activities of 


the prolog, leaving the stack as it was when the 
procedure was entered (see figure 2-75). 


, 
c 


PARAMETER 1 


PARAMETER 2 


RETURN ADDRESS 


OLDBP 


... 


The procedure "return" 
restores CS and IP from 


the stack and discards the parameters. 
As figure 
2-76 shows, when the calling program is resumed, 
the stack is in the same state as it was before any 
parameters were pushed onto it. 


Figure 2-77 shows a simple procedure that uses an 
ASM-86 structure to address the stack. Register 
BP is pointed to the base of the structure, which is 
the top of the stack since the stack grows toward 
lower addresses (see figure 2-78). Any structure 
element can then be addressed by specifying BP as 
a base register: 


Figure 2-79 shows a different approach 
to using 


an ASM-86 structure to define the stack layout. 
As shown in figure 2-80, register BP is pointed at 
the middle of the structure (at OLD_BP) 
rather 


than at the base of the structure. 
Parameters and 


the return 
address 
are thus located 
at positive 


displacements 
(high addresses) 
from BP, while 


local 
variables 
are 
at 
negative 
displacements 


(lower addresses) from BP. This means that the 
local variables will be "closer" 
to the beginning 


of the stack segment and increases the likelihood 
that the assembler will be able to produce shorter 
instructions 
to access these variables, 
i.e., their 


offsets from SS may be 255 bytes or less and can 
be expressed as a I-byte value rather than a 2-byte 
value. Exit from the subroutine 
also is slightly 


faster because a MOV instruction can be used to 
deallocate the local storage instead of an ADD 
(compare figure 2-71). 


It is possible for a procedure to be activated a sec- 
ond time before 
it has returned 
from its first 


activation. 
For example, 
procedure 
A may call 


procedure 
B, and an interrupt 
may occur while 


procedure B is executing. If the interrupt service 
procedure calls B, then procedure 
B is reentered 


and must be written to handle this situation cor- 
rectly, 
Le., 
the 
procedure 
must 
be 
made 


reentrant. 


An ASM-86 procedure will be reentrant if it uses 
the stack for storing all local variables. When the 
procedure 
is reentered, 
a new "generation" 
of 


variables will be allocated on the stack. The stack 
will grow, 
but 
the sets of variables 
(and 
the 


parameters 
and 
return 
addresses 
as well) will 
automatically be kept straight. The stack must be 
large 
enough 
to 
accommodate 
the 
maximum 


"depth" 
of procedure 
activation 
that can occur 


under actual running conditions. 
In addition, any 


procedure 
called by a reentrant 
procedure 
must 


itself be reentran t. 


A related situation 
that also requires reentrant 


procedures 
is 
recursion. 
The 
following 
are 


examples of recursion: 


• 
A calls A (direct recursion), 


• 
A calls B, B calls A (indirect recursion), 


• 
A calls B, B calls C, C calls A (indirect 
recursion). 


CODE 
SEGMENT 
ASSUME CS:CODE 
MAX 
PROC 


; THIS PROCEDU RE IS CALLED BY THE FOLLOWING 
SEQUENCE: 


PUSH PARM1 
PUSH PARM2 


; 
CALL MAX 


; IT RETURNS THE MAXIMUM OF THE TWO WORD 
PARAMETERS 
IN AX. 


; DEFINE THE STACK LAYOUT AS A STRUCTURE. 
STACK_LAYOUT 
STRUC 
OLD_BP 
DW? 
RETURN_~DDR 
DW? 
PARM_2 
DW? 
PARM_1 
DW? 
STACK_LAYOUT 
ENDS 


; SAVED BP VALUE-BASE 
OF STRUCTURE 


; RETURN ADDRESS 
; SECOND PARAMETER 
; FIRST PARAMETER 


; PROLOG 
PUSH 
BP 
; SAVE IN OLD_BP 
MOV 
BP, SP 
; POINT TO OLD_BP 


; BODY 
MOV 
AX, [BP].PARM_1 
; IF FIRST 
CMP 
AX, [BP].PARM_2 
; >SECOND 
JG 
FIRST_IS_MAX 
; THEN RETURN FIRST 
MOV 
AX, [BP].PARM_2 
; ELSE RETURN SECOND 


; EPILOG 
FIRST_IS_MAX: 
POP 
BP 
; RESTORE BP (& SP) 
; RETURN 
RET 
4 
; DISCARD PARAMETERS 
MAX 
ENDP 


CODE 
ENDS 
END 


, 


PARAMETER 
1 


PARAMETER 
2 


RETURN 
ADDRESS 


OLD 
BP 


" 
" 


The 8086/8088 
instruction set contains many dif- 


ferent types of JMP and CALL instructions (e.g., 
direct, indirect through register, indirect through 
memory, 
etc.). These varying types of transfer 


provide efficient use of space and execution time 
in different programming 
situations. 
Figure 2-81 


illustrates typical use of the different 
forms of 


these 
instructions. 
Note 
that 
the 
ASM-86 


assembler uses the terms "NEAR" 
and "FAR" 


to denote intrasegment 
and intersegment 
trans- 


fers, respectively. 


8086 AND 8088 CENTRAL 
PROCESSING 
UNITS 
EXTRA 
SEGMENT 


; CONTAINS STRUCTURE TEMPLATE THAT "NEARPROC" 
; 
USESTO ADDRESS AN ARRAY PASSED BY ADDRESS. 
DUMMY 
STRUC 
PARM_ARRAY 
DB 
256 DUP 7 
DUMMY 
ENDS 
EXTRA 
ENDS 


SEGMENT 
ASSUME CS:CODE,ES:EXTRA 
NEARPROC 
PROC 


; LAY OUTTHE 
STACK (THE DYNAMIC STORAGE AREA OR DSA). 


DSASTRUC 
STRUC 
I 
DW 
LOC_ARRAY 
DW 
OLD_BP 
DW 
RETADDR 
DW 
POINTER 
DD 
COUNT 
DB 
DB 
ENDS 


10 DUP (7) 
7 
7 
7 
7 
7 


, 
; ORIGINAL BP VALU E 
; RETURN ADDRESS 
; 2ND PARM-POINTERTO 
"PARM_ARRAY" 
; 1ST PARM-A 
BYTE OCCUPIES 


A WORD ON THE STACK 


; USE AN EQU TO DEFINE THE BASE ADDRESS OF THE 
DSA. CANNOT SIMPLY USE BP BECAUSE IT WILL 
BE POINTING TO "OLD_BP" 
IN THE MIDDLE OF 


; 
THE DSA. 


DSA 
EQU 
[BP - OFFSET OLD_BP] 


; PROCEDURE ENTRY 
PUSH 
MOV 
SUB 


BP 
; SAVE BP 
BP, SP 
; POINT BP AT OLD_BP 
SP, OFFSET OLD_BP; 
ALLOCATE 
LOC_ARRAY 
& I 


; PROCEDURE BODY 


; ACCESS LOCAL VARIABLE I 
MOV 
AX,DSA.I 


; ACCESS LOCAL ARRAY (3) I.E., 4TH ELEMENT 
MOV 
SI,6 
; WORD ARRAY-INDEX 
IS 3*2 
MOV 
AX,DSA.LOC_ARRAY 
[SI] 


; LOAD POINTER TO ARRAY PASSED BY ADDRESS 
LES 
BX,DSA.POINTER 


; ES:BX NOW POINTS TO PARM_ARRAY 
(0) 


; ACCESS SI'TH ELEMENT OF PARM_ARRAY 
MOV 
AL,ES:[BX].PARM_ARRAY 
[SI] 


; ACCESS THE BYTE PARAMETER 
MOV 
AL,DSA.COUNT 


; PROCEDURE EXIT 
MOV 
SP,BP 
; DE-ALLOCATE 
LOCALS 
POP 
BP 
; RESTORE BP 
; STACK NOW AS RECEIVED FROM CALLER 
RET 
6 
; DISCARD PARAMETERS 


ENDP 
ENDS 
END 


NEARPROC 
CODE 


The procedure 
in figure 2-81 illustrates 
how a 


PL/M-86 
DO 
CASE 
construction 
may 
be 


implemented in ASM-86. It also shows: 


, 
, 


I 
COUNT 


-POINTER 


RETADDR 


OLD_BP 


LOC_ARRAY 
(9) 


LOC_ARRAY 
(8) 


LOC_ARRAY 
(7) 


LOC_ARRAY 
(6) 


LOC_ARRAY 
(5) 


LOC_ARRAY 
(4) 


LOC_ARRAY 
(3) 


LOC 
ARRAY (2) 


LOC_ARRAY(1) 


LOC_ARRAY 
(0) 


I 


" 


• 
an indirect 
CALL 
through 
memory 
to a 


procedure located in another segment, 


• 
an indirect 1MP though memory to a label in 
the same segment, 


• 
an indirect 1MP through a register to a label 
in the same segment, 


• 
a direct CALL 
to a procedure 
in another 
segment, 


• 
a direct CALL to a procedure 
in the same 
segment, 


Figure 2-80. Procedure 
Example 
3 Stack Layout 


• 
direct 1MPs to labels in the same segment, 
within -128 to +127 bytes ("SHORT") 
and 


farther than -128 to +127 bytes ("NEAR"). 


DATA 
SEGMENT 


; DEFINE THE CASE TABLE (JUMP TABLE) USED BY PROCEDURE 


"DO_CASE." 
THE OFFSET OF EACH LABEL WILL 


; 
BE PLACED IN THE TABLE BY THE ASSEMBLER. 


CASE_TABLE 
DW 
ACTIONO, ACTION1, ACTION2, 
& 
ACTION3, ACTION4, ACTION5 
DATA 
ENDS 


; DEFINE TWO EXTERNAL (NOT PRESENT IN THIS 
ASSEMBLY 
BUT SUPPLIED BY R & L FACILITY) 
PROCEDURES. ONE IS IN THIS CODE SEGMENT 
(NEAR) AND ONE IS IN ANOTHER SEGMENT (FAR). 


EXTRN 
NEAR_PROC: 
NEAR, FAR_PROC: 
FAR 


; DEFINE AN EXTERNAL LABEL (JUMP TARGET) THAT 
IS IN ANOTHER SEGMENT. 


EXTRN 
ERR_EXIT: 
FAR 


CODE 
SEGMENT 
ASSUME 
CS: CODE, DS: DATA 
;ASSUMEDSHASBEENSETUP 
BY CALLER TO POINT TO "DATA" 
SEGMENT. 


DO_CASE 
PROC 
NEAR 
; THIS EXAMPLE PROCEDURE RECEIVES TWO 
PARAMETERS ON THE STACK. THE FIRST 
PARAMETER IS THE "CASE NUMBER" 
OF 
A ROUTINE TO BE EXECUTED (0-5). THE SECOND 
PARAMETER 
IS A POINTER TO AN ERROR 
PROCEDURE THAT IS EXECUTED IF AN INVALID 
CASE NUMBER (>5) IS RECEIVED. 


; LAY OUT THE STACK. 
STACK_LAYOUT 
STRUC 
OLD_BP 
DW? 
RETADDR 
DW? 
ERR_PROC_ADDR 
DD 
? 
CASE_NO 
DB? 
DB 
? 
STACK_LAYOUT 
ENDS 


; SET UP PARAMETER ADDRESSING 
PUSH 
BP 
MOV 
BP, SP 


; CODE TO SAVE CALLER'S 
REGISTERS COULD GO HERE. 


; CHECK THE CASE NUMBER 
MOV 
MOV 
CMP 
JLE 


BH,O 
BL, [BP].CASE_NO 
BX, LENGTH CASE_TABLE 
OK 
; ALL CONDITIONAL 
JUMPS 


; ARE SHORT DIRECT 


; CALL THE ERROR ROUTINE WITH A FAR 
INDIRECT CALL. A FAR INDIRECT CALL 
IS INDICATED SINCE THE OPERAND HAS 
TYPE "DOUBLEWORD." 


CALL 
[BP].ERR_PROC_ADDR 


; JUMP DIRECTLY TO A LABEL IN ANOTHER SEGMENT. 
A FAR DIRECT JUMP IS INDICATED SINCE 
THE OPERAND HAS TYPE "FAR." 


JMP 
ERR_EXIT 


OK: 
; MULTIPLY CASE NUMBER BY 2 TO GET OFFSET 
INTO CASE_TABLE 
(EACH ENTRY IS 2 BYTES). 


SHL 
BX,1 
; NEAR INDIRECT JUMP THROUGH SELECTED 
ELEMENT OF CASE_TABLE. 
A NEAR 
INDIRECT JUMP IS INDICATED SINCE THE 
OPERAND HAS TYPE "WORD." 
JMP 
CASE_TABLE 
[BX] 


ACTIONO: 
; EXECUTED IF CASE_NO 
= 0 
; CODE TO PROCESS THE ZERO CASE GOES HERE. 
; FOR ILLUSTRATION 
PURPOSES, USE A 
NEAR INDIRECT JUMP THROUGH A 
REGISTER TO BRANCH TO THE POINT 
WHERE ALL CASES CONVERGE. 
A DIRECT JUMP (JMP ENDCASE) IS 
ACTUALLY 
MORE APPROPRIATE 
HERE. 


MOV 
AX, OFFSET ENDCASE 
JMP 
AX 


ACTION1: 
; EXECUTED IF CASE_NO 
= 1 


; CALL A FAR EXTERNAL PROCEDURE. A FAR 
DIRECT CALL IS INDICATED SINCE OPERAND 
HAS TYPE "FAR." 
CALL 
FAR_PROC 


; CALL A NEAR EXTERNAL PROCEDURE. 


CALL 
NEAR_PROC 
; BRANCH TO CONVERGENCE 
POINT USING NEAR 
DIRECT JUMP. NOTE THAT "ENDCASE" 
IS MORE THAN 127 BYTES AWAY 
SO A NEAR DIRECT JUMP WILL BE USED. 


JMP 
ENDCASE 


ACTION2: 
; EXECUTED IF CASE_NO 
= 2 


; CODE GOES HERE 
JMP 
ENDCASE; 
NEAR DIRECT JUMP 


ACTION3: 
; EXECUTED IF CASE_NO 
= 3 


; CODE GOES HERE 
JMP 
ENDCASE; 
NEAR DIRECT JMP 


; ARTIFICIALLY 
FORCE "ENDCASE" 
FURTHER AWAY 
SO THAT ABOVE JUMPS CANNOT BE "SHORT." 


ORG 
500 


ACTION4: 
; EXECUTED IF CASE_NO 
= 4 
; CODE GOES HERE 
JMP 
ENDCASE; 
NEAR DIRECT JUMP 


ACTIONS: 
; EXECUTED IF CASE_NO 
= 5 


; CODE GOES HERE. 
; BRANCH TO CONVERGENCE 
POINT USING 
SHORT DIRECT JUMP SINCE TARGET IS 
WITHIN 127 BYTES. MACHINE INSTRUCTION 
HAS 1-BYTE DISPLACEMENT 
RATHER THAN 
2-BYTE DISPLACEMENT 
REQUIRED FOR 
NEAR DIRECT JUMPS. "SHORT" 
IS 
WRITTEN BECAUSE "ENDCASE" 
IS A FORWARD 
REFERENCE, WHICH ASSEMBLER ASSUMES IS 
"NEAR." 
IF "ENDCASE" 
APPEARED PRIOR 
TO THE JUMP, THE ASSEMBLER WOULD 
AUTOMATICALLY 
DETERMINE IF IT WERE REACHABLE 
WITH A SHORT JUMP. 


JMP 
SHORTENDCASE 


; POP CALLER'S 
REGISTERS HERE. 
; RESTORE BP & SP, DISCARD PARAMETERS 
AND RETURN TO CALLER. 


MOV 
SP, BP 
POP 
BP 
RET 
6 


ENDP 
ENDS 
END 


Figure 2-82 shows how the ASM-86 RECORD 
facility may be used to manipulate 
bit data. The 


example shows how to: 


• 
right-justify a bit field, 


• 
test for a value, 


• 
assign a constant known at assembly time, 


• 
assign a variable, 


• 
set or clear a bit field. 


DATA 
SEGMENT 


; DEFINE A WORD ARRAY 
XREF 
DW 3000 DUP (?) 


; EACH ELEMENT OF XREF CONSISTS OF 3 FIELDS: 
A 2-BIT TYPE CODE, 
A 1-BIT FLAG, 
; 
A 13-BIT NUMBER. 


; DEFINE A RECORD TO LAY OUT THIS ORGANIZATION. 
L1NE_REC 
RECORD 
LINE_TYPE: 
2, 


& 
VISIBLE: 1, 
& 
L1NE_NUM: 
13 
DATA 
ENDS 


CODE 
SEGMENT 
ASSUME CS: CODE, 
DS: DATA 
; ASSUME SEGMENT REGISTERS ARE SET UP PROPERLY 
AND THAT SIINDEXES 
AN ELEMENT OF XREF. 


; A RECORD FIELD-NAME 
USED BY ITSELF RETURNS' 


THE SHIFT COUNT REQUIRED TO RIGHT-JUSTIFY 


; 
THE FIELD. ISOLATE "LINE_TYPE" 
IN THIS 


; 
MANNER. 


MOV 
AL, XREF [SI] 
MOV 
CL, LINE_TYPE 
SHR 
AX, CL 


; THE "MASK" 
OPERATOR APPLIED TO A RECORD 
FIELD-NAME 
RETURNS THE BIT MASK 
REQUIRED TO ISOLATE THE FIELD WITHIN 
THE RECORD. CLEAR ALL BITS EXCEPT 
"L1NE_NUM." 


MOV 
DX, XREF[SI] 
AND 
DX, MASK L1NE_NUM 


; DETERMINE THE VALUE OF THE "VISIBLE" 
FIELD 
TEST 
XREF[SI], MASK VISIBLE 
JZ 
NOT_VISIBLE 
; NO JUMP IF VISIBLE = 1 
NOT_VISIBLE: 
;JUMPHEREIFVISIBLE=O 


; ASSIGN A CONSTANT KNOWN AT ASSEMBLY-TIME 
TO A FIELD, BY FIRST CLEARING THE BITS 
AND THEN OR'ING IN THE VALUE. IN 


. THIS CASE "LINE_TYPE" 
IS SET TO 2 (10B). 


. AND 
XREF[SI], NOT MASK LINE_TYPE· 


OR 
XREF[SI],2 SHL LINE_TYPE 
; THE ASSEMBLER 
DOES THE MASKING AND SHIFTING. 


; THE RESULT IS THE SAMEAS: 


AND 
XREF[SI], 3FFFH 
OR 
XREF[SI], 8000H 


" 
BullS 
MORE READABLE AND LESS SUBJECT 
, 
TO CLERICAL ERROR. 


; ASSIGN A VARIABLE 
(THE CONTENT OF AX) 
TO LINE_TYPE. 
MOV 
SHL 
AND 
OR 


CL, LINE_TYPE 
; SHIFT COUNT 
AX, CL 
; SHIFTTO "LINE 
UP" BITS 
XREF[SI], NOT MASK LINE_TYPE 
; CLEAR BITS 
XREF[SI], AX 
; OR IN NEW VALUE 


; NO SHIFT IS REQUIRED TO ASSIGN TO THE 
RIGHT-MOST FIELD. ASSUMING AX CONtAINS 
A VALID NUMBER (HIGH 3 BITS ARE 0), 
ASSIGN AX TO "L1NE_NUM." 


AND 
XREF[SI], NOT MASK L1NE_NUM 
OR 
XREF[SI], AX 


; A FIELD MAY BE SET OR CLEARED WITH 
ONE INSTRUCTION. 
CLEAR THE "VISIBLE" 
FLAG AND THEN SET IT. 
AND 
OR 
XREF[SI], NOT MASK VISIBLE 
XREF[SI], MASK VISIBLE 


ENDS 
END 


The following considerations 
apply to position- 
independent code sequences: 


• 
A label that is referenced by a direct FAR 
(intersegment) transfer is not moveable. 


• 
A label that 
is referenced 
by an indirect 


transfer (either NEAR or FAR) is moveable 
so long as the register or memory pointer to 
the label contains the label's current address. 


• 
A label that is referenced by a SHORT (e.g., 
conditional 
jump) 
or a direct NEAR 
(in- 


trasegment) transfer is moveable so long as 
the referencing instruction is moved with the 
label as a unit. 
These 
transfers 
are self- 


relative; 
that is they require only that the 


label maintain 
the same distance 
from the 


referencing instruction, 
and actual addresses 


are immaterial. 


• 
Data is segment-independent, 
but not offset- 


independent. 
That 
is, a data item may be 
moved to a different 
segment, but it must 
maintain the same offset from the beginning 
of the segment. Placing constants 
in a unit 
of 
code 
also 
effectively 
makes 
the 
code 
offset-dependent, 
and 
therefore 
is 
not 
recommended. 


• 
A procedure should not be moved while it is 
active or while any procedure it has called is 
active. 


• 
A section of code that has been interrupted 
should not be moved. 


The segment that is receiving a section of code 
must have "room" 
for the code. If the MOVS (or 


MOVSB or MOVSW) 
instruction 
attempts 
to 


auto-increment 
DI past 64k, it wraps around to 0 


and causes the beginning of the segment to be 
overwritten. 
If a segment override is needed for 


the source operand, code similar to the following 
can be used to properly resume the instruction if it 
is interrupted: 


RESUME: 
REP 
MOVS 
DESTINATION, ES:SOURCE 


;IF CX NOT ~ 0 THEN INTERRUPT HAS OCCURRED 


AND 
CX,CX 
; CX=O? 


JNZ 
RESUME 
;NO, FINISH EXECUTION 


;CONTROL COMES HERE WHEN STRING HAS BEEN MOVED. 


If 
the 
MOVS 
is 
interrupted, 
the 
CPU 


"remembers" 
the 
segment 
override, 
but 
"forgets" 
the presence of the REP prefix when 


execution resumes. Testing CX indicates whether 
the instruction is completed or not. Jumping back 
to the instruction resumes it where it left off. Note 
that a segment override cannot be specified with 
MOVSB or MOVSW. 


calls the procedure 
through 
this pointer. 
The 
supervisor 
also has·'access 
to the procedure's 


length in bytes. The procedure is moved with the 
MOVSB 
instruction. 
After 
the 
procedure 
is 


moved, its pointer is updated with the new loca- 
tion. The ASM-86 WORD PTR operator is writ- 
ten to inform the assembler that one word of the 
doubleword pointer is being updated at a time. 


Figure 2-83 ,illustrates one approach 
to moving 


programs in memory at execution time. A "super- 
visor" 
program 
(which 
is not 
moved) 
keeps 
a pointer variable that contains the current loca- 
tion (offset 
and 
segment 
base) of a position- 


independent 
procedure. 
The supervisor 
always 


MAIN_DATA 
SEGMENT 


; SET UP POINTERS TO POSITION-INDEPENDENT 
PROCEDURE 
AND FREE SPACE. 


PIP_PTR 
DD 
EXAMPLE 
FREE_PTR 
DD 
TARGET_SEG 
; SET UP SIZE OF PROCEDURE IN ByTES 
PIP_SIZE 
DW 
EXAMPLE_LEN 
MAIN_DATA 
ENDS 


STACK_TOP 
STACK 


SEGMENT 
DW 
20 DUP (?). 


LABEL 
WORD 
ENDS 


; 20 WORDS FOR STACK 


; TOS BEGINS HERE 


SOURCE_SEG 
SEGMENT 


; THE POSITION-INDEPENDENT 
PROCEDURE IS INITIALLY IN THIS SEGMENT. 


; OTHER CODE MAY PRECEDE IT, I.E., ITS OFFSET NEED NOT BE ZERO. 
ASSUME 
CS:SOURCE_SEG 
EXAMPLE 
PROC 
FAR 


; THIS PROCEDURE READS AN 8-BIT PORT UNTIL 
; BIT 3 OF THE VALUE READ IS FOUND SET. IT 
; THEN READS ANOTHER PORT. IF THE VALUE READ 
; IS GREATER THAN 10H IT WRITESTHE 
VALUE TO 


; A THIRD PORT AND RETURNS; OTHERWISE IT STARTS 
; OVER. 


STATUS_PORT 
EQU 
PORT_READY 
EQU 


INPUT_PORT 
EQU 
THRESHOLD 
EQU 
OUTPUT_PORT 
EQU 
CHECK_AGAIN: 
IN 
TEST 
JNE 
IN 
CMP 
JLE 
OUT 


ODOH 
008H 
OD2H 
010H 
OD4H 
AL,STATUS_PORT 
AL, PORT_READY 
CHECK_AGAIN 
AL,INPUT_PORT 
AL,THRESHOLD 
CHECK_AGAIN 
OUTPUT_PORT,AL 


; GET STATUS 
; DATA READY? 
; NO, TRY AGAIN 
; YES, GET DATA 
; > 10H? 
; NO, TRY AGAIN 
; YES, WRITE IT 


; GET PROCEDURE LENGTH 
EXAMPLE_LEN 
EQU 
(OFFSET THIS BYTE)-(OFFSET 
CHECK_AGAIN) 
ENDP 
EXAMPLE 
ENDP 
SOURCE_SEG 
ENDS 


TARGET_SEG 
SEGMENT 


; THE POSITION-INDEPENDENT 
PROCEDURE 
IS MOVED TO THIS SEGMENT, WHICH IS 


; 
INITIALLY "EMPTY." 
; IN TYPICAL SYSTEMS, A "FREE SPACE MANAGER" 
WOULD 
MAINTAIN 
A POOL OF AVAILABLE 
MEMORY SPACE 


; FOR ILLUSTRATION 
PURPOSES, ALLOCATE ENOUGH 
SPACE TO HOLD IT 
DB 
EXAMPLE_LEN 
DUP (?) 


TARGET_SEG 
ENDS 


MAIN_CODE 
SEGMENT 


; THIS ROUTINE CALLS THE EXAMPLE PROCEDURE 
; AT ITS INITIAL LOCATION, 
MOVES IT, AND 


; CALLS IT AGAIN ATTHE 
NEW LOCATION. 


ASSUME 
& 
CS:MAIN_CODE,SS:STACK, 
DS:MAIN_DAT 
A,ES:NOTHING 


; INITIALIZE SEGMENT REGISTERS & STACK POINTER. 
START: 
MOV 
AX,MAIN_DATA 
MOV 
DS,AX 
MOV 
AX,STACK 
MOV 
SS,AX 
MOV 
SP ,OFFSET STACK_TOP 


; CALL EXAMPLE AT INITIAL LOCATION. 


CALL 
PIP_PTR 


; SET UP CX WITH COUNT OF BYTES TO MOV 
MOV 
CX,PIP _SIZE 


; SAVE DS, SET UP DS/SI AND ESIDI TO 
POINTTO THE SOURCE AND DESTINATION 
ADDRESSES. 


PUSH 
LES 
LDS 


; MOVETHEPROCEDURE. 
CLD 
REPMOVSB 


DS 
DI,FREE_PTR 
SI,PIP_PTR 


; RESTORE OLD ADDRESSABILITY. 


MOV 
AX,DS 
; HOLD TEMPORARILY 


POP 
DS 


; UPDATE POINTER TO POSITION-INDEPENDENT 
PROCEDURE 
MOV 
WORD PTR PIP_PTR+2,ES 
SUB 
DI,PIP _SIZE 
; PRODUCES OFFSET 
MOV 
WORD PTR PIP_PTR,DI 


; UPDATE POINTER TO FREE SPACE 
MOV 
WORD PTR FREE_PTR+2,AX 
SUB 
SI,PIP _SIZE 
; PRODUCES OFFSET 
MOV 
WORD PTR FREE_PTR,SI 


; CALL POSITION-INDEPENDENT 
PROCEDURE AT 
NEW LOCATION AND STOP 
CALL 
PIP_PTR 
MAIN_CODE 
ENDS 
END 
START 


Figure 2-84 shows how memory-mapped 
I/O can 
be used to address a group 
of communication 


lines as an "array." 
In the example, 
indexed 


addressing is used to poll the array of status ports, 
one port at a time. Any of the other 8086/8088 
memory addressing modes may be used in con- 
junction 
with memory-mapped 
I/O 
devices as 
well. 


In figure 2-85 a MOYS instruction is used to per- 
form a high-speed transfer to a memory-mapped 
line printer. 
Using this technique 
requires 
the 


hardware to be set up as follows. Since the MaYS 


instruction 
transfers 
characters 
to 
successive 


memory addresses, the decoding logic must select 
the line printer if any of these locations is written. 
One way of accomplishing this is to have the chip 
select logic decode only the upper 12 lines of the 
address bus (AI9-A8), 
ignoring the contents 
of 


the lower eight lines (A7-AO). When data is writ- 
ten to any address 
in this 256-byte block, 
the 


upper 12 lines will not change, so the printer will 
be selected. 


If an 8086 is being used with an 8-bit printer, the 
8086's 16-bit data bus must be mapped into 8-bits 
by external hardware. 
Using an 8088 provides a 


more direct interface. 


COM_LINES 
SEGMENT 
AT 800H 


; THE FOLLOWING 
IS A MEMORY MAPPED "ARRAY" 
OF EIGHT 8-BIT COMMUNICATIONS 
CONTROLLERS 
(E.G.,8251 USARTS). PORTS HAVE ALL-ODD 
OR ALL-EVEN 
ADDRESSES (EVERY OTHER BYTE 
IS SKIPPED) FOR 8086-COMPATIBILITY. 


COM_DATA 
DB 
? 
DB 
? 
; SKIP THIS ADDRESS 
COM_STATUS 
DB 
? 
DB 
? 
; SKIP THIS ADDRESS 
DB 
28 
DUP(?) 
; REST OF "ARRAY" 
COM_LINES 
ENDS 


CODE 
SEGMENT 


; ASSUME STACK IS SET UP, AS ARE SEGMENT 
REGISTERS (OS POINTING TO COM_LINES). 
FOLLOWING CODE POLLS THE LINES. 


CHAR_RDY 
EQU 
START_POLL: 
MOV 
SUB 


00000010B 
CX,8 
SI,SI 


; CHARACTER PRESENT 
; POLL 8 LINES ZERO 
; ARRAY INDEX 


POLL_NEXT: 
TEST 
JE 
ADD 
LOOP 


READ_CHAR: 
MOV 


; ETC. 
CODE 
ENDS 
END 


COM_STATUS 
[SI), CHAR_RDY 
READ_CHAR;READIFPRESENT 
SI,4 
; ELSE BUMP TO NEXT LINE 
POLLNEXT 
; CONTINUE POLLING UNTIL 
; 
ALL 8 HAVE BEEN CH ECKED 
START_POLL; 
START OVER 


AL,COM_DATA 
[SI] 
;GETTHE 
DATA 


, PRINTER 
SEGMENT 
; THIS SEGMENT CONTAINS A "STRING" 
THAT 
, 
IS ACTUALLY 
A MEMORY-MAPPED 
LINE PRINTER. 
THE SEGMENT (PRINTER) MUST BE ASSIGNED (LOCATED) 
TO A BLOCK OF THE ADDRESS SPACE SUCH 
THAT WRITING TO ANY ADDRESS IN THE 
BLOCK SELECTS THE PRINTER. 


PRINT_SELECT 
DB 133 
DUP (?) 
DB123 
DUP(?) 
PRINTER 
ENDS 


; "STRING" 
REPRESENTING 
PRINTER 


; REST OF 256-BYTE BLOCK 


DATA 
SEGMENT 
PRINT_BUF 
DB133 
DUP(?) 
PRINT_COUNT 
DB 1 
? 


; OTHER PROGRAM DATA 
DATA 
ENDS 


; LINE TO BE PRINTED 
; LINE LENGTH 


CODE 
SEGMENT 


; ASSUME STACK AND SEGMENT REGISTERS HAVE 
BEEN SET UP (DS POINTS TO DATA SEGMENT). 
FOLLOWING CODE TRANSFERS A LINE TO 
THE PRINTER. 


ASSUME 
MOV 
MOV 
SUB 
SUB 
MOV 
CLD 
MOVS 
; ETC. 
ENDS 
END 


ES: PRINTER 
AX, PRINTER 
; PREVENT SEGMENT OVERRIDE 
ES,AX 
DI, DI 
; CLEAR SOURCE AND 
SI, SI 
DESTINATION 
POINTERS 
CX, PRINT_COUNT 
; AUTO-INCREMENT 
PRINT_SELECT, 
PRINT_BUF 


that saves the byte located at that address and 
replaces it with an INT 3 (breakpoint) instruction. 
When 
the 
CPU 
encounters 
the 
breakpoint 


instruction, it calls the type 3 interrupt procedure. 
In the example, this procedure 
places the pro- 


cessor into single-step 
mode 
starting 
with the 


instruction where the breakpoint was placed. 


Figure 2-86 illustrates how a program may set a 
breakpoint. 
In 
the 
example, 
the 
breakpoint 
routine puts the processor into single-step mode, 
but the same general approach could be used for 
other purposes 
as well. A program 
passes the 


address where the break is to occur to a procedure 


INT_PTR_ 
TAB 
SEGMENT 
; INTERRUPT POINTER TABLE-LOCATE 
AT OH 
TYPE_O 
DD 
? 
TYPE_1 
DD 
SINGLE_STEP 
TYPE_2 
DD 
? 
TYPE_3 
DD 
BREAKPOINT 
INT_PTR_TAB 
ENDS 


; NOT DEFINED IN EXAMPLE 


; NO~ DEFINED IN EXAMPLE 


SAVE_SEG 
SAVE_INSTR 
SEGMENT 
DB 1 
DUP (?) 
; INSTRUCTION 
REPLACED 
; BY BREAKPOINT 


MAIN_CODE 
SEGMENT 


; ASSUME STACK AND SEGMENT REGISTERS ARE SET UP. 


; ENABLE SINGLE-STEPPING 
WITH INSTRUCTION AT 
LABEL "NEXT" 
BY PASSING SEGMENT AND 
OFFSETOF 
"NEXT" 
TO "SET_BREAK" 
PROCEDURE 
PUSH 
CS 
LEA 
AX, CS: NEXT 
PUSH 
AX 
CALL 
FAR SET_BREAK 


; ETC. 


NEXT: 
IN 
; ETC. 


MAIN_CODE 
ENDS 


BREAK 
SEGMENT 
SET_BREAK 
PROC 
FAR 


; THIS PROCEDURE SAVES AN INSTRUCTION 
BYTE (WHOSE 
ADDRESS IS PASSED BY THE CALLER) AND WRITES 
AN INT 3 (BREAKPOINT) 
MACHINE INSTRUCTION 
ATTHE TARGET ADDRESS. 


; SET UP BP FOR PARM ADDRESSING & SAVE REGISTERS 
PUSH 
BP 
MOV 
BP,SP 
PUSH 
OS 
PUSH 
ES 
PUSH 
AX 
PUSH 
BX 


; POINT DS/BX TO THE TARGET INSTRUCTION 
LOS 
BX,TARGET 


; POINT ES TO THE SAVE AREA 
MOV 
AX, SAVE_SEG 
MOV 
ES,AX 


; SWAP THE TARGET INSTRUCTION 
FOR INT 3 (OCCH) 
MOV 
AL,OCCH 
XCHG 
AL, OS: [BX] . 


; SAVE THE TARGET INSTRUCTION 
MOV 
ES: SAVE_INSTR, 
AL 


; RESTORE AND RETURN 
POP 
BX 
POP 
AX 
POP 
ES 
POP 
OS 
POP 
BP 
RET 
4 
SET_BREAK 
ENDP 


BREAKPOINT 
PROC 
FAR 


; THE CPU WILL ACTIVATE THIS PROCEDURE WHEN IT 
EXECUTES THE INT 3 INSTRUCTION SET BY THE 
SET_BREAK 
PROCEDURE. THIS PROCEDURE 
RESTORES THE SAVED INSTRUCTION 
BYTE TO ITS 
ORIGINAL LOCATION AND BACKS UPTHE 
INSTRUCTION 
POINTER IMAGE ON THE STACK 
SO THAT EXECUTION WILL RESUME WITH 
THE RESTORED INSTRUCTION. 
IT THEN SETS 
TF (THE TRAP FLAG) IN THE FLAG-IMAGE 
ON THE STACK. THIS PUTS THE PROCESSOR 
IN SINGLE-STEP 
MODE WHEN EXECUTION 
RESUMES. 


FLAG_IMAGE 
EQU 
WORD PTR [BP+6] 
IP_IMAGE 
EQU 
WORD PTR [BP + 2] 
NEXT_INSTR 
EQU 
DWORD PTR [BP+2] 


; SET UP BPTO ADDRESS STACK AND SAVE REGISTERS 
PUSH 
BP 
MOV 
BP, SP 
PUSH 
OS 
PUSH 
ES 
PUSH 
AX 
PUSH 
BX 


; POINT ES AT THE SAVE AREA 
MOV 
AX, SAVE_SEG 
MOV 
ES. AX 


;GETTHESAVEDBYTE 
MOV 
AL. ES: SAVE_INSTR 


; GET THE ADDRESS OF THE TARGET + 1 
(INSTRUCTION 
FOLLOWING THE BREAKPOINT) 
LDS 
BX, NEXT_INSTR 


; BACK UP IP-IMAGE (IN BX) AND REPLACE ON STACK 
DEC 
BX 
MOV 
IP_IMAGE, 
BX 


; RESTORE THE SAVED INSTRUCTION 
MOV 
DS: [BX], AL 
; SET TF ON STACK 
AND 
FLAG_IMAGE,0100H 
; RESTORE EVERYTHING AND EXIT 
POP 
BX 
POP 
AX 
POP 
ES 
POP 
DS 
POP 
BP 
IRET 
BREAKPOINT 
ENDP 


SINGLE 
STEP 
PROC 
FAR 
; ONCE SINGLE-STEP 
MODE HAS BEEN ENTERED, 
THE CPU "TRAPS" 
TO THIS PROCEDURE 
AFTER EVERY INSTRUCTION THAT IS NOT IN 
AN INTERRUPT PROCEDURE. IN THE CASE 
OF THIS EXAMPLE, THIS PROCEDURE WILL 
BE EXECUTED IMMEDIATELY 
FOLLOWING THE 
"IN AL, OFFFH" INSTRUCTION (WHERE THE 
BREAKPOINT 
WAS SET) AND AFTER EVERY 
SUBSEQUENT 
INSTRUCTION. 
THE PROCEDURE 
COULD "TURN 
ITSELF OFF" BY CLEARING 
TF ON THE STACK. 
; SINGLE-STEP CODE GOES HERE. 
; SINGLE_STEP 
ENDP 


ENDS 


END 


Figure 2-87 is a block diagram of a hypothetical 
system that is used to illustrate 
three different 
examples 
of 
interrupt 
handling: 
an 
external 


(maskable) 
interrupt, 
an external non-mask able 


interrupt and a software interrupt. 


In this hypothetical 
system,' an 8253 Program- 


mable Interval Timer is used to generate a time 
base. One of the three timers on the 8253 is pro- 
grammed 
to 
repeatedly 
generate 
interrupt 


requests at 50 millisecond intervals. The output 
from this timer is tied to one of the eight interrupt 
request lines of an 8259A Programmable 
Inter- 


rupt Controller. The 8259A, in turn, is connected 
to the INTR line of an 8086 or 8088. 


COLD 
START-1 
~ 


ADDRESS 
BUS 


DATA 
BUS 


CONTROL 
BUS 


BATTERY 
POWERED 
RAM 


A power-down 
circuit is used in the system to 
illustrate one application 
of the 8086/8088 
NMI 
(non-mask able 
interrupt) 
line. 
If 
the 
ac line 


voltage 
drops 
below 
a certain 
threshold, 
the 
power supply activates ACLO. The power-down 
circuit then sends a power-fail 
interrupt 
(PFI) 
pulse 
to 
the 
CPU's 
NMI 
input. 
After 
5 
milliseconds, 
the power-down 
circuit 
activates 
MPRO 
(memory 
protect) 
to 
disable 
reading 
from and writing to the system's battery-powered 
RAM. This protects the RAM from fluctuations 
that may occur when power is actually lost 7.5 
milliseconds after the power failure is detected. 
The system software must save all vital informa- 
tion in the battery-powered 
RAM segment within 
5 milliseconds of the activation of NMI. 


When 
power 
returns, 
the power-down 
circuit 
activates 
the system RESET line. Pressing 
the 


"cold 
start" 
switch 
also 
produces 
a 
system 
RESET. The PFS (power fail status) line, which is 


connected to the low-order bit of port EO, iden- 
tifies the source of the RESET. If the bit is set, the 
software executes a "warm 
start" 
to restore the 


information 
saved by the power-fail 
routine. 
If 
the PFS bit is cleared, 
the software executes a 


"cold start" 
from the beginning of the program. 


In either case, the software writes a "one" 
to the 


low-order bit of port E2. This line is connected to 
the power-down circuit's PFSR (power fail status 
reset) signal and is used to enable the battery- 
powered RAM segment. 


A software interrupt 
is used to update a simple 


real-time 
clock. 
This 
procedure 
is written 
in 


PL/M-86, while the rest of the system is written in 
ASM-86 to demonstrate 
the interrupt 
handling 


capability of both languages. The system's main 
program 
simply initializes the system following 


receipt 
of 
a RESET 
and 
then 
waits 
for 
an 
interrupt. An example of this interrupt procedure 
is given in figure 2-88. 


INT_POINTERS 
SEGMENT 


; INTERRUPT POINTER TABLE, LOCATE ATOH, ROM-BASED 
TYPE_O 
DO? 
; DIVIDE-ERRORNOT SUPPLIED IN EXAMPLE. 


TYPE_1 
DO? 
; SINGLE-STEP NOT SUPPLIED IN EXAMPLE. 


TYPE_2 
DO 
POWER_FAIL; 
NON-MASKABLE INTERRUPT 
TYPE_3 
DO? 
; BREAKPOINT NOT SUPPLIED IN EXAMPLE. 


TYPE_4 
DO? 
; OVERFLOW NOT SUPPLIED IN EXAMPLE. 
; SKIP RESERVED PART OF EXAMPLE 
ORG 
32"4 
DO 
? 
DO 
? 
DO 
? 
DO 
TIMER_PULSE 
DO 
? 
DO 
? 
DO 
? 
DO 
? 


TYPE_32 
TYPE_33 
TYPE_34 
TYPE_35 
TYPE_36 
TYPE_37 
TYPE_38 
TYPE_39 


; 8259AIRO- AVAILABLE 
; 8259AIR1 - AVAILABLE 
; 8259AIR2 - AVAILABLE 
; 8259AIR3 
; 8259AIR4- AVAILABLE 
; 8259AIR5- AVAILABLE 
; 8259AIR6- AVAILABLE 
; 8259AIR7- A~AILABLE 


BATTERY 
SEGMENT 
; THIS RAM SEGMENT IS BATTERY-POWERED. IT CONTAINS VITAL DATA 
; 
THAT MUST BE MAINTAINED DURING POWER OUTAGES. 
STACK_PTR 
OW? 
;SPSAVEAREA 
STACK_SEG 
OW? 
;SSSAVEAREA' 


; SPACE FOR OTHER VARIABLES COULD BE DEFINED HERE. 
BATTERY 
ENDS 


DATA 
SEGMENT 


; RAM SEGMENT THAT IS NOT BACKED UP BY BATTERY 
N_PULSES 
DB 
1 DUP (0) 
; #TIMER PULSES 


; ETC. 
DATA 


STACK 
SEGMENT 


; LOCATED IN BATTERY-POWERED RAM 
OW 
100DUP (?) 


STACK_TOP 
STACK 
WORD 
ENDS 


INTERRUPT_HANDLERS 
SEGMENT 


; INTERRUPT PROCEDURES EXCEPT TYPE 40 (PLlM-86) 


POWER_FAIL 
PROC 
; TYPE 2 INTERRUPT 


; POWER FAIL DETECT CIRCUIT ACTIVATES NMI LINE ON CPU IF POWER IS 
ABOUT TO BE LOST. THIS PROCEDURE SAVES THE PROCESSOR STATE IN 
RAM (ASSUMED TO BE POWERED BY AN AUXILIARY SOURCE) SOTHAT IT 
CAN BE RESTORED BY A WARM START ROUTINE IF POWER RETURNS 


; IP, CS, AND 
FLAGS 
ARE ALREADY 
ON THE STACK. 
SAVE THE OTHER 
REGISTERS. 
PUSH 
AX 
PUSH 
BX 
PUSH 
CX 
PUSH 
DX 
PUSH 
SI 
PUSH 
DI 
PUSH 
BP 
PUSH 
DS 
PUSH 
ES 


; CRITICAL 
MEMORY 
VARIABLES 
COULD 
ALSO 
BE SAVED 
ON THE STACK 
AT THIS 
POINT. 
ALTERNATIVELY, 
THEY 
COULD 
BE DEFINED 
IN THE "BATTERY" 


SEGMENT, 
WHERE 
THEY 
WILL 
AUTOMATICALLY 
BE PROTECTED 
IF MAIN 
POWER 
IS LOST. 


; SAVE SP AND SS IN FIXED 
LOCATIONS 
THAT 
ARE KNOWN 
BY WARM 
START 
ROUTINE. 


MOV 
AX,BATTERY 
MOV 
ES,AX 


MOV 
ES:STACK_PTR,SP 
MOV 
ES:STACK_SEG,SS 
; STOP GRACEFULLY 
HLT 


TIMER_PULSE 
PROC 
; TYPE 35 INTERRUPT 


; THIS 
PROCEDURE 
HANDLES 
THE 50MS INTERRUPTS 
GENERATED 
BY THE 8253. 


IT COUNTS 
THE INTERRUPTS 
AND ACTIVATES 
THE TYPE 40 INTERRUPT 
PROCEDURE 
ONCE 
PER SECOND. 


; DS IS ASSUMED 
TO BE POINTING 
TO THE DATA 
SEGMENT 


; THE 8253 IS RUNNING 
FREE, 
AND AUTOMATICALLY 
LOWERS 
ITS INTERRUPT 
REQUEST. 
IF A DEVICE 
REQUIRED 
ACKNOWLEDGEMENT, 
THE CODE 
MIGHT 
GO HERE. 


; NOW PERFORM 
PROCESSING 
THAT 
MUST 
NOT BE INTERRUPTED 
(EXCEPT 
FOR NMI). 


INC 
N_PULSES 


; ENABLE 
HIGHER-PRIORITY 
INTERRUPTS 
AND DO LESS CRITICAL 
PROCESSING 
STI 
CMP 
N_PULSES,200; 
1 SECOND 
PASSED? 


JBE 
DONE 
; NO, GO ON. 
MOV 
N_PULSES,O 
; YES, RESET COUNT. 


INT 
40 
; UPDATE 
CLOCK 
; SEND 
NON-SPECIFIC 
END-OF-INTERRUPT 
COMMAND 
TO 8259A, ENABLING 
EQUAL 


; 
OR LOWER 
PRIORITY 
INTERRUPTS. 


DONE: 
MOV 
AL,020H 
; EOI COMMAND 
OUT 
OCOH,AL 
; 8259A PORT 
IRET 
TIMER_PULSE 
ENDP 


CODE 
SEGMENT 


; THIS SEGMENT 
WOULD 
NORMALLY 
RESIDE 
IN ROM. 


11'111 
rnvv 
1"~Mn 
; THIS 
PROCEDURE 
IS CALLED 
FOR BOTH WARM 
AND COLD 
STARTS 
TO INITIALIZE 
THE 8253 AND THE 8259A. THIS 
ROUTINE 
DOES NOT USE STACK, 
DATA, 
OR 
EXTRA 
SEGMENTS, 
AS THEY 
ARE NOT SET PREDICTABLY 
DURING 
A WARM 
START. 
INTERRUPTS 
ARE DISABLED 
BY VIRTUE 
OF THE SYSTEM 
RESET. 


; INITIALIZE 
8253 COUNTER 
1 - OTHER 
COUNTERS 
NOT USED. 
; CLK INPUTTO 
COUNTER 
IS ASSUMED 
TO BE 1.23 MHZ. 


L050MS 
HI50MS 
CONTROL 
COUNT_1 
MODE2 


EQU 
EQU 
EQU 
EQU 
EQU 


; COUNT 
VALUE 
IS 
; 
61440 DECIMAL. 
; CONTROL 
PORT ADDRESS 
; COUNTER 
1 ADDRESS 
; MODE 2, BINARY 


OOOH 
OFOH 
OD6H 
OD2H 
01110100B 


MOV 
DX,CONTROL 
; LOAD CONTROL 
BYTE 
MOV 
AL,MODE2 
OUT 
DX,AL 
MOV 
DX,COUNT_1 
; LOAD 
50MS DOWNCOUNT 
MOV 
AL,L050MS 
OUT 
DX,AL 
MOV 
AL,HI50MS 
OUT 
DX,AL 


; COUNTER 
NOW RUNNING, 
INTERRUPTS 
STILL 
DISABLED. 


; INITIALIZE 
8259A TO: SINGLE 
INTERRUPT 
CONTROLLER, 
EDGE-TRIGGERED, 
INTERRUPT 
TYPES 
32-40 (DECIMAL) 
TO BE SENT TO CPU FOR INTERRUPT 
REQUESTS 
0-7 RESPECTIVELY, 
8086 MODE, 
NON-AUTOMATIC 
END-OF-INTERRUPT. 
MASK 
OFF UNUSED 
INTERRUPT 
REQUEST 
LINES. 


ICW1 
ICW2 
ICW4 
OCW1 
PORT_A 
PORT_B 


00010011 B 
00100000B 
00000001 B 
11110111 B 
OCOH 
OC2H 


; EDGE-TRIGGERED, 
SINGLE 
8259A, ICW4 REQUIRED. 
; TYPE 20H, 32 - 400 
.; 8086 MODE, 
NORMAL 
EOI 
; MASK 
ALL 
BUT IR3 
; ICW1 WRITTEN 
HERE 
; OTHER 
ICW'S 
WRITTEN 
HERE 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


MOV 
DX,PORT_A 
; WRITE 
1ST ICW 
MOV 
AL,ICW1 
OUT 
DX,AL 
MOV 
DX,PORT _B 
; WRITE 
2ND ICW 
MOV 
AL,ICW2 
OUT 
DX,AL 
MOV 
AL,ICW4 
; WRITE 
4TH ICW 
OUT 
DX,AL 
MOV 
AL,OCW1 
; MASK 
UNUSED 
IR'S 
OUT 
DX,AL 
; INITIALIZATION 
COMPLETE, 
INTERRUPTS 
STILL 
DISABLED 
RET 
INIT 
ENDP 


USER_PGM: 
; "REAL" 
CODE WOULD 
GO HERE. 
THE EXAMPLE 
EXECUTES 
AN ENDLESS 
LOOP 
UNTIL 
AN INTERRUPT 
OCCURS. 


JMP 
USER_PGM 


; EXECUTION 
STARTS 
HERE WHEN 
CPU IS RESET. 


POWER_FAlL_STATUS 
EQU 
OEOH 
ENABLE_RAM 
EQU 
OE2H 
; PORT ADDRESS 
; PORT ADDRESS 


; ENABLE BATTERY-POWERED RAM SEGMENT 
START: 
MOV 
AL,001H 
OUT 
ENABLE_RAM,AL 


; DETERMINE WARM OR COLD START 
IN 
AL:.,POWER_FAIL_STATUS 
RCR 
AL,1 
; ISOLATE LOW BIT 
JC 
WARM_START 


COLD_START: 
; INITIALIZE SEGMENT REGISTERS AND STACK POINTER. 


ASSUME· 'CS:CODE,DS:DATA,SS:STACK,ES:NOTHING 
; RESET TAKES CARE OF CS AND IP. 
MOV 
AX,DATA 
MOV 
DS,AX 
MOV 
AX,STACK 
MOV 
SS,AX 
MOV 
SP,OFFSET STACK_TOP 


; INITIALIZE 8253AND 8259A. 
CALL 
INIT 


; ENABLE INTERRUPTS 
STI 


; START MAIN PROCESSING 
JMP 


WARM_START: 
; INITIALIZE 8253AND 8259A. 
CALL 


; RESTORE SYSTEM TO STATE ATTHE TIME POWER FAILED 


; MAKE BATTERY SEGMENT ADDRESSABLE 
MOV 
AX,BATTERY 
MOV 
DX,AX 


; VARIABLES SAVED IN THE "BATTERY" 
SEGMENT WOULD BE MOVED 
BACK TO UNPROTECTED RAM NOW. SEGMENT REGISTERS AND 
"ASSUME" 
DIRECTIVES WOULD HAVE TO BE WRITTEN TO GAIN 
ADDRESSABILITY. 


; RESTORE THE OLD STACK 
MOV 
SS,DS:STACK_SEG 
MOV 
SP,DS:STACK_PTR 


; RESTORE THE OTHER REGISTERS 
POP 
ES 
POP 
DS 
POP 
BP 
POP 
DI 
POP 
SI 
POP 
DX 
POP 
CX 


POP 
BX 


POP 
AX 
; RESUME THE ROUTINE THAT WAS EXECUTING WHEN NMI WAS ACTIVATED. 


I.E., POP CS,IP, & FLAGS, EFFECTIVELY "RETURNING" 
FROM THE 
NMI PROCEDURE. 
IRET 
ENDS 


; TERMINATE ASSEMBLY AND MARK BEGINNING OF THE PROGRAM. 


END 
START 


TYPE$40: 
DO; 
DECLARE 
(HOUR, 
MIN, 
SEC) BYTE PUBLIC; 
UPDATE$TOD: 
PROCEDURE 
INTERRUPT 
40; 
I'THE 
PROCESSOR 
ACTIVATES 
THIS 
PROCEDURE 
'TO 
HANDLE 
THE SOFTWARE 
INTERRUPT 
'GENERATED 
EVERY 
SECOND 
BY THE TYPE 35 
'EXTERNAL 
INTERRUPT 
PROCEDURE, 
THIS 
'PROCEDURE 
UPDATES 
A REAL-TIME 
CLOCK, 


'IT 
DOES 
NOT PRETEND 
TO BE "REALISTIC" 
'AS 
THERE 
IS NO WAY TO SETTHE 
CLOCK,'I 


SEC=SEC 
+ 1; 
IF SEC = 60 THEN 
DO; 
SEC = 0; 
MIN = MIN + 1; 
IF MIN = 60 THEN 
DO; 
MIN=O; 
HOUR=HOUR 
+ 1; 
IF HOUR 
= 24 THEN 
DO; 
HOUR 
= 0; 
END; 
END;. 


END; 
END UPDATEHOD; 
END; 


(the index register is auto-decremented) 
to find 
the last period (".") 
in the string. Finally a byte 
string 
of EBCDIC 
characters 
is translated 
to 
ASCII. The translation 
is stopped at the end of 
the string or when a carriage return character is 
encountered, 
whichever occurs first. This is an 
example of using the string primitives in combina- 
tion with other instructions to build up more com- 
plex string processing operations. 


Figure 2-89 illustrates typical use of string instruc- 
tions and repeat prefixes. The XLAT instruction 
also is demonstrated. 
The first example simply 
moves 80 words of a string using MOVS. Then 
two 
byte 
strings 
are 
compared 
to 
find 
the 
alphabetically lower string, as might be done in a 
sort. Next a string is scanned from right to left 


ALPHA 
SEGMENT 
; THIS IS THE DATA THE STRING INSTRUCTIONS WILL USE 
OUTPUT 
OW 100 
DUP (?) 
INPUT 
OW 100 
DUP (?) 
NAME_1 
DB 'JONES, JON A' 
NAME_2 
DB 'JONES, JOHN' 
SENTENCE 
DB 80 
DUP (?) 
EBCDIC_CHARS 
DB 80 
DUP (?) 
ASCII_CHARS 
DB 80 
DUP (?) 
CONV_ TAB 
DB 64 
DUP(OH) 
; EBCDIC TO ASCII 


; ASCII NULLS ARE SUBSTITUTED 
FOR "UNPRINTABLE" 
CHARS 
DB 1 
20H 
DB 9 
DUP (OH) 
DB? 
'¢','.','<','(','+',OH,'&' 


DB 9 
DUP (OH) 
DBB 
I!', '$', '*', ')',';', 
I I, '_','/' 


DB 8 
DUP (OH) 
DB6 
I 
't I,', 1%', '_', 
I>', I?' 
DB 9 
DUP (OH) 
D817 
I ',':','#','@','''','=','IJ', 
OH, 'a', 'b', Ie', cd', Ie', If', ~g', 'hi, Ii' 
DUP (OH) 


ALPHA 
ENDS 


STACK 
SEGMENT 
DW 100 
.DUP (?) 


STACK_BASE 
LABEL 
WORD 


\ . STACK 
ENDS 


CODE 
SEGMENT 
BEGIN: 
; SET UP SEGMENT REGISTERS. NOTICE THAT 
; ES & DS POINT 1:0 THE SAME SEGMENT, MEANING 
; THAT THE CURRENT EXTRA & DATA 
; SEGMENTS FULLY OVERLAP. THIS ALLOWS 
; ANY STRJNG IN "ALPHA" 
TO BE USED 
; AS A SOURCE OR A DESTINATION. 
ASSUME CS: CODE, SS: STACK, 


& 
DS:ALPHA,ES:ALPHA 
MOV 
AX, STACK 
MOV 
SS, AX 
MOV 
SP, OFFSET STACK~BASE; 
INITIAL TOS 
MOV 
AX, ALPHA 
MOV 
DS, AX 
MOV 
ES, AX 


; MOVE THE FIRST 80 WORDS OF "INPUT" 
TO 
THE LAST 80 WORDS OF "OUTPUT". 


LEA 
SI, INPUT 
; INITIALIZE 
LEA 
DI, OUTPUT + 20 
; INDEX REGISTERS 


DB? 
DB9 
DB? 
DB9 
DB22 
DB 10 
DB6 
DB 10 
DB6 
DB10 
DB6 
DB 10 
DB6 


Ij', 'kit 'I', 'm', In', 'a', 'pi, 'q', 'r' 
DUP (OH) 
I~', '5', 
'1', lU', 
'v', 'Wi, 'x', 
'y', 'z' 


DUP (OH) 


I 
" 'A', 'B', 'e', '0', 'E', 'F', 'G', 'H', 'I' 


; THIS IS AN ARBITRARY STACK SIZE 
; FOR ILLUSTRATION 
ONLY. 
; INITIAL TOS 


MOV 
CLD 
REP 
MOVS 


; REPETITION COUNT 
; AUTO-INCREMENT 


; FIND THE ALPHABETICALLY 
LOWER OF 2 NAMES. 


MOV 
SI, OFFSET NAME_1 
MOV 
01, OFFSET NAME_2 
MOV 
CX, SIZE NAME_2 
CLD 
REPE 
CMPS 
JB 
NAME_2, 
NAME_1 
NAME_2_LOW 
; NOT IN THIS EXAMPLE 
; CONTROL COMES HERE IN THIS EXAMPLE. 
; 01POINTS TO BYTE ('H') THAT 
; COMPARED UNEQUAL. 


; ALTERNATIVE 
; TO LEA 
; CHAR. COUNT 
; AUTO-INCREMENT 
"WHILE 
EQUAL" 


NAME_1_LOW: 
NAME_2_LOW: 


; FIND THE LAST PERIOD ('.') IN A TEXT STRING. 


MOV 
01, OFFSET SENTENCE + 
& 
LENGTH SENTENCE 
; START AT END 
MOV 
CX, SIZE SENTENCE 
STD 
; AUTO-DECREMENT 
MOV 
AL, '.' 
; SEARCH ARGUMENT 
REPNE 
SCAS 
SENTENCE 
; "WHILE 
NOT =" 


JCXZ 
NO_PERIOD 
; IF CX=D, NO PERIOD FOUND 
PERIOD: 
; IF CONTROL COMES HERE THEN 
; 
01 POINTS TO LAST PERIOD IN SENTENCE. 


NO_PERIOD: 
; ETC. 


; TRANSLATE 
A STRING OF EBCDIC CHARACTERS 
TO ASCII, STOPPING IF A CARRIAGE RETURN 
(DOHASCII) IS ENCOUNTERED. 


MOV 
BX,OFFSETCONV 
__TAB; 
POINTTO TRANSLATE 
TABLE 
MOV 
SI, OFFSET EBCDIC __CHARS 
; INITIALIZE 
MOV 
01, OFFSET ASCII_CHARS 
INDEX REGISTERS 
MOV 
CX, SIZE ASCII_CHARS 
; 
AND COUNTER 
CLD 
; AUTO-INCREMENT 
NEXT: 
LODS 
EBCDIC_CHARS 
; NEXT EBCDIC CHAR IN AL 
XLAT 
CONV_TAB 
; TRANSLATE TO ASCII 
STOS 
ASCII_CHARS 
; STORE FROM AL 
TEST 
AL, DOH 
; IS IT CARRIAGE RETURN? 


LOOPNE 
NEXT 
; NO, CONTINUE WHILE CX NOT D 
JE 
CR_FOUND 
; YES, JUMP 


; CONTROL COMES HERE IF ALL CHARACTERS 
HAVE BEEN TRANSLATED 
BUT NO 
; 
CARRIAGE RETURN IS PRESENT. 
; ETC. 


; 01-1 POINTS TO THE CARRIAGE RETURN 
IN ASCII_CHARS. 


ENDS 
END 


The iAPX 8089 
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Processor 
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This chapter 
describes 
the 8089 Input/Output 
Processor 
(lOP). 
Its 
organization 
parallels 
Chapter 
2; that 
is, sections 
generally 
proceed 
from hardware to software topics as follows: 


I. 
Processor Overview 


2. Processor Architecture 


3. Memory 


4. Input/Output 


5. Multiprocessing Features 


6. Processor Control and Monitoring 


7. Instruction Set 


8. Addressing Modes 


9. Programming 
Facilities 


10. Programming Guidelines and Examples 


As in Chapter 
2, the discussion is confined 
to 


covering the hardware 
in functional 
terms; tim- 


ing, electrical characteristics 
and other physical 


interfacing data are provided in Chapter 4. 


The 
8089 Input/Output 
Processor 
is a high- 


performance, 
general-purpose 
I/O 
system 


implemented on a single chip. Within the 8089 are 
two independent 
I/O 
channels, 
each of which 


combines attributes of a CPU with those of a very 
flexible DMA (direct memory access) controller. 
For example, channels can execute programs like 
CPUs; the lOP instruction 
set has about 50 dif- 


ferent types of instructions 
specifically designed 


for efficient input/output 
processing. Each chan- 


nel also can perform high-speed DMA transfers; a 
variety of optional operations allow the data to be 
manipulated 
(e.g., translated or searched) as it is 


transferred. 
The 8089 is contained 
in a 40-pin 


dual in-line package 
(figure 3-1) and operates 
from a single + 5V power source. An integral 
member of the 8086 family, the lOP is directly 
compatible 
with both the 8086 and 8088 when 


these 
processors 
are 
configured 
in maximum 


mode. The lOP also may be used in any system 
that incorporates 
Intel's 
Multibus™ 
shared bus 


architecture, 
or a superset 
of the Multibus ™ 


design. 


Vss 
Vcc 


Al41D14 
Al51D1S 


Al31D13 
Al81S3 


Al21D12 
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Figure 3-1. 8089 Input/Output 
Processor 


Pin Diagram 


Figure 3-2 depicts the general trend in CPU and 
I/O device relationships 
in the first three genera- 


tions of microprocessors. 
First generation CPUs 


were forced 
to 
deal 
directly 
with 
substantial 


numbers of TTL components, 
often performing 


transfers 
at the bit level. Only a very limited 


number 
of 
relatively 
slow 
devices 
could 
be 


supported. 


Single-chip interface controllers were introduced 
in the second generation. 
These devices removed 


the lowest level of device control from the CPU 
and let the CPU transfer 
whole bytes at once. 


With the introduction 
of DMA controllers, 
high- 


speed devices could be added to a system, and 
whole blocks of data could be transferred without 
CPU 
intervention. 
Compared 
to the previous 


generation, 
I/O 
device 
and 
DMA 
controllers 


allowed microprocessors 
to be applied to prob- 


lems that required moderate levels of I/O, both in 
terms of the numbers of devices that could be sup- 
ported and the transfer 
speeds of those devices. 


The 
controllers 
themselves, 
however, 
stilI 
required a considerable amount of attention from 
the CPU, 
and in many cases the CPU had to 
respond to an interrupt 
with every byte read or 
written. The CPU also had to stop while DMA 
transfers were performed. 


The 
8089 introduces 
the 
third 
generation 
of 
input/output 
processing. It continues the trend of 
simplifying the CPU's "view" 
of I/O devices by 
removing another level of control from the CPU. 
The CPU performs an I/O operation by building 
a message in memory that describes the function 
to be performed; 
the lOP reads the message, car- 
ries out the operation and notifies the CPU when 
it has finished. All I/O devices appear to the CPU 
as transmitting 
and receiving whole blocks of 
data; the lOP can make both byte- and word-level 
transfers invisible to the CPU. The lOP assumes 
all device controller overhead, performs both pro- 
grammed and DMA transfers, 
and can recover 
from "soft" 
I/O errors without CPU interven- 
tion; all of these activities may be performed 
while the CPU is attending to other tasks. 


Since the 8089 is a new concept in microprocessor 
components, 
this section surveys the basic opera- 
tion of the lOP as background 
to the detailed 
descriptions provided in the rest of the chapter. 
This summary deliberately omits some operating 
details in order to provide an integrated overview 
of basic concepts. 


A CPU communicates with an lOP in two distinct 
modes: 
initialization 
and 
command. 
The 
initialization 
sequence 
is typically 
performed 
when the system is powered-up or reset. The CPU 
initializes the lOP by preparing a series of linked 
message blocks in memory. On a signal from the 
CPU, the lOP reads these blocks and determines 
from them how the data buses are configured and 
how access to the buses is to be controlled. 
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Following initialization, 
the CPU directs all com- 


munications to either of the lOP's two channels; 
indeed, during norm~l operation the lOP appears 
to be two separate devices-channel 
1 and chan- 


nel2. All CPU-to-channel 
communications 
center 
on the channel control block (CB) illustrated in 
figure 
3-3. The 
CB is located 
in the CPU's 


memory space, and its address is passed to the 
lOP during initialization. 
Half of the block is 


dedicated to each channel. The channel maintains 
the BUSY flag that indicates whether it is in the 
midst of an operation 
or is avai1<i.blefor a new 


command. The CPU sets the CCW (channel com- 
mand word) to indicate what kind of operation 
the lOP is to perform. 
Six different commands 


allow 
the 
CPU 
to 
start 
and 
stop 
programs, 


remove interrupt requests, etc. 


If the CPU is dispatching a channel to run a pro- 
gram, it dir~cts the channel to a parameter block 
(PB) and a task block (TB); these are also shown 
in figure 3-3. The parameter block is analogous to 
a parameter 
list passed 
by 
a program 
to a 


subroutine; 
it contains 
variable 
data 
that 
the 


channel 
program 
is to use in carrying 
out its 


assignment. The parameter 
block also may con- 


tain space for variables (results) that the channel 
is to return to the CPU. Except for the first two 
words, the format and size of a parameter 
block 


are completely open; the PB may be set up to 
exchange any kind of information 
between the 


CPU and the channel program. 


A task block is a channel program-a 
sequence of 


8089 instructions that will perform an operation. 
A typical channel program might use parameter 
block data to set up the lOP and a device con- 
troller for a transfer, perform the transfer, return 
the results, and then halt. However, there are no 
restrictions on what a channel program can do; its 
function may be simple or elaborate 
to suit the 


needs of the application. 


Before the CPU starts a channel program, it links 
the program (TB) to the parameter block and the 
parameter block to the CB as shown in figure 3-3. 
The links are standard 
8086/8088 
doubleword 


pointer variables; the lower-add;essed 
word con- 


tains an offset, 
and the higher-addresseQ 
word 
contains 
a segment base value. A system may 


have many different parameter 
and task blocks; 


however, only one of each is ever linked to a 
channel at any given time. 
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After the CPU has filled in the CCW and has 
linked the CB to a parameter 
block and a task 
block, if appropriate, 
it issues a channel attention 
(CA). This is done by activating 
the lOP's 
CA 
(channel attention) and SEL (channel select) pins. 
The state of SEL at the falling edge of CA directs 
the channel attention to channell 
or channel 2. If 
the lOP is located in the CPU's 
1/0 space, it 
appears to the CPU as two consecutive I/O ports 
(one for each channel), and an OUT instruction 
to the port 
functions 
as a CA. If the lOP 
is 
memory-mapped, 
the channels 
appear 
as two 
consecutive memory locations, 
and any memory 
reference instruction 
(e.g., MOV) to these loca- 
tions causes a channel attention. 


An lOP channel attention 
is functionally similar 
to a CPU interrupt. 
When the channel recognizes 
the CA, it stops what it is doing (it will typically 
be idle) and examines the command in the CCW. 
If it is to start a program, 
the channel loads the 
addresses of the parameter 
and task blocks into 
internal registers, sets its BUSY flag and starts 
executing the channel program. After it has issued 
the CA, the CPU is free to perform other process- 
ing; the channel 
can perform 
its function 
in 
parallel, 
subject to limitations 
imposed by bus 
configurations (discussed shortly). 


When the channel has completed its program, 
it 
notifies the CPU by clearing its BUSY flag in the 
CB. Optionally, 
it may issue an interrupt request 
to the CPU. 


The CPU/lOP 
communication 
structure is sum- 
marized in figure 3-4. Most communication 
takes 
place via "message 
areas" 
shared 
in common 
memory. The only direct hardware communica- 
tions between the devices are channel attentions 
and interrupt requests. 


Each 
of 
the 
two 
lOP 
channels 
operates 
independently, 
and each has its own register set, 
channel attention, 
interrupt 
request 
and DMA 
control signals. At a given point in time, a chan- 
nel may be idle, executing a program, performing 
a DMA transfer, 
or responding 
to a channel 
attention. 
Although 
only one channel 
actually 
runs at a time, the channels can be active concur- 
rently, alternating their operations 
(e.g., channel 
1 may execute instructions i'n the periods between 
successive DMA transfer cycles run by channel 2). 
A built-in 
priority 
system allows high-priority 
activities on one channel to preempt less critical 
operations on the other channel. The CPU is able 
to further adjust priorities to handle special cases. 
The CPU starts the channel and can halt it, sus- 
pend it, or cause it to resume a suspended opera- 
tion by placing different values in the CCW. 


Channel 
programs 
are written in ASM-89, the 
8089 assembly language. About 50 basic instruc- 
tions are available. These instructions operate on 
bit, byte, word and doubleword (pointer) variable 
types; a 20-bit physical address variable type (not 
used by the 8086/8088) 
can also be manipulated. 
Data may be taken from registers, immediate con- 
stants 
and 
memory. 
Four 
memory 
addressing 
modes 
allow 
flexible 
access to both 
memory 
variables and I/O 
devices located anywhere 
in 
either the CPU's 
megabyte memory space or in 
the 8089's 64k I/O space. 


The lOP instruction set contains general purpose 
instructions 
similar to those found in CPUs as 
well as instructions 
specifically tailored for I/O 


MESSAGES 
IN 
MEMORY 


operations. 
Data 
transfer, 
simple 
arithmetic, 
logical and address manipulation 
operations 
are 
available. Unconditional 
jump "and call instruc- 
tions also are provided so that channel programs 
can link to each other. An individual bit may be 
set or cleared with a single instruction. 
Condi- 
tional jumps can test a bit and jump if it is set (or 
cleared), or can test a value and jump if it is zero 
(or non-zero). 
Other instructions 
initiate DMA 
transfers, 
perform 
a 
locked 
test-and-set 
semaphore 
operation, 
and 
issue an 
interrupt 
request to the CPU. 


The 8089 XFER (transfer) 
instruction 
prepares 
the channel for a DMA transfer. 
It executes one 
additional 
instruction, 
then 
suspends 
program 
execution and enters the DMA transfer 
mode. 
The transfer 
is governed 
by channel 
registers 
setup 
by the program 
prior 
to executing 
the 
XFER instruction. 


Data is transferred from a source to a destination. 
The source and destination may be any locations 
in the CPU's memory space or in the lOP's 
I/O 
space; 
the lOP 
makes' no distinction 
between 
memory 
components 
and 
I/O 
devices. 
Thus 
transfers 
may 
be made 
from 
I/O 
device 
to 
memory, 
memory 
to 1/0 device, 
memory 
to 
memory and I/O device to I/O device. The lOP 
automatically 
matches 8- and 16-bit components 
to each other. 


Individual transfer cycles (Le., the movement of a 
byte or a word) may be synchronized by a signal 
(DMA 
request) 
from 
the source 
or from 
the 
destination. 
In the synchronized mode, the chan- 
nel waits for the synchronizing signal before start- 
ing the next transfer cycle. The transfer also may 
be unsynchronized, 
in which case the channel 
begins the next transfer cycle immediately upon 
completion of the previous cycle. 


A transfer cycle is performed in two steps: fetch- 
ing a byte or word from the source into the lOP 
and then storing it from the lOP into the destina- 
tion. 
The 
lOP 
automatically 
optimizes 
the 
transfer to make best use of the available data bus 
widths. For example, if data is being transferred 
from an 8-bit device to memory that resides on a 
16-bit bus (e.g., 8086 memory), the lOP will nor- 
mally run two one-byte fetch cycles and then store 
the full word in a single cycle. 


Between the fetch and store cycles, the lOP can 
operate on the data. A byte may be translated to 
another code (e.g., EBCDIC to ASCII), or com- 
pared to a search value, or both, if desired. 


A 
transfer 
can 
be 
terminated 
by 
several 
programmer-specified 
conditions. 
The 
channel 
can stop the transfer when a specified number (up 
to 64k) of bytes has been transferred. 
An external 
device may stop a transfer 
by signaling on the 
channel's external terminate pin. The channel can 
stop the transfer when a byte (possibly translated) 
compares equal, or unequal, 
to a search value. 
Single-cycle termination, 
which stops uncondi- 
tionally after one byte or word has been stored, is 
also available. 


When 
the 
transfer 
terminates, 
the 
channel 
automatically 
resumes program 
execution. 
The 
channel program can determine the cause of the 
termination in situations where multiple termina- 
tions are possible (e.g., terminating when 80 bytes 
are transferred 
or a carriage return character 
is 
encountered, whichever occurs first). As an exam- 
ple of post-transfer 
processing, the channel pro- 
gram could read a result register from the I/O 
device controller to determine if the transfer was 
performed successfully. If not (e.g., a CRC error 
was detected by the controller), 
the channel pro- 
gram could 
retry 
the operation 
without 
CPU 
intervention. 


A channel program typically ends by posting the 
result of the operation 
to a field supplied in the 
parameter 
block, 
optionally 
interrupting 
the 
CPU, and then halting. When the channel halts, 
its BUSY flag in the channel 
control 
block is 
cleared to indicate 
its availability 
for another 
operation. 
As an alternative to being interrupted 
by the channel, 
the CPU can poll this flag to 
determine 
when 
the 
operation 
has 
been 
completed. 


As shown 
in figure 
3-5, the lOP 
can access 
memory 
or 
ports 
(I/O 
devices) 
located 
in a 
I-megabyte 
system space and memory or ports 
located in a 64-kilobyte I/O space. Although the 
lOP only has one physical data bus, it is useful to 
think of the lOP as accessing the system space via 
a system data bus and the I/O space over an I/O 
data 
bus. The distinction 
between 
the "two" 
buses is based on the type-of-cycle signals output 


by the 8288 Bus Controller. 
Components 
in the 


system space respond 
to the memory read and 


memory write signals, whether they are memory 
or I/O 
devices. Components 
in the I/O 
space 


respond to the I/O read and I/O write signals. 
Thus I/O devices located in the system space are 
memory-mapped 
and memory in the I/O space is 


I/O-mapped. 
The two basic configuration 
op- 


tions differ in the degree to which the lOP shares 
these buses with the CPU. Both configurations 
re- 


quire an 8086/8088 CPU to be strapped in max- 
imummode. 


In the local configuration, 
shown in figure 3-6, 
the lOP (or lOPs if two are used) shares both 
buses with the CPU. The system bus and the I/O 
bus are the same width (8 bits if the CPU is an 


8088 or 16 bits if the CPU is an 8086). The lOP 
system space corresponds 
to the CPU memory 


space, and the lOP I/O space corresponds to the 
CPU I/O space. Channel programs are located in 
the system space; I/O devices may be located in 
either space. The lOP requests use of the bus for 
channel program instruction fetches as well as for 
DMA and programmed 
transfers. 
In the local 


configuration, 
either the lOP or the CPU may use 


the buses, 
but 
not 
both 
simultaneously. 
The 


advantage 
of 
the 
local 
configuration 
is that 


intelligent DMA may be added to a system with 
no additional components 
beyond the lOP. The 


disadvantage is that parallel operation of the pro- 
cessors is limited to cases in which the CPU has 
instruction 
in its queue 
that 
can be executed 


without using the bus. 
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In the remote configuration 
(figure 3-7), the lOP 
(or lOPs) shares a common system bus with the 
CPU. Access to this bus is controlled by 8289 Bus 
Arbiters. 
The 
lOP's 
I/O 
bus, 
however, 
is 
physically separated from the CPU in the remote 
configuration. 
Two lOPs can share the local I/O 
bus. Any number of remote lOPs may be con- 
tained in a system, configured in remote clusters 
of one or two. The local I/O bus need not be the 
same physical width as the shared system bus, 
allowing an lOP, for example, to interface 8-bit 
peripherals to an 8086. In the remote configura- 
tion, the lOP can access local I/O 
devices and 
memory 
without 
using the shared 
system bus, 
thereby reducing bus contention 
with the CPU. 


Contention can further be reduced by locating the 
lOP's 
channel programs 
in the local I/O space. 
The lOP can then also fetch instructions without 


accessing the system bus. 
Parameter, 
channel 
control 
and 
other 
CPU/IOP 
communication 
blocks 
must 
be 
located 
in 
system 
memory, 


however, so that both processors can access them. 
The 
remote 
configuration 
thus 
increases 
the 
degree to which an lOP and a CPU can operate in 
parallel 
and 
thereby 
increases 
a 
system's 
throughput 
potential. 
The price paid for this is 
that 
additional 
hardware 
must 
be 
added 
to 
arbitrate 
use of the shared bus, and to separate 
the shared and local buses (see Chapter A for 
details). 


It is also possible to configure an lOP remote to 
one CPU, and local to another CPU (see figure 
3-8). The local CPU could be used to perform 
heavy computational 
routines for the lOP. 
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A Sample Transaction 


Figure 3-9 shows how a CPU and an lOP might 
work together to read a record (sector) from a 
floppy disk. This example is not illustrative of the 
lOP's full capabilities, but it does review its basic 
operation and its interaction with a CPU. 


The CPU must first obtain 
exclusive use of a 
channel. This can be done by performing 
a "test 
and set lock" operation on the selected channel's 
BUSY flag. 
Assuming 
the CPU 
wants 
to use 
channell, 
this 
could 
be 
accomplished 
in 
PL/M-86 by coding similar to the following: 


DO WHILE LOCKSET (@CH1.BUSY,OFFH); 
END; 


In ASM-86 a loop containing the XCHG instruc- 
tion prefixed 
by LOCK would accomplish 
the 
same thing, namely testing the BUSY flag until it 
is clear (OH), and immediately setting it to FFH 
(busy) to prevent another task or processor from 
obtaining use of the channel. 


Having obtained tlie' channel, the CPU fills in a 
parameter block (see figure 3-10). In this case, the 
CPU passes the following parameters to the chan- 
nel: the address of the floppy disk controller, the 
address of the buffer 
where the data is to be 
placed, and the drive, track and sector to be rea~l. 
It also supplies space for the lOP to return the 
result of the operation. 
Note that this is quite a 
"low-level" 
parameter 
block in that it implies 
that the CPU has detailed knowledge of the-I/O 
system. 
For 
a 
"real" 
system, 
a higher-level 
parameter block would isolate the CPU from I/O 
device characteristics. 
Such-a block might contain 
more general parameters 
such as file name and 
record key. 


After setting up the parameter 
block, the CPU 
writes a "start 
channel program" 
command 
in 
channell's 
CCW. 
Then 
the CPU 
places the 
address of the desired channel program 
in the 
parameter 
block and writes the parameter 
block 
address 
in the CB. Notice 
that 
in this simple 
example, the CPU "knows" 
the address of the 
channel program for reading from the disk, and 
presumably also "knows" 
the address of another 
program for writing, etc. A more general solution 
would be to place a function code (read, write, 


delete, etc.) in the parameter block and let a single 
channel 
program 
execute 
different 
routines 
depending on which function is requested. 


After the communication 
blocks have been setup, 


the CPU dispatches the channel by issuing a chan- 
nel attention, typically by an OUT instruction for 
an I/O-mapped 
8089, or a MOY or other memory 
reference i~struction fora memory-mapped 
8089. 


The" channel begins executing the channel pro- 
gram (task block) whose address has been placed 
in the parameter 
block by the CPU. In this case 
the program initializes the 8271 Floppy Disk Con- 
troller by sending it a "read 
data" 
command 
followed by a parameter indicating the track to be 
read. The program initializes the channel registers 
that define and control the DMA transfer. 


Having prepared the 8271 and the channel itself, 
the channel program executes a XFER instruction 
and sends a final parameter (the sector to be read) 
to the 8271. (The 8271 enters DMA transfer mode 
immediately upon receiving the last of a series of 
parameters; 
sending the last parameter 
after the 
XFER instruction gives the channel time to setup 
for the transfer.) The DMA transfer begins when 
the 8271 issues a DMA request to the channel. 
The transfer 
continues 
until the 8271 issues an 
interrupt 
request, 
indicating 
that 
the data 
has 
been transferred 
or tha.t an error has occurred. 


The 8271's interrupt 
request line is tied to the 
lOP's EXTl (external terminate on channell) 
pin 
so that the channel interprets an interrupt request 
as an external 
terminate 
condition. 
Upon 
ter- 
mination 
of the transfer, 
the channel 
resumes 


executing instructions 
and reads the 8271 result 
register to determine 
if the data was read suc- 
cessfully. If a soft (correctable) error is indicated, 
the lOP retries the transfer. 
If a hard (uncorrect- 
able) error is detected, or if the transfer has been 
successful, the lOP posts the content of the result 
register to the parameter 
block result field, thus 
passing the result back to the CPU. The channel 
then interrupts the CPU (to inform the CPU that 
the request has been processed) and halts. 


When 
the 
CPU 
recognizes 
the 
interrupt, 
it 
inspects the result field in the parameter block to 
see if the content of the buffer is valid. If so, it 
uses the data; otherwise it typically executes an 
error routine. 
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tion; 
conversely, 
the 
operating 
system 
and 
application teams do not need to be expert in the 
operation of I/O devices. Standard high-level I/O 
systems 
can 
be 
used 
in 
multiple 
application 
systems. Because the application and I/O systems 
are 
almost 
independent, 
application 
system 
changes can be introduced 
without affecting the 
I/O 
system. New peripherals 
can similarly 
be 
incorporated 
into a system without 
impacting 
applications 
or operating 
system software. 
The 
lOP's simple CPU interface also is designed to be 
compatible with future Intel CPUs. 


TRACK 


RESULT 


DRIVE 


SECTOR 


Combining the raw speed and responsiveness of a' 
traditional 
DMA 
controller, 
an, I/O-oriented 
instruction 
set, and a flexible bus organization, 
the 8089 lOP 
is a very versatile 
I/O 
system. 
Applications 
with demanding 
I/O requirements, 
previously beyond the abilities of microcomputer 
systems, can be undertaken 
with the lOP. These 
kinds of I/O-intensive applications include: 


• 
systems that employ high-bandwidth, 
low- 
latency 
devices 
such 
as 
hard 
disks 
and 
graphics terminals; 


• 
systems 
with 
many 
devices 
requiring. 


asynchronous service; and 


• 
systems with high-overhead peripherais such 
as intelligent CRTs and graphics terminals. 


In addition, 
virtually every application 
that per- 
forms a moderate 
amount 
of I/O 
can benefit 
from the design philosophy embodied in the lOP:' 
system functions 
should 
be distributed 
among 
special-purpose processors. An lOP channel pro- 
gram is likely to be both faster and smaller than 
an equivalent program implemented with a CPU. 
Programming 
also is more straightforward 
with 
the lOP's specialized instruction set. 


Removing I/O from the CPU and assigning it to 
one or more lOPs 
simplifies 
and structures 
a 
system's design. The main interface to the I/O 
system can be limited to the parameter 
blocks. 
Once these are defined, 
the I/O system can be 
designed and implemented in parallel with the rest 


Keeping in mind the true general-purpose 
nature 
of the lOP, some of the situations where it can be 
used to advantage are: 
• 
Bus matching - The lOP can transfer 
data 
between virtually any combination 
of 8- and 
16-bit memory 
and I/O 
components. 
For 
example, it can interface a 16-bit peripheral 
to an 8-bit CPU bus, such as the 8088 bus. 
The 
lOP 
also provides 
a straightforward 
means of performing DMA between an 8-bit 
peripheral 
and 8086 memory 
that 
is split 
into odd- and even-addressed 
banks. 
The 
8089 
can 
access 
both 
8- 
and 
16-bit 
peripherals connected to a 16-bit bus. 


• 
String processing - The 8089 can perform a 
memory move, translate, 
scan-for-match 
or 
scan-for-nonmatch 
operation 
much 
faster 
than the equivalent instructions in an 8086 or 
8088. Translate 
and scan operations 
can be 
setup so that the source and destination refer 
to the same addresses to permit the string to 
be operated on in place. 


• 
Spooling - Data from low-speed devices such 
as terminals and paper tape readers can be 
read by the 8089 and placed in memory or on 
disk until the transmission 
is complete. The 
lOP can then transfer the data at high speed 
when it is needed by an application program. 
Conversely, output data ultimately destined 
for a low-speed device such as a printer, can 
be, temporarily 
spooled 
to disk and 
then 
printed later. This permits batches of data to 
be gathered 
or distributed 
by low-priority 
programs that run in the background, 
essen- 
tially using up "spare" 
CPU and lOP cycles. 


Application 
programs 
that use or produce 
the data can execute faster because they are 
not bound by the low-speed devices. 


HIUll.ll.a.;)l\.1l15 
vl-'~lu\.J.U.6 
"J.ne 
•.'1"'&-",v.I~ 
•.L"P...l •.."C\.I·-"""" 
rt.•.••_')~et""l"'''n 
. 


lIO 
tasks 
to 
channels 
with 
an 
absolute 
channel and a floppy disk on the other chan- 
minimum 
of overhead. 
Because a remote 
nel. This configuration 
makes use of approx- 
channel can run in parallel with the CPU, the 
imately 30 percent of the available bus band- 


operating 
system's 
capacity 
for 
servicing 
width. Performance 
can be increased within 
application 
tasks can increase dramatically, 
the available bus bandwidth 
by adding an 
as can its ability to handle more, and faster, 
8086 or 8088 CPU to a remote 
lOP con- 
lIO devices. If both channels of an lOP are 
figuration. 
This configuration 
can provide 


active concurrently, 
the lOP automatically 
scaling, 
rotation 
or 
other 
sophisticated 


gives preference to the higher-priority 
activ- 
display transformations. 


ity (e.g., DMA normally preempts channel 
program 
execution). 
The operating 
system 
can adjust the priority mechanism and also 
can halt or suspend a channel to take care of 
a critical asynchronous event. 


Disk systems - The lOP can meet the speed 
and latency requirements 
of hard disks. It 


can be used to implement 
high-level, 
file- 
oriented systems that appear to application 
programs 
as 
simple 
commands: 
OPEN, 
READ, 
WRITE, 
etc. The lOP can search 
and update disk directories and maintain free 
space maps. "Hierarchical 
memory" systems 
that 
automatically 
transfer 
data 
among 
memory, 
high-speed 
disks 
and 
low-speed 
disks, based on frequency of use, can be built 
around 
lOPs. 
Complex 
database 
searches 
(reading data directly or following pointer 
chains) can appear 
to programs 
as simple 
commands 
and can execute in parallel with 
application programs if an lOP is configured 
remotely. 


Display terminals - The 8089 is well suited to 
handling 
the DMA requirements 
of CRT 


controllers. The lOP's transfer bandwidth is 
high enough to support 
both alphanumeric 
and graphic displays. The 8089 can assume 
responsibility for refreshing the display from 
memory data; in the remote configuration, 
the refresh overhead can be removed from 
the system bus entirely. 
Linked-list display 


algorithms may be programmed 
to perform 


sophisticated modes of display. 


Each time it performs 
a refresh operation, 
the lOP can scan a keyboard for input and 
translate 
the key's row-and-column 
format 
into an ASCII or EBCDIC 
character. 
The 
8089 can buffer the characters, 
scanning the 


stream 
until 
an end-of-message 
character 


(e.g., carriage return) is detected, 
and then 
interrupt the CPU. 


The 8089 is internally divided into the functional 
units depicted schematically 
in figure 3-11. The 


units are connected by a 20-bit data path to obtain 
maximum internal transfer rates. 


All lOP operations 
(instructions, 
DMA transfer 
cycles, channel attention responses, etc.) are com- 
posed of sequences of more basic processes called 
internal 
cycles. A bus cycle takes one internal 
cycle; the execution of an instruction 
may require 
several internal 
cycles. There 
are 23 different 


types of internal cycles each of which takes from 
two to eight clocks 
to execute, 
not 
including 


possible wait states and bus arbitration 
times. 


The common control unit (CCU) coordinates 
the 


activities of the lOP primarily by allocating inter- 
nal cycles to the various processor units; i.e., it 
determines which unit will execute the next inter- 
nal cycle. For example, when both channels are 
active, the CCU determines 
which channel has 


priority and lets that channel run; if the channels 
have equal priority, the CCU "interleaves" 
their 


execution (this is discussed more fully later in this 
section). The CCU also initializes the processor. 


The ALU can perform unsigned binary arithmetic 
on 8- and 
16-bit binary 
numbers. 
Arithmetic 


results may be up to 20 bits in length. Available 
arithmetic 
instructions 
include 
addition, 
incre- 


ment and decrement. Logical operations ("and," 
"or" 
and "not") 
may be performed on either 8- 
or 16-bit quantities. 


f 
f 
f 
f 
ORQ1 EXT1 SINTR·l 
ORQ2 EXT2 SINTR·2 


All data entering 
the chip flows through 
these 
registers. When data is being transferred between 
different 
width 
buses, 
the 
8089 
uses 
the 
assembly/disassembly 
registers 
to 
effect 
the 
transfer 
in the fewest possible bus cycles. In a 
DMA transfer from an 8-bit peripheral to l6-bit 
memory, 
for example, 
the lOP 
runs two bus 
cycles, 
picking 
up 
eight 
bits 
in each 
cycle, 
assembles a l6-bit word, and then transfers 
the 
word to memory in a single bus cycle. (The first 
and last cycles of a transfer 
may be performed 
differently 
to 
accommodate 
odd-addressed 


words; 
the lOP 
automatically 
adjusts 
for this 
condition.) 


This unit controls 
instruction 
fetching 
for the 
executing channel (one channel actually runs at a 
time). If the bus over which the instructions are 
being fetched is eight bits wide, then the instruc- 
tions are obtained 
one byte at a time, and each 
fetch requires one bus cycle. If the instructions 
are being fetched 
over a l6-bit 
bus, then the 
instruction fetch unit automatically 
employs a 1- 
byte queue to reduce the number of bus cycles. 
Each channel has its own queue, and the activity 
of one channel does not affect the other's queue. 


During 
sequential 
execution, 
instructions 
are 
fetched one word at a time from even addresses; 
each fetch requires one bus cycle. This process is 
shown graphically in figure 3-12. When the last 
byte of an instruction falls on an even address, the 
odd-addressed byte (the first byte of the following 
instruction) 
of the fetched word is saved in the 
queue. When the channel begins execution of the 
next instruction, 
it fetches the first byte from the 
queue rather 
than 
from 
memory. 
The queue, 
then, keeps the processor fetching words, rather 
than bytes, thereby reducing its use of the bus and 
increasing throughput. 


The processor fetches bytes rather than words in 
two cases. If a program transfer instruction (e.g., 
JMP 
or 
CALL) 
directs 
the 
processor 
to 
an 
instruction 
located at an odd address, 
the first 
byte of the instruction 
is fetched 
by itself as 
shown -infigure 3-13. This is because the program 
transfer invalidates the content of the queue by 
changing the serial flow of execution. 
, 


The second case arises when an LPDI instruction 
is located at an odd address. In this situation, the 
six-byte LPDI instruction 
is fetched: byte, word, 


byte, byte, byte, and the queue is not used. The 
first byte of the following instruction is fetched in 
one bus cycle as if it had been the target of a pro- 
gram transfer. 
Word fetching resumes with this 


instruction's second byte. 
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3 
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4 
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WHICH 
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The BIU runs all bus cycles, transferring 
instruc- 
tions and data 
between 
the lOP 
and external 
memory 
or 
peripherals. 
Every 
bus 
access 
is 
associated with a register tag bit that indicates to 
the BIU whether the system or I/O space is to be 
addressed. The BIU outputs the type of bus cycle 
(instruction fetch from I/O space,..Q..at~tore into 
system space, etc.) on status lines SO, SI, and S2. 
An 8288 Bus Controller 
decodes these lines and 
provides signals that selectively enable one bus or 
the other (see Chapter 4 for details). 


The 
BIU 
further 
distinguishes 
between 
the 
physical and logical widths of the system and I/O 
buses. The physical widths of the buses are fixed 
and 
are 
communicated 
to 
the 
BIU 
during 
initialization. 
In the local configuration, 
both 
buses must be the same width, either 8 or 16 bits 
(matching the width of the host CPU bus). In the 
remote configuration, 
the lOP system bus must 
be the same physical width as the bus it shares 
with the CPU. The width of the lOP's 
I/O bus, 
which 
is local 
to the 
8089, may 
be selected 
independently. 
If 
any 
16-bit 
peripherals 
are 
located in the I/O space, then a 16-bit I/O bus 
must be used. If only 8-bit devices reside on the 
I/O bus, then either an 8- or a 16-bit I/O bus may 
be selected. A 16-bit I/O bus has the advantage of 
easy accommodation 
of future 16-bit devices and 
fewer instruction fetches if channel programs are 
placed in the I/O space. 


For a given DMA transfer, 
a channel program 
specifies the logical width of the system and the 
I/O 
buses; 
each channel 
specifies logical bus 
widths independently. 
The logical width of an 
8-bit physical bus can only be eight bits. A 16-bit 
physical bus, however, can be used as either an 8- 
or 16-bit logical bus. This allows both 8- and 
16-bit devices to be accessed over a single 16-bit 
physical 
bus. 
Table 
3-1 lists 
the 
permissible 
physical and logical bus widths for both locally 
and remotely configured lOPs. Logical bus width 
pertains to DMA transfers only. Instructions are 
fetched and operands 
are read and written 
in 
bytes or words depending on physical bus width. 


In addition 
to performing 
transfers, 
the BIU is 
responsible for local bus arbitration. 
In the local 
configuration, 
the 
BIU 
uses 
the 
RQ/GT 
(request/grant) 
line to obtain the bus from the 
CPU and to return it after a transfer has been per- 
formed. 
In the remote configuration, 
the BIU 


uses RQ/GT 
to coordinate 
use of the local I/O 
bus with another lOP or a local CPU, if present. 
System bus arbitration 
in the remote configura- 
tion is performed 
by an 8289 Bus Arbiter 
that 
operates 
invisibly 
to 
the 
lOP. 
The 
BIU 
automatically 
asserts the LOCK (bus lock) signal 
during execution 
of a TSL (test and set lock) 
instruction 
and, if specified by the channel pro- 
gram, can assert the LOCK signal for the dura- 
tion of a DMA transfer. 
Section 3.5 contains a 
complete discussion of bus arbitration. 


Configuration 
System Bus 
110 Bus 
Physical:Logical 
Physical:Logical 


Local 
8:8 
8:8 
16:8/16 
16:8/16 


8:8 
8:8 


Remote 
16:8/16 
16:8/16 
16:8/16 
8:8 
8:8 
16:8/16 


Although 
the 8089 is a single processor, 
under 
most circumstances 
it is useful to think of it as 
two independent 
channels. 
A channel may per- 
form DMA transfers 
and may execute channel 
programs; 
it also 
may 
be idle. 
This 
section 
describes the hardware features that support these 
operations. 


Each channel contains its own I/O control section 
that governs the operation 
of the channel during 
DMA transfers. 
If the transfer 
is synchronized, 


the channel waits for a signal on its DRQ (DMA 
request) line before performing 
the next fetch- 
store sequence in the transfer. 
If the transfer is to 
be terminated 
by an external signal, the channel 
monitors 
its EXT (external terminate) 
line and 
stops the transfer 
when this line goes active. 


Between the fetch and store cycles (when the data 
is in the lOP) 
the channel 
optionally 
counts, 
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Figure 3-14 il1ustrates the channel register set, and 
table 3-2 summarizes 
the uses of each register. 
Each channel has an independent set of registers; 
they are not accessible to the other channel. Most 
of the registers play different roles during channel 
program execution than in DMA transfers. Chan- 
nel programs 
must 
be careful 
to 
save these 
registers in memory prior to a DMA transfer if 
their values are needed fol1owing the transfer. 


General Purpose A (GA). A channel program 
may use GA for a general 
register or a base 
register. A general register can be an operand of 
most lOP instructions; 
a base register is used to 
address 
memory 
operands 
(see 
section 
3.8). 
Before initiating 
a DMA transfer, 
the channel 
program 
points 
GA 
to 
either 
the 
source 
or 
destination address of the transfer. 


General Purpose B (GB). GB is functional1y 
interchangeable 
with GA. If GA points to the 


source of a DMA transfer, then GB points to the 
destination, and vice versa. 


~~nl:l 
GENERAL PURPOSE A 


GENERAL PURPOSE B 


GENERAL PURPOSE C 


TASK POINTER 


PARAMETER BLOCK POINTER 


INDEX 


BYTE COUNT 


MASK/COMPARE 


CHANNEL CONTROL 


General Purpose C (GC). GC may be used as a 
general register or a base register during channel 
program execution. If data is to be translated dur- 
ing a DMA transfer, 
then the channel program 
loads GC with the address of the first byte of a 
translation table before initiating the transfer. GC 
is not altered by a transfer operation. 


Task Pointer (TP). The CCU loads TP from the 
parameter block when it starts or resumes a chan- 
nel program. 
During 
program 
execution, 
the 
channel automatical1y updates TP to point to the 


Program 
System 
Register 
Size 
Access 
or I/O 
Use by Channel 
Programs 
Use in DMA Transfers 
Pointer 


GA 
20 
Update 
Either 
General, base 
. 


Source/destination 
pointer 


J 


GB 
20 
Update 
Either 
General, base 
Sourcejdestination 
pointer 


GC 
20 
Update 
Either 
General, base 
Translate table pointer 


TP 
20 
Update 
Either 
Procedure return, 
Adjusted to reflect cause of 
instruction 
pointer 
termination 


PP 
20 
Reference 
System 
Base 
N/A 
, 


IX 
16 
Update 
N/A 
General, auto-increment 
N/A 


BC 
16 
Update 
N/A 
General 
Byte counter 


MC 
16 
Update 
N/A 
General, masked compare 
Masked compare 


CC 
16 
Update 
N/A 
Restricted use recommended 
Defines transfer options 


an instruction 
pointer or program counter. Pro- 
gram transfer 
instructions 
(JMP, 
CALL, 
etc.) 
update TP to cause 'nonsequential 
execution. A 
procedure (subroutine) returns to the calling pro- 
gram by loading TP with an address previously 
saved by the CALL instruction. 
The task pointer 
is fully accessible to channel programs; it can be 
used as a general register or as a base register. 
Such use is not recommended, 
however, as it can 
make programs very difficult to understand. 


t 


Parameter 
Block 
Pointer 
(PP). The 
CCU 
loads 
this 
register 
with 
the 
address 
of 
the 
parameter 
block before it starts a channel pro- 
gram. The register cannot be altered by a channel 
program, but is very useful as a base register for 
accessing data in the parameter 
block. PP is not 
used during DMA transfers. 


Index (IX). IX may be used asa general register 
during channel program execution. It also may be 
used as an index register 
to address 
memory 
operands (the address of the operand is computed 
by adding the content of IX to the content of a 
base 
register). 
When 
specified 
as 
an' index 
register,IX 
may be optionally auto-incremented 
as the last step in the instruction to provide a con- 
venient means of "stepping" 
through arrays or 
strings. IX is not used in DMA transfers. 


Byte Count (BC). BC may be used as a general 
register 
during 
channel 
program 
execution. 
If 
DMA is to be terminated when a specific number 
of bytes has been transferred; 
BC should 
be 
loaded 
with 
the 
desired 
byte 
count 
before 
initiating 
the 
transfer. 
'During 
DMA, 
BC is 
decremented 
for each byte transferred, 
whether 
byte count termination 
has been selected or not. 


If BC reaches zero, the transfer is stopped only if 
byte count termination 
has been specified. If byte 
count 
termination 
has 
not 
been 
selected, 
BC 
"wraps 
around" 
from OH to FFFFH 
and con- 
tinues to be decremented. 


Mask/Compare (MC). A channel program may 
use MC for a general register. This register also 
may be used in either a channel program or in a 
DMA transfer to perform a masked compare of a 
byte value. To use MC in this way, the program 
loads a compare value in the low-order eight bits 
of the register and a mask value in the upper eight 
bits (see figure 3:15). k"I" in a mask bit selects 
the bit in the corresponding 
position in the c0!l1-, 


pare value; a "0" 
in a mask bit masks the cor- 
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3-15, a value compared 
with MC will be con- 
sidered equal if its low-order five bits contain the 
value 00100; the upper three bits may contain any 
value 
since 
they 
are 
masked 
out 
of 
the 
comparison. 
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Channel 
Control 
(CC). The 
content 
of 
the 
channel control register governs 'a DMA transfer 
(see figure 3-16). A channel program 
loads this 
register with appropriate 
values before beginning 
the transfer 
operation; 
section 
3.4 covers 
the 
encoding of each field in detail. Bit 8 (the chain 
bit) of CC pertains to channel program execution 
rather than to a DMA transfer. 
When this bit is 
zero, the channel program runs at normal prior- 
ity; when it is one, the priority of the program is 
raised to the same level as DMA (priorities are 
covered later in this section). Although a channel 
program may use CC as a general register, such 
use is not 
recommended 
because 
of the 
side 
effects on the chain bit and thus on the priority of 
the channel program. 
Channel programs 
should 
restrict their use of CC to loading control values 
in preparation 
for a DMA transfer, 
setting and 
clearing the chain bit, and storing the register. 


Each channel maintains 
its own program 
status 
word (PSW) as shown in figure 3-17. Channel 
programs 
do not have access to thePSW. 
The 
PSW records the state of the the channel so that 
channel operation 
may be suspended 
and then 
resumed later. When the CPU issues a "suspend" 
command, 
the 
channel 
saves the 
PSW, 
task 
pointer, and task pointer tag bit in the first four 
bytes of the channel's 
parameter 
block as shown 
in figure 3-18. Upon 
receipt 
of a subsequent 


TT--C 


TERMINATE 
ON BYTE 
COUNT 


TERMINATE 
ON EXTERNAL 
SIGNAL 


TERMINATE 
AFTER 
SINGLE 
TRANSFER 


CHAINED 
CHANNEL 
PROGRAM 
EXECUTION 
LOCK 
BUS DURING 
TRANSFER 


SOURCE/DESTINATION 


SYNCHRONIZATIQN 


TRANSLATE 


FUNCTION 
(PORT 
TO 
PORT, 
PORT 
TO MEMORY, 
ETC.) 


"resume" 
command, 
the PSW, TP, and TP tag 


bit are restored 
from the parameter 
block save 


area and execution resumes. 


Two 
conditions 
override 
the 
normal 
channel 


priority mechanism. If one channel is performing 
DMA (priority 1) and the channel receives a chan- 
nel attention (priority 2), the channel attention is 
serviced at the end of the current DMA transfer 
cycle. 
This 
override 
prevents 
a 
synchronized 


DMA transfers 
from "shutting 
out" 
a channel 


attention. 
DMA terminations 
and chained chan- 


nel programs 
postpone 
recognition 
of a CA on 


the other channel; 
the CA is latched, 
however, 
and is serviced as soon as priorities permit. 


The 
lOP's 
LOCK 
(bus 
lock) 
signal 
also 


supersedes channel switching. A running channel 
will not relinquish control of the processor while 
LOCK is active, regardless of the priorities of the 
activities on the two channels. This is consistent 
with 
the 
purpose 
of 
the 
LOCK 
signal: 
to 


guarantee exclusive access to a shared resource in 
a multiprocessing 
system. Refer to sections 3.5 


and 3.7 for futher 
information 
on the LOCK 


signal and the TSL instruction. 


Registers GA, GB, GC, and TP are called pointer 
registers because they may be used to access, or 
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point to, addresses in either the system space or 
the I/O space. The pointer registers may address 
either memory or I/O devices (lOP instructions 
do not distinguish 
between 
memory 
and 
I/O 
devices since the latter are memory-mapped). 
The 


tag bit associated with each register (figure 3-14) 
determines 
whether 
the 
register 
points 
to an 


address in the system space (tag=O) or the I/O 
space (tag= I). 


The CCU sets or clears TP's tag bit depending on 
whether the command it receives from the CPU is 
"start 
channel 
program 
in system space," 
or 


"start 
channel program in I/O space." 
Channel 
programs alter the tag bits of GA, GB, GC, and 
TP by using different instructions for loading the 
registers. 
Briefly, a "load 
pointer" 
instruction 


clears a tag bit, a "move" 
instruction 
sets a tag 
bit, and a "move 
pointer" 
instruction 
moves a 
memory value (either 0 or I) to a tag bit. Section 
3.9 covers these instructions in detail. 


If a register points to the system space, all 20 bits 
are placed on the address lines to allow the full 
megabyte to be directly addressed. 
If a register 


points to the I/O space, the upper four bits of the 
address lines are undefined; the lower 16 bits are 
sufficient to access any location in the 64k byte 
I/O space. 


Both channels may be active concurrently, 
but 


only one can actually run at a time. At the end of 


each internal cycle, the CCU lets one channel or 
the other execute the next internal cycle. No extra 
overhead is incurred 
by this channel switching. 


The basis for 
making 
the determination 
is a 


priority 
mechanism 
built 
into 
the 
lOP. 
This 


mechanism 
recognizes 
that 
some 
kinds 
of 


activities (e.g., DMA) are more important 
than 


others. Each activity that a channel can perform 
has a priority that reflects its relative importance 
(see table 3-3). 


Two new activities are introduced 
in table 3-3. 


When a DMA transfer 
terminates, 
the channel 


executes a short internal channel program. 
This 


DMA termination program adjusts TP so that the 
user's 
program 
resumes 
at 
the 
instruction 


specified when the transfer 
was setup (this is 


discussed in detail in section 3.4). Similarly, when 
a channel 
attention 
is recognized, 
the channel 
executes an internal program 
that examines the 


CCW and carries out its command. Both of these 
programs 
consist of standard 
8089 instructions 


that 
are 
fetched 
from 
internal 
ROM. 
Intel 


Application 
Note AP-50, Debugging 
Strategies 
and Considerations 
for 8089 Systems, 
lists the 


instructions in these programs. Users monitoring 
the bus during debugging may see operands read 
or written by the termination or channel attention 
programs. The instructions themselves, however, 
wlll not appear on the bus as they are resident in 
the chip. 


Notice also that, according to table 3-3, a channel 
program may run at priority 3 or at priority 
I. 


Channel 
Activity 
Priority 
Interleave 
Boundary 
(1 = highest) 
By DMA 
By Instruction 


DMA transfer 
1 
Bus cycle' 
Bus cycle' 


DMA termination sequence 
1 
Internal cycle 
None 


Channel program (chained) 
1 
Internal cycle2 
Instruction 
" 


Channel attention sequence 
2 
Internal cycle 
None 


Channel program (not chained) 
3 
Internal cycle2 
Instruction 


Idle 
4 
Twoclocks 
Two clocks 


'DMA 
is not interleaved while LOCKis active. 


2Except TSL instruction; see section 3.7. 


Channel program 
priority 
is determined 
by the 
chain bit in the channel control register. If this bit 
is cleared, the program 
runs at normal priority 
(3); if it is set, the program is said to be chained, 
and it runs at the same priority as DMA. Thus, 
the chain bit provides a way to raise the priority 
of a critical channel program. 


The CCU lets the channel with the highest priority 
run. If both channels are running activities with 
the same priority, the CCU examines the priority 
bits in the PSWs. If the priority bits are unequal, 
the channel with the higher value (1) runs. Thus, 
the priority bit serves as a "tie breaker" 
when the 
channels are otherwise at the same priority level. 
The value of the priority bit in the PSW is loaded 
from a corresponding 
bit in the CCW; therefore, 
the CPU can control which channel will run when 
the channels are at the same priority level. The 
priority 
bit 
has 
no 
effect 
when 
the 
channel 
priorities are different. If both channels are at the 
same priority level and if both priority bits are 
equal, the channels run alternately 
without any 
additional overhead. 


The CCU switches channels only at certain points 
called interleave boundaries; these vary according 
to the type of activity running in each channel and 
are shown in table 3-3. In table 3-3 and in the 
following discussion, the terms "channel A" and 
"channel B" are used to identify two active chan- 
nels that 
are bidding 
for control 
of an lOP. 
"Channel A" is the channel that last ran and will 
run again unless the CCU switches to "channel 
B." Where the CCU switches from one channel 
(channel A) to another 
(channel B) depends on 
whether 
channel 
B is performing 
DMA 
or is 
executing 
instructions. 
For 
this determination, 
instructions 
in the internal ROM are considered 
the same as instructions 
executed in user-written 
channel programs (chained or not chained). Table 
3-3 shows that a switch from channel A to chan- 
nel B will occur sooner if channel B is running 
DMA. DMA, then, interleaves instruction execu- 
tion at internal cycle boundaries. 
Since instruc- 
tions 
are 
often 
composed 
of 
several 
internal 
cycles, instruction execution on channel A can be 
suspended by DMA on channel B (when channel 
A next runs, the instruction 
is resumed from the 
point 
of suspension). 
DMA 
on channel 
A is 
interleaved by DMA on channel B after any bus 
cycle (when channel 
A runs again, 
the DMA 
transfer 
sequence is resumed from the point of 
suspension). 
If both channels are executing pro- 
grams, the interleave boundaries are extended to 


instruction boundaries: 
a program on channel B 
will not run until channel A reaches the end of an 
instruction. 
Note 
that 
a 
DMA 
termination 
sequence or channel attention sequence on chan- 
nel A cannot 
be interleaved 
by instructions 
on 
channel 
B, regardless 
of channel 
B's priority. 


These internal programs are short, however, and 
will not delay channel B for long (see Chapter 4 
for timing information). 


Table 
3-4 summarizes 
the 
channel 
switching 
mechanism with several examples. It is important 
to remember that channel switching occurs only 
when both channels are ready to run. In typical 
applications, 
one of the channels 
will be idle 
much of the time, either because it is waiting to be 
dispatched by the CPU or because it is waiting for 
a DMA request in a synchronized transfer. 
(Dur- 
ing a synchronized 
transfer, 
the channel is idle 
between bMA requests; for many peripherals, the 
channel will spend much more time idling than 
executing DMA cycles.) The real potential for one 
channel "shutting out" a priority 1activity on the 
other channel is largely limited to unsynchronized 
DMA transfers 
and locked transfers 
(synchro- 
nized or unsynchronized). 
Long, chained channel 
programs and high-speed synchronized DMA will 
slow a priority 1activity on the other channel, but 
will not shut it out because the channels will alter- 
nate (assuming their priority 
bits are equal). A 
chained channel program will shut out any lower 
priority activity on the other channel, including a 
channel 
attention. 
(The 
channel 
attention 
is 
latched by the lOP, 
however, so it will execute 
when the other channel drops to a lower priority.) 
Chained 
channel programs 
should therefore 
be 
used with discretion and should be made as short 
as possible. 


The 8089 can access memory components located 
in two different address spaces. The system space, 
which coincides with the CPU's 
memory space, 
may contain 
up to 
1,048,576 bytes. 
The 
I/O 
space, which may either coincide with the CPU's 
I/O space or be local (private) to the lOP, may 
contain up to 65,536 bytes. Memory components 
in the system space should respond to the memory 
read and write commands issued by the 8288 Bus 
Controller. Memory components in the I/O space 
must respond to 8288 I/O read and write com- 
mands. 
Memory 
in 
either 
space 
may 
be 


Channel 
A (Ran Last) 
ChannelB 
Result 


Activity 
Chain 
Priority 
LOCK 
Activity 
Chain 
Priority 
Bit 
Bit 
Bit 
Bit 


DMA transfer 
X 
X 
Inactive 
Idle 
X 
X 
A r~ns. 


DMA transfer 
X 
X 
Inactive 
Channel 
attention 
X 
X 
A runs until end of current 
transfer 
cycle; 
then S runs. 


Channel 
program 
X 
0 
Inactive 
Channel 
program 
X 
1 
S runs. 


Channel 
program 
X 
0 
Inactive 
Channel 
program 
X 
0 
A and S alternate 
by 


instruction. 
Channel 
program 
1 
X 
Inactive 
Channel 
program 
0 
X 
A runs. 


DMA transfer 
X 
1 
Inactive 
Channel 
program 
1 
1 
·S 
runs 
one 
bus 
or 
internal 


cycle following 
each bus cycle 


run by A.' 
Channel 
attention 
X 
X 
Inactive 
Channel 
program 
1 
X 
A runs if it has started 
the 


sequence; 
otherwise 
S runs. 


DMA transfer 
X 
X 
Active 
Channel 
attention 
X 
X 
A runs until DMA terminates. 


Channel 
program 
0 
X 
Active 
DMA transfer 
X 
X 
A completes 
TSL instruction, 


(TSL instruction) 
LOCK goes inactive 
and S 
runs. 


implemented like 8086 memory (l6-bit words split 
into even- and odd-addressed 
8-bit banks) or 8088 
memory (a single 8-bit bank). See Chapter 4 for 
physical implementation 
considerations. 


From 
a 
software 
point 
of 
view, 
both 
8089 
memory 
spaces are organized 
as unsegmented 
arrays 
of 
individually 
addressable 
8-bit 
bytes 


(figure 3-19). Instructions and data may be stored 
at any 
address 
without 
regard 
for 
alignment 


(figure 3-20). 


The lOP views the system space differently from 
the 8086 or 8088 with which it typically shares the 
space. The 8086 and 8088 differentiate 
between a 
location's 
logical (segment and offset) 
address 
and its physical (20-bit) address. 


The 8089 does not "see" 
the logically segmented 


structure of the memory space; it uses its 20-bit 
pointer 
registers 
to access all locations 
in the 
system space by their physical addresses. Memory 
in the 8089 I/O space is treated similarly except 
that 
only 
16 bits are 
needed 
to address 
any 
location. 
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Following Intel convention, 
word data is stored 
with 
the 
most-significant 
byte 
in 
the 
higher 
address (see figure 3-21). The 8089 recognizes the 
doubleword pointer variable used by the 8086 and 
8088 (figure 3-22). The lower-addressed 
word of 
the pointer 
contains 
an offset 
value, 
and the 
higher-addressed 
word contains 
a segment ba.se 
address. Each word is stored conventionally, 
with 
the higher-addressed 
byte containing 
the most- 
significant eight bits of the word. The 8089 can 
convert 
a 
doubleword 
pointer 
into 
a 
20-bit 
physical address when it is loaded into a pointer 
register to address system memory. A special 3- 
byte variable, 
called a physical address pointer 
(figure 3-23), is used to save and restore pointer 
registers and their associated·tag bits. 


Dedicated 
and Reserved 
Memory 
locations 


The extreme low and high addresses of the system 
space are dedicated 
to specific processor 
func- 
tions or are reserved for use by other Intel hard- 


ware and software products; the locations are OH 
through 7FH (128 bytes) and FFFFOH through 
FFFFFH (16 bytes), as shown in figure 3-24. The 
low addresses are used for part of the 8086/8088 
interrupt 
pointer 
table. 
Locations 
FFFFOH- 
FFFFBH are used for 8086, 8088 and 8089 startup 
sequences; the remaining 
locations 
are reserved 
by Intel. 


If an lOP is configured locally, its 1/0 space coin- 
cides with the CPU's 
I/O 
space, 
and it must 
respect the reserved 
addresses 
F8H-FFH. 
The 
entire I/O 
space of a remotely-configured 
lOP 
may be used without restriction. 


Using any dedicated 
or reserved addresses may 
inhibit the compatibility of a system with current 
or future Intel hardware and software products. 


The 8089 is very well-suited to environments 
in 
which programs 
do not occupy static memory 
locations, but' are moved about during execution. 
Dynamic code relocation allows systems to make 
efficient 
use of limited 
memory 
resources 
by 
transferring 
programs 
between external 
storage 
and memory, 
and by combining 
scattered 
free 
areas of memory into larger, more useful, con- 
tinuous spaces. 
. 


lOP channel programs 
are inherently 
posItlon- 
independent, 
the only restriction being that chan- 
nel programs 
that 
transfer 
to each 
other 
or 
share data must be moved as a unit. Since the lOP 
, 
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receives the address of a channel program and its 
associated parameter 
block when it is dispatched 
by the CPU, 
the location 
of these blocks 
is 
immaterial and can change from one dispatch to 
the next. (Note, however, that the channel control 
block cannot be moved without reinitializing the 
lOP.) Typically, then, the CPU would direct the 
movement 
of 
lOP 
channel 
programs 
and 
parameter 
blocks. These blocks, of course, can- 
not be moved while they are in use. 


While the CPU may be in charge of relocation, 
the lOP is an excellent vehicle for performing the 
actual transfer 
of channel programs, 
parameter 
blocks, and CPU programs as well. A very simple 
channel 
program 
can 
transfer 
code 
between 
memory locations by DMA much faster than the 
equivalent 
CPU 
instructions, 
and 
transfers 
between disk and memory also can be performed 
more efficiently. 


Memory accesses are always performed 
using a 
pointer register and its associated tag bit. The tag 
bit indicates whether the access is to the system 
space (tag=O) or the I/O 
space (tag=I). 
The 
pointer register contains the base address of the 
location; i.e., the pointer register is used as a base 
register. Only the low-order 16 bits of the pointer 


register are used for I/O space locations; 
all 20 


bits are used for system space addresses. Different 
types of memory accesses use base registers as 
shown in table 3-5. The 8089 addressing modes 
allow the base address of a memory operand to be 
modified by other registers and constant values to 
yield the effective address of the operand (see sec- 
tion 3.8). 


Notice 
that 
table 
3-5 indicates 
that 
memory 


operands may be addressed using register PP in 
addition to GA, GB, and Gc. PP is maintained 
by the lOP and can neither be read nor written by 
a channel program; 
it can be used, however, to 


access data in the parameter 
block. PP has no 


associated tag bit; a reference to it implies the 
system space, where a parameter 
block always 


resides. 


Memory Access 
Base Register 


Instruction 
Fetch 
TP 
DMA Source 
GAorGB' 
DMA Destination 
GA orGB' 
DMA Translate Table 
GC 
Memory Operand 
GA or GB or GC or PP' 


'As specified in CC register 
'As specified in instruction 


The lOP is told the physical widths of the system 
and I/O buses when it is initialized. If a bus is 
eight bits wide, the lOP accesses memory on this 
bus lik~ an 8088. Instruction 
fetches and operand 
reads and writes are performed one byte at a time; 
one bus cycle is run for each memory 
access. 
Word operands are accessed in two cycles, com- 
pletely 
transparent 
to 
software. 
Instruction 
fetches are made as needed, and the instruction 
stream is not queued. 


The lOP accesses memory on a 16-bit bus like an 
8086. As mentioned 
in the previous section, the 
instruction 
stream is generally fetched in words 
from even addresses with the second byte held in 
the one-byte queue. If a word operand is aligned 
(i.e., located at an even address), 
the 8089 will 
access it in a single 16-bit bus cycle. If a word 
operand 
is unaligned 
(i.e., 
located 
at an odd 
address), the word will be accessed in two con- 
secutive 
8-bit 
bus 
cycles. 
Byte 
operands 
are 
always accessed in 8-bit bus cycles. 


For memory 
on 
16-bit buses, 
performance 
is 
improved and bus contention 
is reduced if word 
operands 
are 
stored 
at 
even 
addresses. 
The 
instruction 
queue tends to reduce the effect of 
alignment on instructions fetched on a 16-bit bus, 
In tight loops, performance 
can be increased by 
word-aligning transfer targets. 


Notice that the correct operation of a program is 
completely independent of memory bus width. A 
channel program written for one system that uses 
an 
8-bit 
memory 
bus 
will 
execute 
without 
modification if the bus is increased to 16 bits. It is 
good practice, though, 
to write all programs 
as 
though they are to run on 16-bit systems; i.e., to 
align word operands. 
Such programs 
will then 
make optimal use of the bus in whatever system 
they are run. 


The 
8089 
combines 
the 
programmed 
I/O 
capabilities of a CPU with the high-speed block 
transfer facility of a DMA controller. It also pro- 
vides 
additional 
features 
(e.g., 
compare 
and 
translate during DMA) and is more flexible than a 
typical 
CPU 
or 
DMA 
controller. 
The 
8089 
transfers data from a source address to a destina- 
tion address. 
Whether 
the component 
mapped 


into a given address is actually memory or I/O is 
immaterial. All addresses in both the system and 
I/O spaces are equally accessible, and transfers 
may be made between the two spaces as well as 
within either address space. 


A channel program performs 
I/O similar to the 
way a CPU communicates 
with memory-mapped 
I/O devices. Memory reference instructions 
per- 
form the transfer 
rather 
than "dedicated" 
I/O 
instructions, 
such as the 8086/8088 IN and OUT 
instructions. Programmed 
I/O is typically used to 
prepare a device controller 
for a DMA transfer 
and to obtain status/result 
information 
from the 
controller 
following termination 
of the transfer. 


It may be used, however, with any device whose 
transfer rate does not require DMA. 


Since the 
8089 does 
not 
distinguish 
between 
memory 
components 
and 
I/O 
devices, 
any 
instruction 
that accepts a byte or word memory 
operand 
can be used to access an I/O 
device. 
Most memory reference instructions take a source 
operand or a destination 
operand, 
or both. The 
instructions generally obtain data from the source 
operand, operate on the data, and then place the 
result of the operation in the destination operand. 
Therefore, 
when a source operand 
refers to an 
address where an I/O device is located, 
data is 
input from the device. Similarly, when a destina- 
tion operand refers to an I/O device address, data 
is output to the device. 


Most I/O 
device controllers 
have one or more 
internal 
registers 
that 
accept 
commands 
and 
supply status or result information. 
Working with 
these registers typically involves: 


• reading or writing the entire register; 
• setting or clearing some bits in a register while 
leaving others alone; or 
• testing a single bit in a register. 


Table 3-6 shows some of the 8089 instructions 
that are useful 
for performing 
these kinds of 
operations. 
Secti0n 3.7 covers the 8089 instruc- 
tion set in detail. 


Table 3-6. Memory Reference Instructions 
Used for I/O 


Instruction 
Effect on I/O Device 


MOV/MOVB Read or write word /byte 


AND/ANDB Clear multiple bits in word/ byte 


OR/ORB 
Set multiple bits in word /byte 


CLR 
Clear' single bit (in byte) 


SET 
Set single bit (in byte) 


JBT 
Read (byte) and jump if 
single bit =1 


Read'(byte) and jump if 
, 


JNBT 
single bit =0 


Since memory reference instructions 
are used to 
perform 
programmed 
I/O, 
device addressing. is 
very similar to memory addressing. 
An operand 
that refers to an I/O device always specifies one 
of the pointer registers GA, GB, or GC (PP is 
legal, but an I/O device would not normally be 
mapped 
into 
a 
parameter 
block). 
The 
base 
address of the device is taken from the specified 
pointer register. Any of the memory addressing 
modes (see section 3.8) may be used to modify the 
base address 
to produce 
the effective 
(actual) 
address of the device. The pointer register's tag 
bit locates the device in the system space (tag=O) 
or in the I/O space (tag=l). 
If the device is in 
the I/O space, only the low-order 
16 bits of the 
pointer register are used for the base address; all 
20 bits are used for a system space address. The 
lOP's system and I/O spaces are fully compatible 


with the corresponding 
address 
spaces of the 
other 8086 family processors. 


Table 3-7 shows the number of bus cycles the lOP 
runs for all combinations of bus size, transfer size 
(byte or word), 
and transfer 
address 
(even or 
odd). 
Bus 
width 
refers 
to 
the 
physical 
bus 
implementation; 
the instruction mnemonic deter- 
mines whether a byte or a word is transferred. 


Both 8- and l6-bit devices may reside on a l6-bit 
bus. All l6-bit devices should be located at even 
addresses so that transfers will be performed 
in 
one bus cycle. The 8-bit devices on a l6-bit bus 
may be located at odd or even addresses. 
The 
internal registers in an 8-bit device on a l6-bit bus 
must be assigned all-odd 
or all-even addresses 
that are two bytes apart (e.g., IH, 3H, 5H, or 2H, 
4H, 6H). All 8-bit peripherals 
should be refer- 
enced with byte instructions, 
and 16-bit devices 
should 
be referenced 
with 
word 
instructions. 
Odd-addressed 
8-bit 
devices 
must 
be able 
to 
transfer data on the upper eight bits of the l6-bit 
physical data bus. 


Only 8-bit devices should be connected to an 8-bit 
bus, and these should only be referenced 
with 
byte instructions. 
An 8-bit device on an 8-bit bus 
may be located at an odd or even address, and its 
internal 
registers 
may be assigned 
consecutive 
addresses (e.g., lH, 2H, 3H). Assigning all-odd 
or all-even addresses, however, will simplify con- 
version to a l6-bit bus at a later date. 


Bus Width: 
8 
16 


Instruction: 
byte 
word" 
byte 
word 


Device Address: 
even 
odd 
even 
odd 
even 
odd 
even 
odd" 


Bus Cycles: 
1 
1 
2 
2 
1 
1 
1 
2 


In addition 
to 
byte- 
and 
word-oriented 
pro- 
grammed 
I/O, 
the 8089 can transfer 
blocks of 
data by direct memory access. A block may be 
transferred 
between any two addresses; memory- 
to-memory 
transfers 
are performed 
as easily as 
memory-to-port, 
port-to-memory 
or port-to-port 
exchanges. There is no limitation 
on the size of 
the block that can be transferred 
except that the 
block cannot exceed 64k bytes if byte count ter- 
mination 
is used. A channel 
program 
typical1y 
prepares 
for a DMA transfer 
by writing com- 
mands to a device control1er and initializing chan- 
nel registers that are used during the transfer. No 
instructions 
are 
executed 
during 
the 
transfer, 
however, and very high throughput 
speeds can be 
achieved. 


Most control1ers that can peform DMA transfers 
are quite flexible in that they can perform several 
different 
types of operations. 
For example, an 
8271 Floppy Disk Control1er can read a sector, 
write a sector, seek to track 0, etc. The control1er 
typical1y has one or more internal registers that 
are "programmed" 
to perform a given operation. 
Often, 
certain 
registers 
will 
contain 
status 
information 
that can be read to determine if the 
control1er is busy, if it has detected an error, etc. 


An 8089 channel 
program 
views these device 
registers as a series of memory locations. 
The 
channel program typical1y places the device's base 
address in a pointer register and uses programmed 
I/O to communicate with the registers., 


Some 
control1ers 
start 
a 
DMA 
transfer 
immediately upon receiving the last of a series of 


parameters'. 
If this type of controller 
is being 
used, the channel program instruction 
that sends 
the last parameter 
should follow the 8089 XFER 
instruction. 
(The XFER 
instruction 
places the 
channel in DMA mode after the next instruction; 
this is explained 
in more 
detail 
later 
in this 
section.) 


For a channel to perform a DMA transfer, it must 
be provided with information 
that describes the 
operation. 
The channel 
program 
provides 
this 
information 
by 
loading 
values 
into 
channel 
registers and, in one case, by executing a special 
instruction (see table 3-8). 


Source 
and 
Destination 
Pointers. 
One 
register is loaded to point to the transfer source; 
the other points to the destination. 
A bit in the 
channel control register is set to indicate which 
register 
is the source 
pointer. 
If a register 
is 
pointed at a memory location, 
it should contain 
the address where the transfer is to begin - 
i.e., 
the lowest address 
in the buffer. 
The channel 
automatically increments a memory pointer as the 
transfer proceeds. 
If the tag bit selects the I/O 
space, the upper 
four 
bits of the register are 
ignored; if the tag selects the system space, all 20 
bits are used. The source and destination 
may be 
located in the same or in different address spaces. 


Translate Table Pointer. 
If the data is to be 
translated 
as it is transferred, 
GC should 
be 
pointed at the first (lowest-addressed) 
byte in a 
256-byte translation 
table. 
The 
table 
may be 
looated in either the system or I/O space, and GC 


Source Pointer 
Destination Pointer 
Translate Table Pointer 
Byte Count 
Mask/Compare Values 
Logical Bus Width 
Channel Control 


GAorGB 
GAorGB 
GC 
BC 
MC 
WID 
CC 


Required 
Required 
Optional 
Optional 
Optional 
Optional" 
Required 
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operation 
is only defined for byte data; source 
bytes transferred to the destination. 


and destination 
logical bus widths must both be 


set to eight bits. 


The channel translates a byte by treating it as an 
unsigned 
8-bit binary 
number. 
This number 
is 
added to the content 
of register GC to form a 
memory address; GC is not altered by the opera- 
tion. If GC points to the I/O space, its upper four 
bits are ignored in the operation. The byte at this 
address (which is in the translate 
table) is then 


fetched from memory, replacing the source byte. 
Figure 3-25 illustrates the translate process. 


Byte Count. 
If the transfer is to be terminated 
on byte count- 
i.e., after a specific number of 
bytes have been transferred-the 
desired count 


should be loaded into~register BC as an unsigned 
16-bit number. The channel decrements BC as the 
transfer proceeds, whether or not byte count ter- 
mination 
has 
been 
specified. 
There 
are cases 
(discussed later in this section) where the dif- 


Mask/Compare Values. 
If the transfer is to be 


terminated 
when a byte (possibly translated) 
is 


found equal or unequal 
to a search value, MC 


should be loaded as described in section 3.2. MC 
is not altered during the transfer. 
Normally, 
the 


logical destination 
bus width is set to eight bits 


when transferred 
data is being compared. 
If the 


logical destination width is 16 bits, only the low- 
order byte of each word is compared. 


Logical Bus Width. The 8089 WID (logical bus 
width) instruction is used to set the logical width 
of the source and destination 
buses for a DMA 


transfer. 
Any bus whose physical width is eight 


bits can only have a logical width of eight bits. A 
16-bit physical bus, however, can have a logical 
width of 8 or 16 bits; Le., it can be used as either 
an 8-bit or 
16-bit bus in any given transfer. 


Logical bus widths are set independently for each 
channel. 


00200 
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be set equal to the peripheral's 
width; i.e., 8 or 16 
bits. Transfers to or from 16-bit memory will run 
at maximum speed if the logical bus width is set to 
16 since the channel will fetch/store 
words. In the 
following 
cases, 
however, 
the 
logical 
width 
should be set to 8: 
• 
the data is being translated, 
• 
the data is being compared under mask, and 
the 16-bit memory is the destination 
of the 
transfer. 


The WID instruction sets both logical widths and 
remains in effect until another WID instruction is 
executed. Following processor reset, the settings 
of 
the 
logical 
bus 
widths 
are 
unpredictabfe. 
Therefore, the WID instruction must be executed. 
before the first DMA transfer. 


Channel Control. The 16 bits of the CC register 
are divided into 10 fields that specify how the 
DMA transfer is to be executed (see figure 3-26). 
A channel program typically sets these fields by 
loading a word into the register. 


The 
function 
field 
(bits 
15-14) identifies 
the 
source and destination 
as memory or ports (I/O 
• 


devices). 
During 
the 
transfer, 
the 
channel 
increments 
source/ destination 
pointer 
registers 
that refer to memory 
so that the data will be 
placed in successive locations. Pointers that refer 
to I/O 
devices remain constant 
throughout 
the 
transfer. 


The translate field (bit 13) controls data transla- 
tion. If it is set, each incoming byte is translated 
using 
the 
table 
pointed 
to 
by 
register 
Gc. 


Translate 
is defined only for byte transfers; 
the 
destination bus must have a logical width of eight. 


The synchronization 
field 
(bits 12-11) specifies 
how 
the 
transfer 
is 
to 
be 
synchronized. 
Unsynchronized 
("free 
running") 
transfers 
are 
typically used in memory-to-memory 
moves. The 
channel begins the next transfer cycle immediately 
upon completion of the current cycle (assuming it 
has the bus). Slow memories, which cannot run as 
fast as the channel, 
can extend bus cycles by 
signaling 
"not 
ready" 
to 
the 
8284 
Clock 
Generator, 
which will insert wait states into the 
bus cycle. A similar technique may be used with 
peripherals 
whose speed exceeds the channel's 
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through the use of wait states. Chapter 4 discusses 
synchronization in more detail. 


Source synchronization 
is typically selected when 
the source is an I/O device and the destination is 
memory. The I/O device starts the next transfer 
cycle by activating 
the channel's 
DRQ (DMA 
request) line. The channel then runs one transfer 
cycle and waits for the next DRQ. 


Destination 
synchronization 
is most often used 
when the source is memory and the destination is 
an I/O device. Again, the I/O device controls the 
transfer frequency by signaling on DRQ when it is 
ready to rece,ivethe next byte or word. 


The source field (bit 10) identifies register GA or 
GB as the source pointer (and the other as the 
destin<;ltionpointer). 


The lock field (bit 9) may be used to instruct the 
channel to assert the processor's bus lock (LOCK) 
signal 
during 
the 
transfer. 
In 
a 
source- 
synchronized 
transfer, 
LOCK is active from the 
time the first DMA request is received until the 
channel 
enters the termination 
sequence. 
In a 
destination-synchronized 
transfer LOCK is active 
from 
the first fetch 
(which precedes 
the first 
DMA request) until the channel enters the ter- 
.mination sequence. 


The chain field 
(bit 8) is not used during 
the 
transfer. 
As discussed 
previously, 
setting 
this 
bit raises channel program execution to priority 
level 1. 


Th.e terminate on single transfer field (bit 7) can 
be used to cause the channel to run one complete 
transfer cycle only-i.e., 
to transfer one byte or 
word and immediately resume channel program 
execution'. When single transfer is specified, any 
other termination 
conditions 
are ignored. Single 
transfer terminatIon 
can be used with low-speed 
devices, such as keyboards 
and communication 
lines, to translate and/or 
compare one byte as it 
transferred, 
. 


The three low-order fields in register CC instruct 
the 
channel 
when 
to 
terminate 
the 
transfer, 
assuming 
that 
single 
transfer 
has 
not 
been 
selected. Three termination 
conditions 
may be 
specified singly or in combination. 


£. 
FUNCTION 
00 
PORT TO PORT 
01 
MEMORY TO PORT 
10 
PORT TO MEMORY 
11 
MEMORYTO 
MEMORY 


TFt 
TRANSLATE 
o 
NO TRANSLATE 
1 
TRANSLATE 


SYN 
SYNCHRONIZATION 
00 
NO SYNCHRONIZATION 
01 
SYNCHRONIZE 
ON SOURCE 
10 
SYNCHRONIZE 
ON DESTINATION 
11 
RESERVED BY INTEL 


S 
SOURCE 
o 
GA POINTS TO SOURCE 
1 
GB POINTS TO SOURCE 


L 
LOCK 
o 
NO LOCK 
1 
ACTUATE LOCK DURING TRANSFER 


C 
CHAIN 
o 
NO CHAINING 
1 
CHAINED: 
RAISE TB TO PRIORITY 1 


TS 
TERMINATE 
ON SINGLE TRANSFER 
o 
NO·SINGLE TRANSFER TERMINATION 
1 
TERMINATE 
AFTER SINGLE TRANSFER 


TX 
TERMINATE 
ON EXTERNAL SIGNAL 
00 
NO EXTERNAL TERMINATION 
01 
TERMINATE 
ON EXT ACTIVE; OFFSET = 0 
10 
TERMINATE 
ON EXT ACTIVE; OFFSET = 4 
11 
TERMINATE 
ON EXT ACTIVE; OFFSET = 8 


TBC 
TERMINATE 
ON BYTE COUNT 
00 
NO BYTE COUNT TERMINATION 
01 
TERMINATE 
ON BC = 0; OFFSET = 0 
10 
TERMINATE 
ON BC = 0; OFFSET = 4 
11 
TERMINATE 
ON BC = 0; OFFSET = 8 


TMC 
TERMINATE 
ON MASKED COMPARE 
000 
NO MASK/COMPARE 
TERMINATION 
001 
TERMINATE 
ON MATCH; OFFSET = 0 
010 
TERMINATE 
ON MATCH; OFFSET = 4 
011 
TERMINATE 
ON MATCH; OFFSET = 8 
100 
(NO EFFECT) 
101 
TERMINATE 
ON NON-MATCH; 
OFFSET = 0 
110 
TERMINATE 
ON NON-MATCH; 
OFFSET = 4 
111 
TERMINATE 
ON NON-MATCH; 
OFFSET = 8 


External 
termination 
allows 
an 
I/O 
device 
(typically, 
the 
one 
that 
is synchronizing 
the 
transfer) 
to stop the transfer 
by activating 
the 
channel's 
EXT (external terminate) line. If byte 
count termination 
is selected, 
the channel 
will 
stop when BC=O. If masked compare termination 
is specified, 
the channel 
will stop the transfer 
when a byte is found that is equal or unequal (two 
options are available) to the low-order byte in MC 
as masked by MC's high-order byte. The byte that 
stops the termination 
is transferred. 
If translate 
has 
been 
specified, 
the 
translated 
byte 
is 
compared. 


When a DMA transfer ends, the channel adds a 
value called the termination 
offset to the task 
pointer and resumes channel program execution 
at that point in the program. The termination off- 
set may assume a value of 0, 4, or 8. Single 
transfer termination 
always results in a termina- 
tion offset of O. Figure 3-27 shows how the ter- 
mination 
offsets can be used as indices into a 
three-element 
"jump 
table" 
that identifies 
the 
condition that caused the termination. 


As an example of using the jump table, consider a 
case in which a transfer is to terminate when 80 
bytes 
have 
been 
transferred 
or 
a 
!inefeed 
character is detected, whichever occurs first. The 
program 
would load 80H into BC and OOOAH 
into MC (ASCII line feed, no bits masked). The 
channel program could assign byte count termina- 
tion an offset of 0 and masked compare termina- 
tion an offset of 4. If the transfer is terminated by 
byte count (no line feed is found), the instruction 
at location TP + 0 will be executed first after the 
termination. 
If the linefeed is found before the 
byte count expires, the instruction 
at TP + 4 will 
be executed first. The LJMP (long unconditional 
jump, 
see section 3.7) instruction 
is four bytes 
long and can be placed at TP + 0 and TP + 4 to 
cause the channel program to jump to a different 
routine, 
depending 
on 
how 
the 
transfer 
terminates. 


If the transfer can only terminate in one way and 
that condition is assigned an offset of 0, there is 
no need for the jump table. Code which is to be 
unconditionally 
executed when the transfer ends 
can 
immediately 
follow 
the 
instruction 
after 
XFER. This is also the case when single transfer is 
specified (execution always resumes at TP + 0). 


It is possible, however, for two, or even three, ter- 
mination conditions to arise at the same time. In 


OFFSET_a_CODE:! 
1 


EXECUTED 
IF TERMINATION 
T 
OFFSET. 
0 
J 


OFFSET _4_cODE:I 
1 


EXECUTED 
IF TERMINATION 
T 
OFFSET. 
4 
T 


OFFSET 
.I_CODE:! 
1 


EXECUTED 
IF TERMINATION 
T 
OFFSET_' 
T 


the preceding example, 
this would occur if the 
80th character were a !inefeed. When multiple ter- 
minations 
occur 
simultaneously, 
the 
channel 
indicates that termination 
resulted from the con- 
dition 
with 
the 
largest 
offset 
value. 
In 
the 
preceding example, if byte count and search ter- 
mination occur at the same time, the channel pro- 
gram resumes at TP + 4. 


The 8089 XFER (transfer) 
instruction 
puts the 
channel 
into 
DMA 
transfer 
mode 
after 
the 
following 
instruction 
has been executed. 
This 
technique gives the channel time to set itself up 
when it is used with device controllers, such as the 
8271 Floppy Disk Controller, 
that begin transfer- 
ring immediately 
upon receipt of the last in a 
series of parameters or commands. If the transfer 
is to or from such a device, the last parameter 
should 
be sent to the device after 
the XFER 
instruction. 
If this type of device is not being 
used, 
the 
instruction 
following 
XFER 
would 
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troller. If a memory-to-memory 
transfer is being 


made, any instruction 
may follow XFER except 


one that alters GA, GB, or CC. The HL T instruc- 
tion 
should 
normally 
not 
be coded 
after 
the 


XFER; doing so clears the channel's BUSY flag, 
but allows the DMA transfer to proceed. 


A DMA transfer cycle is illustrated in figure 3-28; 
a complete transfer is a series of these cycles run 
until a termination 
condition is encountered. 
The 


figure is deliberately 
simplified 
to explain the 


general operation 
of a DMA transfer; 
in par- 
ticular, the updating of the source and destination 
pointers (GA and GB) can be more complex than 
the figure indicates. Notice that it is possible to 
start an unending transfer by not specifying a ter- 
mination condition in CC or by specifying a con- 
dition that never occurs; it is the programmer's 
responsibility to ensure that the transfer eventu- 
ally stops. 


If the transfer is source-synchronized, 
the channel 


waits until the synchronizing device activates the 
channel's DRQ line. The other channel is free to 
run during this idle period. The channel fetches a 
byte or a word, depending on the source address 
(contained 
in GA or GB) and the logical bus 
width. Table 3-9 shows how a channel performs 
the fetch/store 
sequence for all combinations 
of 
addresses and bus widths. If the destination is on 
a l6-bit logical bus and the source is on an 8-bit 
logical bus, and the transfer is to an even address, 
the channel fetches a second byte and assembles a 
word internally. 
During each fetch, the channel 


decrements 
BC according 
to whether a byte or 


word is obtained. 
Thus BC always indicates the 
number of bytes fetched. 


The channel samples its EXT line after every bus 
cycle in the transfer. 
If EXT is recognized after 


the first of two scheduled 
fetches, 
the second 


fetch is not run. After the fetch sequence has been 
completed, the channel translates the data if this 
option is specified in CC. 


If a word has been fetched or assembled, 
and 


bytes are to be stored (destination bus is eight bits 
or transfer 
is to an odd address), 
the channel 


disassembles 
the word 
into 
two bytes. 
If the 


transfer 
is destination-synchronized 
(only 
one 


Address 
Logical Bus Width 


(Source ....• 
Source ....•Destination) 


Destination) 
8....•8 
8....•16 
16....•8 
16....•16 


EVEN ....•EVEN 
B....•B B/B ....•W W....•B/B W....•W 


EVEN ....•ODD 
B....•B B....•B 
W....•B/B W....•B/B 


ODD ....•EVEN 
B....•B B/B ....•W B....•B 
B/B ....•W 


ODD ....•ODD 
B....•B B....•B 
B....•B 
B....•B 


B= Byte Fetched or Stored in 1 Bus Cycle 
W= Word Fetched or Stored in 1 Bus Cycle 
B/B= 2 Bytes Fetched or Stored in 2 Bus Cycles 


type of synchronization 
may be specified for a 
given transfer), the channel waits for DRQ before 
running 
a store cycle. It stores a word or the 


lower-addressed byte (which may be the only byte 
or the first of two bytes). Table 3-9 shows the 
possible combinations of even/odd addresses and 
10}5icalbus widths that define the store cycle. 
Whenever stores are to memory on a 16-bit logical 
bus, the channel stores words, except that bytes 
may be stored on the first and last cycles. 


The channel samples EXT again after the first 
store cycle and, if it is active, the channel prevents 
the second store cycle from running. If specified 
in the CC register, the low-order byte is compared 
to the value in MC. A "hit" 
on the comparison 
(equal 
or 
unequal, 
as indicated 
in CC) 
also 


prevents the second of two scheduled store cycles 
from running. In both of these cases, one byte has 
been "overfetched," 
and this is reflected in BC's 
value. It would be unusual, 
however, for a syn- 
chronizing device to issue EXT in the midst of a 
DMA cycle. Note also that EXT is valid only 
when DRQ is inactive. Chapter 4 covers the tim- 
ing requirements for these two signals in detail. 


GA and GB are updated 
next. 
Only memory 


pointers are incremented; pointers to I/O devices 
remain constant throughout the transfer. 


If any termination condition has occurred during 
this cycle, the channel stops the transfer. 
It uses 


the content of the CC register to assign a value to 
the termination 
offset, to reflect the cause of the 
termination. 
The channel adds this offset to TP 


and resumes channel program 
execution 
at the 


location now addressed 
by TP. This offset will 


always be zero, four, or eight bytes past the end 
of the instruction 
following the XFER instruc- 
tion. 


If 
no 
termination 
condition 
is detected 
and 
another 
byte remains to be stored, 
the channel 
stores this byte, waiting for DRQif 
necessary, 
and updates the source and destination 
pointers. 
After the store, it again checks for termination. 


A DMA transfer updates register Be, register GA 
(if it points to memory), 
and register GB (if it 
points to memory). 
If the original contents 
of 
these registers are needed following the transfer, 
the contents should be saved in memory prior to 
executing the XFER instruction. 


A program may determine the address of the last 
byte stored by a DMA transfer by inspecting the 
pointer 
registers 
as shown 
in table 
3-10. The 


number of bytes stored is equal to: 


last_byte_address - first_byte_address 
+ 1. 


For port-to-port 
transfers, 
the number of bytes 


transferred 
can be determined by subtracting the 


final value of BC from its original value provided 
that: 


• 
the original BC > final BC, 


• 
a transfer cycle is not "chopped 
off" before 
it completes by a masked compare or exter- 
nal termination. 


In general, programs should not use the contents 
of GA, GB and BC following a transfer except as 
noted above and in table 3-10. This is because the 
contents of the registers are affected by numerous 
conditions, 
particularly 
when the transfer is ter- 


minated by EXT. In particular, 
when a program 


is performing 
a sequence of transfers, 
it should 


reload these registers before each transfer. 


The 
8089 shares 
the multiprocessing 
facilities 
common to the 8086 family of processors. It has 
on-chip logic for arbitrating 
the use of the local 


bus with a CPU 
or another 
lOP; 
system bus 


arbitration 
is delegated to an 8289 Bus Arbiter. 


The 8089's TSL (test and set while locked) in- 
struction enables it to share a resource, such as a 
buffer, 
with 
other 
processors 
by 
means 
of 
semaphore (see section 2.5 for a discussion of the 
use of semaphores 
to control 
access to shared 


resources). Finally, the 8089 can lock the system 
bus for the duration of a DMA transfer to ensure 
that the transfer 
completes without interference 


from other processors on the bus. 


In the remote configuration, 
the 8089 is electric- 


ally compatible 
with Intel's 
Multibus ™ 
multi- 
master bus design. This means that the power and 
convenience of 8089 I/O processing can be used 
in 8080- or 8085-based systems that implement the 
Multibus 
protocol 
or 
a 
super set 
of 
it. 
This 


includes single-board 
computers 
such as Intel's 


iSBC 80120™ and iSBC 80/30™ boards. In addi- 
tion, 
the 
lOP 
can 
access 
other 
iSBC board 


products 
such as memory and communications 
controllers. 


The 8089 shares its system bus with a CPU, and 
may also share its I/O bus with an lOP or another 
CPU. Only one processor at a time may drive a 
bus. When two (or more) processors want to use a 
shared bus, the system must provide an arbitra- 
tion mechanism that will grant the bus to one of 
the processors. 
This section 
describes 
the bus 
arbitration 
facilities that may be used with the 


8089 and covers their applicability 
to different 


lOP configurations. 


Termination 
Source 
Destination 
Synchronization 
Last Byte Stored 


memory 
memory 
any 
destination pointer' 


byte count 
memory 
port 
any 
source pointer 


port 
memory 
any 
destination pointer 


memory 
memory 
any 
destination pointer 


masked compare 
memory 
port 
any 
source pointer 


port 
memory 
any 
destination pointer 


memory 
memory 
unsynchronized 
destination pointer 


external 
memory 
port 
destination 
source pointer' 


port 
memory 
source 
destination pointer 


'Source pointer may also be used. 
'If transfer is BI B-W, source pointer must be decremented by 1to point to last byte transferred. 


When 
an 
8089 
is 
directly 
connected 
to 
another 
8089, 
an 
8086 
or 
an 
8088, 
the 
RQ/GT 
(request/grant) 
lines built 
into all of 
these processors 
are used to arbitrate 
use of a 
local bus. 
In the local mode, 
RQ/GT 
is used 
to control 
access to both 
the system and the 
I/O bus. 


As 
discussed 
in 
section 
2.6, 
the 
CPU's 
request/grant 
lines 
(RQ/GTO 
and 
RQ/GTl) 
operate as follows: 


• 
an external processor 
sends a pulse to the 
CPU to request use of the bus; 


• 
the CPU finishes its current bus cycle, if one 
is in progress, ,and sends a pulse to the pro- 
cessor to indicate that it has been granted the 
bus; and 


• 
when the external processor is finished with 
the bus, it sends a final pulse to the CPU, to 
indicate that it is releasing the bus. 


The 8089's request/grant 
circuit can operate in 
two modes; the mode is selected when the lOP is 
initialized (see section 3.6). Mode 0 is compatible 
with 
the 
8086/8088 
request/grant 
circuit 
and 
must be specified when the 8089's RQ/GT line is 
connected 
to RQ/GTO 
or RQ/GTI 
of one of 
these",£PUs. 
Mode 0 may 
~ 
s~ified 
when 
RQ/GT of one 8089 is tied to RQ/GT of another 
8089. When mode 0 is used with a CPU, the CPU 
is 
designated 
the 
master, 
and 
the 
lOP 
is 
designated a slave. When mode 0 is used with 
another lOP, one lOP is the master, and the other 
is the slave. Master/slave 
designation also is made 
at initialization 
time as discussed' in section 3.6. 
The master has the bus when the system is in- 
itialized and keeps the bus until it is requested by 
the slave. When the slave requests the bus, the 
master grants it if the master is idle. In this sense, 
the CPU becomes idle at the end of the current 
bus cycle. An lOP master, 
on the other hand, 
does not become idle until both channels have 
halted program execution or are waiting for DMA 
requests. Once granted the bus, the slave (always 
an lOP) uses it until both channels are idle, and 
then releases it to the master. 
In mode 0, the 
master 
has no way of requesting 
the slave to 
return the bus. 


Mode I operation of the request/grant 
lines may 
only be used to arbitrate use of a private I/O bus 


between 
two lOPs. 
In this case, 
one lOP 
is 
designated the master, and the other is designated 
the slave. However, the only difference between a 
master and a slave running in mode I is that the 
master has the bus at initialization 
time. Both 
processors may request the bus from each other at 
any time. The processor 
that has the bus will 
grant it to the requester 
as soon as one of the 
following occurs on either channel: 


• 
an unchained channel program instruction is 
completed, or 


• 
a channel goes idle due to a program halt or 
the completion 
of a synchronized 
transfer 
cycle (the channel waits for a DMA request). 


Execution of a chained channel program, a DMA 
termination 
sequence, 
a 
channel 
attention 
sequence, or a synchronized DMA transfer (i.e., a 
high-priority 
operation) 
on 
either 
channel 
prevents the lOP from granting 
the bus to the 
requesting lOP. 


The handshaking sequence in mode I is: 


• 
the 
~uesting 
processor 
pulses 
once 
on 
R:Q/GT; 


• 
the 
processor 
with 
the 
bus 
grants 
it by 
pulsing once; and 


• 
if the processor 
granting 
the bus wants it 
back immediately (for example, to fetch the 
next instruction), 
it will pulse RQ/GT again, 
two clocks after the grant pulse. 


The 
fundamental 
difference 
between 
the 
two 
modes is the frequency with which the bus can be 
switched between the two processors when both 
are active. In mode 0, the processor that has the 
bus will tend to keep it for relatively long periods 
if it is executing a channel program. 
Mode I in 
effect places unchained 
channel programs 
at a 
lower priority since the processor will give up the 
bus at the end of the next instruction. 
Therefore, 
when both processors 
are running channel pro- 
grams or synchronized DMA, they will share the 
bus more 
or less equally. 
When 
a processor 
changes to what would typically be considered a 
higher-priority 
activity such as chained program 
execution or DMA termination, 
it will generally 
be able to obtain the bus quickly and keep the bus 
for the duration of the more critical activity. 
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When an lOP is configured remotely, an 8-289Bus 
In the local configuration, 
all bus arbitration 
is 
Arbiter is used to control its access to the shared 
performed 
by the 
request/grant 
lines without 


system bus (the CPU also has its own 8289). In a 
additional hardware. One lOP may be connected 
remote cluster of two lOPs or an lOP and a CPU, 
to each of the CPU's RQ/GT lines. The lOP con- 
one 8289 controls 
access to the system bus for 
nected to RQ/GTO will obtain the bus if both pro- 
both 
processors 
in the cluster. 
The 8289 has 
cessors make simultaneous requests. 


several operating modes; when used with an 8089, 
the 8289 is usually 
strapped 
in its lOB (I/O 
Peripheral Bus) mode. 


The 8289 monitors the lOP's 
status lines. When 
these indicate that the lOP needs a cycle on the 
system bus, and the lOP does not presently have 
the bus, the 8289 activates a bus request signal. 
This signal, along with the bus request lines of 
other 8289s on the same bus, can be routed to an 
external priority-resolving 
circuit. At the end of 
the current bus cycle, this circuit grants the bus to 
the requesting 
8289 with the highest 
priority. 
Several different 
prioritizing 
techniques may be 
used; in a typical system, an lOP would have 
higher bus priority than a CPU. If the 8289 does 
not obtain the bus for its processor, it makes the 
bus appear "not ready" as if a slow memory were 
being accessed. The processor's 
clock generator 
responds to the "not ready" 
condition by insert- 
ing wait states into the lOP's 
bus cycle, thereby 
extending the cycle until the bus is acquired. 


When the CPU initializes an lOP, it must inform 
the lOP whether it is a master or a slave, and 
which request/grant 
mode is to be used. This sec- 
tion 
covers 
the 
requirements 
and 
options 
available for each lOP configuration; 
section 3.6 
describes how the information 
is communicated 
at initialization time. 


Since a s~le 
lOP in a remote configuration 
does 
not use RQ/GT, 
its mode may be set to 0 or I 
without affect. The single remote lOP, however, 
must be initialized as a master. 
If two remote 
lOPs share an I/O bus, one must be a master and 
the other a slave; both must be initialized to use 
the same request/grant 
mode. Normally, mode I 
will be selected for its improved responsiveness, 
and the designation of master will be arbitrary. 
If 
one lOP must have the I/O bus when the system 
comes up, it should be initialized as the master. 


When a remote lOP shares its I/O 
bus with a 
local CPU, 
it must 
be a slave and must 
use 
request/grant 
mode O. 


A locally configured 
lOP effectively has higher 
bus priority 
than the CPU since the CPU will 
grant the bus upon request from the lOP. One or 
two local lOPs can potentially 
monopolize 
the 
bus at the expense of the CPU. Of course, if the 
lOP activities are time-critical, this is exactly what 
should happen. On the other hand, there may be 
low-priority 
channel 
programs 
that 
have 
less 
demanding performance requirements. 


In such cases, the CPU may set a CCW bit called 
bus load limit to constrain the channel's use of the 
bus during normal (unchained) channel program 


Master/ 
Slave 
RQ/GT 
Mode 
Master/ 
Slave 


Remote With 
Local CPU 


ROtGT 
Mode 
Master/ 
Slave 
RQ/GT 
Mode 


Sameas 
Master 


execution. 
When 
this 
bit 
is set, 
the channel 
decrements a 7-bit counter from 7F (127) to OH 
with each instruction executed. Since the counter 
is decremented once per clock period, the channel 
waits a minimum of 128 clock cycles before it exe- 
cutes the next instruction. 
By forcing the execu- 
tion time of all instructions to 128 clocks, the use 
of the bus is reduced to between 3 and 25 percent 
of the available bus cycles. 


Setting the bus load limit effectively enables a 
CPU to slow the execution of a normal channel 
program, 
thus freeing up bus cycles. This is of 
most use in local configurations, 
but also may be 
effective in remote 
configurations, 
particularly 
when channel programs are executed from system 
memory. Bus load limit has no effect on chained 
channel 
programs, 
DMA transfers, 
DMA ter- 
mination, or channel attention sequences. 


Like the 8086 and 8088, the 8089 has a LOCK 
(bus lock) signal which can be activated by soft- 
ware. The LOCK output is normally connected to 
the LOCK input of an 8289 Bus Arbiter. 
When 
LOCK is active, the bus arbiter will not release the 
bus to another processor regardless of its priority. 
A channel automatically locks the bus during exe- 
cution of the TSL (test and set while locked) 
instruction and may lock the bus for the duration 
of a DMA transfer. 


If bit 9 of register CC is set, the 8089 activates its 
LOCK output 
during a DMA transfer 
on that 
channel. If the transfer is synchronized, 
LOCK is 
active 
from 
the 
time 
that 
the 
first 
DRQ 
is 
recognized. 
If the transfer 
is unsynchronized, 
LOCK is active throughout 
the entire transfer 
(there are no idle periods in an unsynchronized 
transfer). 
LOCK goes inactive when the channel 
begins the DMA termination sequence. 


A locked transfer ensures that the transfer will be 
completed in the shortest possible time and that 
the transferring 
channel has exclusive use of the 
bus. Once the channel obtains the bus and starts a 
locked transfer, 
the channel, in effect, becomes 
the highest-priority processor on that bus. 


The 
8089 TSL 
(test 
and 
set 
while 
locked) 
instruction 
can 
be 
used 
to 
implement 
a 
semaphore. 
(See section 2.5 for a discussion of 
how a semaphore 
may be used to control 
the 


access 
of 
multiple 
processors 
to 
a 
shared 
resource.) 
The instruction 
activates 
LOCK and 
inspects the value of a byte in memory. 
If the 
value of the byte is OH, it is changed (set) to a 
value specified in the instruction and the follow- 
ing instruction 
is executed. If the byte does not 
contain OH, control is transferred to another loca- 
tion specified in the instruction. The bus is locked 
from the time the byte is read until it is either writ- 
ten or control is transferred to ensure that another 
processor does not access the variable after TSL 
has read it, but before 
it has updated 
it (i.e., 
between bus cycles). The following line of code 
will repeatedly test a semaphore pointed to by GA 
until it is found to contain zero: 


When the semaphore is found to be zero, it is set 
to FFH and the program continues with the next 
instruction. 


3.6 
Processor 
Control and 
Monitoring 


This section focuses on lOP /CPU 
interaction, 
i.e., how the CPU initializes the lOP and sub- 
sequently sends commands to channels, and how 
the channels 
may interrupt 
the CPU. 
It also 
covers the channels' DMA control signals and the 
status signals that 
external 
devices can use to 
monitor lOP activities. 


Before the 8089 channels can be dispatched 
to 
perform 1/0 tasks, the lOP must be initialized. 
The initialization sequence (figure 3-29) provides 
the lOP with a definition of the system environ- 
ment: physical bus widths, request/grant 
mode, 


and the location of the channel control block. 


The sequence begins when the lOP's RESET line 
is activated. This halts any operation in progress, 
but does not affect any registers. Upon the first 


WAIT 
FOR 
CHANNEL 
ATTENTION 


READ 
INITIALIZATION 
CONTROL 
BLOCKS 


WAIT 
FOR 
CHANNEL 
ATTENTION 


PREPARE 
INITIALIZATION 
CONTROL 
BLOCKS 


ISSUE 
CHANNEL 
ATTENTION 


lOP IS READY; 


CPU MAY 
INITIALIZE 
ANOTHER 
lOP 


RESET after power-up, 
the content of all lOP 


registers 
is undefined. 
Register 
contents 
are 


preserved 
if the lOP 
is subsequently 
RESET, 
except that RESET always clears the chain bit in 
register CC. 


The lOP initializes itself by reading jnformation 
from initialization 
control blocks located in the 


system space (see figure 3-30). The three blocks 
are the SCP (system configuration 
pointer), SCB 


(system configuration 
block) and the CB (channel 


control block). The CB is normally RAM-based; 


the SCP and the SCB may be in RAM or ROM. It 
is the CPU's responsibility to properly setup the 
control blocks. 


The CPU starts the initialization sequence by issu- 
ing a channel attention to channel I (SEL low) or 
to channel 
2 (SEL high). 
The CPU 
typically 


accesses the channels as two consecutive addresses 
in its I/O or memory space. An OUT instruction 
(for an I/O-mapped 
lOP) or a memory reference 


instruction (such as MOV) then issues the channel 
attention. 


SYSTEM 
CONFIGURATION 
POINTER 
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FFFFEH 


FFFFCH 
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FFFFAH 
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CONFIGURATION 
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If channel I is selected (SEL=low), the lOP con- 
siders itself a master (as discussed in section 3.5). 
If channel 
2 is selected (SEL=high), 
the lOP 
operates as a slave. The lOP ignores, and does 
not latch, any subsequent channel attentions that 
occur during initialization. 


If the lOP is a master, it assumes that it has the 
bus immediately. If it is a slave, it pulses RQ/GT 
to request the bus from the CPU (local configura- 
tion) or the other lOP 
(remote ,configuration). 


When the lOP has obtained the bus, it assumes 
that the system bus is eight bits wide and reads the 


rrrrot1. 
In sysrem memory_ 
InlS nyte-reus 
tfie-----...-U~~,-LTI'""'"l~lOl""~ 
,:a.lll~v~ u\,,; 
H1VV\,,;U 
UUJ.HJ.5 
\";.I\.\,,,- 


lOP the actual physical width of the system bus; 
cutlOn except by relllitlahzlllg the lOP. 


all subsequent accesses take advantage of a 16-bit 
bus if it is available; i.e., even-addressed 
words 
are fetched in single bus cycles. It is therefore 
advantageous to word-align the control blocks. 


Next, the lOP reads the SCB address located at 
FFFF8H. This is a standard doubleword pointer, 
and the lOP constructs a 20-bit physical address 
from it by shifting the segment base left four bits 
and adding the offset word of the pointer. 


Having obtained the SCB address, the lOP reads 
the SOC (system operation command). 
This byte 


(see figure 3-32) tells the lOP the request/grant 
mode and the width of the I/O bus. 


R = REQUESTIGRANT 
MODE 
I = 0 = 8-BIT 
1/0 BUS 
I = 1 = 16-BIT 
1/0 BUS 


Then the lOP reads the doubleword pointer to the 
channel control block, converts the pointer into a 
20-bit physical address, and stores it in an internal 
register. This register is not accessible to channel 


After loading the address of the CB, the lOP 
clears the channell 
BUSY flag to OH. The other 


fields in the CB are used when a channel is dis- 
patched 
and 
are 
not 
read 
or 
altered 
in the 
initialization sequence. 


After 
the 
CPU 
has 
started 
the 
initialization 


sequence, it should monitor 
channell's 
BUSY 


flag in the CB to determine when the sequence has 
been completed. 
When the BUSY flag has been 
cleared, the CPU can dispatch either channel. It 
also can begin the initialization 
of another lOP. 


Since each lOP normally has a separate CB, the 
CPU must allocate the CB and update the pointer 
in the SCB before initializing the next lOP. Alter- 
natively, multiple SCBs could be employed, each 
pointing to a different CB area. In this case the 
CPU would update the pointer in the SCP before 
initializing the next lOP. It follows from this that 
in multi-lOP systems, either the SCB or SCP, or 
both, must be RAM-based. 
When all lOPs have 


been initialized, the CPU may use RAM occupied 
by the SCB for another purpose. 


After 
initialization, 
any 
channel 
attention 
is 


interpreted 
as 
a 
command 
to 
channel 
1 


(SEL=low) 
or 
to 
channel 
2 (SEL=high). 
As 


discussed in section 3.2, the channel attention, 
depending on the activities of both channels, may 
not 
be 
recognized 
immediately. 
The 
channel 


attention 
is latched, 
however, so that it will be 


serviced as soon as priorities allow. 


When the channel recognizes the CA, it sets its 
BUSY flag in the CB to FFH. This does not pre- 
vent the CPU from issuing another CA, but pro- 
vides status information 
only. In its response to a 


CA, the channel reads various control fields from 
system memory. 
It is the responsibility 
of the 


CPU to ensure that the appropriate 
fields are 


properly initialized before issuing the CA. 


After setting its BUSY flag, the channel reads its 
CCW from the CB. It examines the command 
field (see figure 3-33) and executes the command 
encoded there by the CPU. 


7 
EEEEI 


o 


CF I 


CF 
COMMAND 
FIELD 


000 
UPDATE 
PSW 
001 
START CHANNEL 
PROGRAM 
LOCATED 
IN I/O SPACE. 
010 
(RESERVED) 
011 
START CHANNEL 
PROGRAM 
LOCATED 
IN SYSTEM SPACE. 
100 
(RESERVED) 
101 
RESUME SUSPENDED 
CHANNEL 
OPERATION 
110 
SUSPEND 
CHANNEL 
OPERATION 
111 
HALT CHANNEL OPERATION 


ICF 
INTERRUPT 
CONTROL 
FIELD 
00 
IGNORE, 
NO EFFECT 
ON INTERRUPTS. 
01 
REMOVE 
INTERRUPT 
REQUEST; 
INTERRUPT 
IS ACKNOWLEDGED. 


10 
ENABLE 
INTERRUPTS. 
11 
DISABLE 
INTERRUPTS. 


B 
BUS LOAD LIMIT 
o 
NO BUS LOAD LIMIT 
1 
BUS LOAD LIMIT 


P 
PRIORITY 
BIT 


Figure 3-34 illustrates the channel's 
response to 
each type of command. Note that if CF contains a 
reserved 
value 
(010 
or 
100), 
the 
channel's 
response is unpredictable. 


The CPU can use the "update 
PSW" command 


to alter the bus load limit and priority bits in the 
PSW (see figure 3-17) without otherwise affecting 
the channel. This command also allows the CPU 
to control interrupts 
originating 
in the channel; 
this topic is discussed in more detail later in this 
section. 


The two "start 
program" 
commands differ only 
in their affect on the TP tag bit. If CF=OO1, the 
channel sets the tag to 1 to indicate that the pro- 
gram resides in the I/O space. If CF=OII, the tag 
is cleared to 0, and the program is assumed to be 
in the system space. The channel 
converts the 
doubleword 
parameter 
block pointer to a 20-bit 
physical address and loads this into PP. It loads 
the doubleword 
task block 
(channel 
program) 
pointer into TP, updates the PSW as specified by 
the ICF, Band P fields of the CCW and starts the 
program with the instruction pointed to by TP. 


The 
CPU 
may 
suspend 
a channel 
operation 


(either program execution or DMA transfer) by 
setting CF to 110. The channel saves its state (TP, 
its tag bit, and PSW) in the first two words of the 
parameter block (see figure 3-18 for format) and 
clears its BUSY flag to OH. Note the following in 
regard to a suspended operation: 


• 
The content of the doubleword pointer to the 
beginning of the channel program is replaced 
by the channel state save data. Therefore, 
a 


suspended 
operation 
may be resumed, 
but 


cannot be started from the beginning without 
recreating the doubleword pointer. 


• 
TP 
is 
the 
only 
register 
saved 
by 
this 


operation. 
If another 
channel 
program 
is 


started on this channel, the other registers, 
including 
PP, 
are 
subject 
to being 
over- 


written. In general, suspend is used to tem- 
porarily halt a channel, not to "interrupt" 
it 


with another program. Section 3.10 provides 
an example of a program that can be used to 
save another program's registers. 
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• 
Suspending a DMA transfer does not affect 
any I/O 
devices ~an I/O 
deviee will act as 
though the transfer is proceeding). The CPU 
must provide for conditions that may arise if, 
for 
example, 
a 
device 
requests 
a 
DMA 
transfer, 
but 
the 
channel 
does 
not 
acknowledge the request because it has been 
suspended. Similarly, an I/O device may be 
in a different condition when the operation is 
resumed. 


A suspended operation may be resumed by setting 
CF to 101. This command causes the channel to 
reload TP, its tag bit, and the PSW from the first 
two words of PB. Resuming an operation that has 
not been suspended will give unpredictable results 
since the first two words of PB will not contain 
the required channel state data. A resume com- 
mand does not affect any channel registers other 
than TP. 


The CPU 
may abort 
a channel 
operation 
by 
issuing a "halt" 
command (CF=III). 
The chan- 
nel clears its BUSY flag to OH and then idles. 
Again, the CPU must be prepared for the effect 
aborting 
a DMA transfer 
may have on an I/O 
device. 


The synchronizing device in a DMA transfer uses 
the DRQ line to indicate when it is ready to send 
or receive the next byte or word. The channel 
recognizes a signal on this line ,only during 
a, 


DMA transfers, i.e., after the instruction follow- 
ing XFER has been executed and before a ter- 
mination 
condition 
has occurred. 
The channels 
have separate 
DMA 
request 
lines (DRQI 
and 
DRQ2). 


An external device (typically the synchronizing 
device) can terminate a DMA transfer by signal- 
ing on this line. Each channel has its own external 
terminate 
line (EXT! 
and EXT2). The channel 
stops the transfer as soon as the current fetch or 
store cycle is completed. An external terminate in 
an unsynchronized 
transfer could result in a loss 
of data, although this would not be a typical use 
of EXT. 
In a synchronized 
traQsfer, 
the syn- 
chronizing device will normally issue EXT instead 


of DRQ following the last transfer cycle. If EXT 
is activated during a transfer cycle, a fetched byte 
may not be stored as explained in section 3.4. 


A channel does not recognize EXT if it is not per- 
forming a DMA transfer. If EXT! and EXT2 are 
activated 
simultaneously, 
EXT! 
is recognized 
first. 
. 


Each channel has a separate system interrupt line 
(SINTRI and SINTR2). A channel program may 
generate a CPU interrupt 
request by executing a 
SINTR 
instruction. 
Whether 
this 
instruction 
actually 
activates 
the 
SINTR 
line, 
however, 


depends upon the state of the interrupt control bit 
(bit 3 of the PSW; see figure 3-17). If this bit is 
set, interrupts from the channel are enabled, and 
execution 
of 
the 
SINTR 
instruction 
activates 
SINTR. If the interrupt control bit is cleared, the 
SINTR instruction has no effect; interrupts 'from 
the channel are disabled. 


The CPU can alter a channel's 
interrupt control 
bit by sending any' command to the channel with 
the value of ICF (interrupt 
control field) in the 
CCW set to 10 (enable) or II (disable). Thus, the 
CPU can prevent interrupts from either channel. 


Once activated, 
SINTR remains active until the 
CPU sends a channel command with ICF set to 01 
(interrupt 
acknowledge). 
When 
the 
channel 
receives this command, 
it clears the interrupt ser- 
vice bit iri the PSW (figure 3-17) and removes the 
interrupt request. Disabling interrupts also clears 
the interrupt service bit and lowers SINTR. 


The lOP emits signals on the SO-S2status lines to 
indicate to external devices the type of bus cycle 
the processor 
is starting. 
Table 3-12 shows the 
signals that are output 
for each type of cycle. 


These status lines are connected to an 8288 Bus 
Controller. The bus controller decodes these lines 
and outputs the signals that control components 
attached to the bus. The lOP indicates "instruc- 
tion fetch" on these lines when it is reading and 
writing memory operands as well as when it is fet- 


ched instructions. 
In the remote configuration, 
an 
8289 Bus Arbiter monitors the SO-S2 status lines 
to determine when a system bus access is required. 


52 51 50 
Type of Bus Cycle 


0 
0 
0 
Instruction fetch from I/O space 


0 
0 
1 
Data fetch from 1/0 space 


0 
1 
0 
Data store to 1/0 space 


0 
1 
1 
(not used) 


1 
0 
0 
Instruction fetch from system 
space 


1 
0 
1 
Data fetch from system space 


1 
1 
0 
Data store to system space 


1 
1 
1 
Passive; no bus cycle run 


Status lines S3-S6 indicate whether the bus cycle is 
DMA or non-DMA, 
and which channel is run- 


ning the cycle (see table 3-13). Note that when the 
lOP is not running a bus cycle (e.g., when it is idle 
or when it is executing an internal cycle that does 
not use the bus), the status lines reflect the last 
bus cycle run. 


56 55 54 53 
Bus Cycle 


1 
1 
0 
0 
DMAcycle on chann~11 


1 
1 
0 
1 
DMAcycle on channel 2 


1 
1 
1 
0 
Non-DMAcycle on channel 1 


1 
1 
1 
1 
Non-DMAcycle on channel 2 


This section .divides the lOP's 53 instructions into 
five functional categories: 
1. 
data transfer, 


2. 
arithmetic, 


3. 
logic and bit manipulation, 


4. 
program transfer, 
5. 
processor control. 
II 


The 
description 
of 'each 
instruction 
in these 


categories explains how the instruction 
operates 


and how it may be used in channel programs. 
Instructions 
that 
perform 
essentially 
the same 


operation 
(e.g., 
ADD and ADDB, 
which add 
words 
and 
bytes 
respectively), 
are 
described 


together. A reference table at the end of the sec- 
tion lists every instruction alphabetically and pro- 
vides execution time, encoded length, and sample 
ASM-89 coding 
for each permissable 
operand 


combination. 
For information 
on how the 8089 


machine instructions are encoded in memory, see 
section 4.3. 


In reading this section, it is important 
to recall 


that 
the instruction 
set does not 
differentiate 


between 
·memory 
addresses 
and 
I/O 
device 


addresses. 
Instructions 
that 
are 
described 
as 


accepting byte and word memory operands may 
also be used to read and write I/O devices. 


These instructions 
move data between memory 


and channel registers. Traditional 
byte and word 


moves 
(including 
memory-to-memory) 
are 
available, 
as are special instructions 
that 
load 
addresses into pointer 
registers and update 
tag 


bits in the process. 


MOV transfers a byte or word from the source to 
the destination. Four instructions are provided: 


MOV 
MOVB 
MOVI 
MOVBI 


Move Word Variable, 
Move Byte Variable, 
Move Word Immediate, 
Move Byte Immediate. 


Figure '3-35 shows how these instructions 
af(ect 
register operands. 
Notice 
that 
when a pointer 


register is specified as the destination 
of a MOV, 


its tag bit is unconditionally 
set to 
1. MOV 


instructions are therefore used to load I/O space 
addresses into pointer registers, 


Tag 
19 
15 
7 
0 


op:~~~on 
~1j~S~~~s 
S S S S S S SiR 
R R R R R R R I 


Word 
rlr-- 
~ 
I 
I 
Operation L 1J LS~ ~ S R R R R R R R R R R R R R R R R 


~xJ0~~~xxxxxxxxITTTTTTTT 
I 


~XJG~~~TTTTTTTTITTTTTTTT 
I 


T = bit is transferred to destination operand 
R = bit is replaced by source operand 
S = bit is sign extension of high-order bit transferred 
X = bit is ignored 
1 = bit is unconditionally set 


MOVP 
(move 
pointer) 
transfers 
a 
physical 


address variable between a pointer register and 
memory. 
If the source is a pointer register, its 


content and tag bit are converted 
to a physical 


address pointer (see figure 3-23). If the source is a 
memory location, the three bytes are converted to 
a 20-bit physical address and a tag value, and are 
loaded into the pointer 
register and its tag bit. 


MOVP 
is typically 
used 
to save and 
restore 


pointer registers. 


LPD (load pointer with doubleword) 
converts a 


doubleword 
pointer (see figure 3-22) to a 20-bit 
physical address and loads it into the destination, 
which must be a pointer 
register. 
The pointer 
register's tag bit is unconditionally 
cleared to 0, 


indicating a system address. Two instructions are 
provided: 


Load 
Pointer 
With 
Doubleword 
Variable 
Load 
Pointer 
With 
Doubleword 
Immediate 


An 8086 or 8088 can pass any address 
in its 


megabyte memory space to a channel program in 
the form of a doubleword 
pointer. The channel 


program can access the location by using LPD to 
load the location address into a pointer register. 


The arithmetic instructions interpret all operands 
as unsigned binary numbers of 8, 16 or 20 bits. 
Signed values may be represented 
in standard 


two's complement 
notation 
with the high-order 


bit representing the sign (O=positive, 1=negative). 
The processor, however, has no way of detecting 
an overflow into a sign bit so this possibility must 
be provided for in the user's software. 


The 8089 performs 
arithmetic 
operations 
to 20 


significant 
bits 
as 
follows. 
Byte 
and 
wmd 


operands are sign-extended to 20 bits (e.g., bit 7 
of a byte operand is propagated 
through bits 8-19 


of an internal register). Sign extension does not 
affect the magnitude of the operand. The opera- 
tion is then performed, 
and the 20-bit result is 


returned to the destination 
operand. 
High-order 
bits are truncated as necessary to fit the result in 
the available space. A carry out of, or borrow 
into, 
the 
high-order 
bit 
of 
the 
result 
is not 
detected. However, if the destination is a register 
that is larger than the source operand, carries will 
be reflected in the upper register bits, up to the 
size of the register. 


Figure 3-36 shows how the arithmetic instructions 
treat registers when they are specified as source 
and destination operands. 


The sum of the two operands replaces the destina- 
tion 
operand. 
Four 
addition 
instructions 
are 
provided: 


ADD 
ADDB 
ADDI 
ADDBI 


Add Word Variable 
Add Byte Variable 
Add Word Immediate 
Add Byte Immediate 


The destination is incremented by I. Two instruc- 
tions are available: 


INC 
INCB 
Increment Word 
Increment Byte 


The destination 
is decremented 
by I. Word and 
byte instructions are provided: 


DEC 
DECB 
Decrement Word 
Decrement Byte 


Logical and Bit Manipulation 
Instructions 


The 
logical 
instructions 
include 
the 
boolean 
operators AND, OR and NOT. Two bit manipu- 
lation 
instructions 
are provided 
for setting or 


Byte 
Operation 
~XJ~~'§:"RJRRRRRRRRIRRRRRRRR 
I 
~x.J~~~~xxxxxxxxlpppppppp 
I 


Word 
Operation 
~xJ~~~~RRRRRRRRIRRRRRRRR 
I 
~x~~~~~pppppppplpppppppp 
I 


x = bit is ignored in operation 
R = bit is replaced by operation result 
p = bit participates in operation 


clearing a single bit in memory or in an I/O device 
register. 
As shown 
in figure 
3-37, the logical 
operations 
always leave the upper four bits of 
20-bit destination 
registers undefined. 
These bits 
should not be assumed to contain reliable values 
or the same values from one operation 
to the 
next. Notice also that when a register is specified 
as the destination 
of a byte operation, 
bits 8-15 
are overwritten by bit 7 of the result. Bits 8-15 can 
be preserved 
in AND and OR instructions 
by 
using word operations in which the upper byte of 
the source operand is FFH or OOH,respectively. 


The two operands 
are logically ANDed and the 
result replaces the destination 
operand. 
A bit in 
the result is set if the bits in the corresponding 
positions of the operands are both set, otherwise 
the result bit is cleared. 
The following 
AND 
instructions are available: 


AND 
ANDB 
ANDI 
ANDBI 


Logical AND Word Variable 
Logical AND Byte Variable 
Logical AND Word Immediate 
Logical AND Byte Immediate 


AND is useful when more than one bit of a device 
register must be cleared while leaving the remain- 
ing bits intact. 
For example, 
ANDing an 8-bit 
register with EEH only clears bits 0 and 4. 


The two operands 
are logically ORed, 
and the 
result replaces the destination 
operand. 
A bit in 
the result is set if either or both of the correspond- 
ing bits of the operands are set; if both operand 
bits are cleared, 
the result bit is cleared. 
Four 
types of OR instructions are provided: 


OR 
ORB 
ORI 
ORBI 


Logical OR Word Variable 
Logical OR Byte Variable 
Logical OR Word Imm.ediate 
Logical OR Byte Immediate 


OR can be used to selectively set multiple bits in a 
device register. 
For 
example, 
ORing 
an 8-bit 
register with 30H sets bits 4 and 5, but does not 
affect the other bits. 


Tag 
19 
15 
7 
0 


Byte 
r. - - - 
Operation 
[~~ 
~ ~~s 
s s s s s S siR 
R R R R R R R I 


Word 
r:l~--~ 
I 
Operation 
L ~ 
~ ~ 
~ U R R R R R R R R R R R R R R R R I 


[ xJ ~ S~ ~ X X X X X X X XIp p p p p p p p I 


X = bit is ignored 
in operation 


U = bit is undefined 
following 
operation 


R = bit participates 
in operation 
and is replaced 
by result 


S = bit is sign-extension 
of high-order 
result 
bit 


P = bit participates 
in operation, 
but is unchanged 


NOT inverts the bits of an operand. 
If a single 
operand is coded, the inverted result replaces the 
original value. If two operands 
are coded, 
the 
inverted bits of the source replace the destination 
value (which must be a register), but the source 
retains its original value. In addition to these two 
operand forms, separate mnemonics are provided 
for word and byte values: 


NOT 
NOTB 
Logical NOT Word 
Logical NOT Byte 


NOT followed 
by INC will negate (create the 


two's complement of) a positive number. 


The bit-select operand 
specifies one bit in the 
destination, which must be a memory byte, that is 
unconditionally 
set to I. A bit-select value of 0 


specifies the low-order bit of the destination while 
the high-order bit is set if bit-select is 7. SETB is 
handy for setting a single bit in an 8-bit device 
register. 


CLR operates exactly like SETB except that the 
selected bit is unconditionally 
cleared to O. 


Register TP controls the sequence in which chan- 
nel program 
instructions 
are executed. As each 


instruction is executed, the length of the instruc- 
tion is added to TP so that it points to the next 
sequential 
instruction. 
The 
program 
transfer 
instructions can alter this sequential execution by 
adding a signed displacement 
value to TP. The 
displacement is contained in the program transfer 
instruction 
and may be either 8 or 16 bits long. 
The displacement is encoded in two's complement 
notation, and the high-order bit indicates the sign 
(O=positive displacement, 
I=negative 
displace- 
ment). 
An 
8-bit 
displacement 
may 
cause 
a 
transfer to a location in the range -128 through 
+127 bytes from the end of the transfer instruc- 
tion, while a 16-bit displacement 
can transfer to 


any location 
within 
-32,768 
through 
+32,767 
bytes. An instruction containing an 8-bit displace- 
ment is called a short transfer and an instruction 
containing a 16-bit displacement 
is called a long 


transfer. 


The program transfer instructions 
have alternate 


mnemonics. If the mnemonic begins with the let- 
ter "L," 
the transfer is long, and the distance to 


the 
transfer 
target 
is expressed 
as 
a 
16-bit 


displacement 
regardless 
of how 
far 
away the 


target is located. If the mnemonic does not begin 
with "L," 
the ASM-89 assembler may build a 


short or long displacement 
according 
to rules 


discussed in section 3.9. 


The "self-relative" 
addressing technique used by 


program transfer instructions 
has two important 


consequences. 
First, 
it 
promotes 
position- 
independent code, i.e., code that can be moved in 
memory 
and 
still execute correctly. 
The only 


restriction here is that the entire program must be 
moved as a unit so that the distance between the 
transfer 
instruction 
and 
its 
target 
does 
not 


change. Second, the limited addressing range of 
these instructions 
must be kept in mind when 


designing large (over 32k bytes of code) channel 
programs. 


CALL invokes an out-of-line routine, saving the 
value of TP so that the subroutine 
can transfer 
back to the instruction following the CALL. The 
instruction stores TP and its tag bit in the TPsave 
operand, 
which 
must 
be 
a 
physical 
address 


variable, and then transfers to the target address 
formed by adding the target operand's 
displace- 
ment to TP. The subroutine 
can return 
to the 


instruction 
following 
the 
CALL 
by 
using 
a 


MOVP instruction to load TPsave back into TP. 


Notice that the 8089's facilities for implementing 
subroutines, 
or procedures, 
is less sophisticated 
than its counterparts 
in the 8086/8088. The prin- 


cipal difference is that the 8089 does not have a 
built in stack mechanism. 
8089 programs 
can 


implement a stack using a base register as a stack 
pointer. 
On the other hand, 
since channel pro- 
grams are not subject to interrupts, 
a stack will 


not be required for most channel programs. 


JMP causes an unconditional 
transfer (jump) to 


the target location. 
Since the task pointer is not 


saved, no return to the instruction 
following the 


JMP is implied. 


JZ (jump if zero) effects a transfer to the target 
location if the source operand is zero; otherwise 
the instruction 
following JZ is executed. Word 


and 
byte 
values 
may 
be 
tested 
by alternate 


instructions: 


JZ/LJZ 
JZB/LJZB 
Jump/Long 
Jump if Word Zero 
Jump/Long 
Jump if Byte Zero 


If the source operand is a register, only the low- 
order 16 bits are tested; any additional high-order 
bits in the register are ignored. To test the low- 
order byte of a register, clear bits 8-15 and then 
use the word form of the instruction. 


JNZ operates exactly like JZ except that control is 
transferred 
to the target if the source operand 
does not contain all a-bits. Word and byte sources 
may be tested using these mnemonics: 


JNZ/LJNZ 
Jump/Long 
Jump if Word Not 
Zero 


JNZB/LJNZB 
Jump/Long 
Jump 
if Byte Not 
Zero. 


This instruction (jump if masked compare equal) 
effects a transfer 
to the target location jf the 


source (a memory byte) is equal to the lower byte 
in register MC as masked by the upper byte in 
Me. 
Figure 
3-15 illustrates 
how a-bits 
in the 


upper half of MC cause the corresponding 
bits in 


the lower half of MC and the source operand to 
compare equal, regardless of their actual values. 
For example, if bits 8-15 of MC contain the value 
a 1H, then the transfer will occur if bit a of the 
source and register MC are equal. This instruction 
is useful for testing multiple bits in 8-bit device 
registers. 


This instruction causes a jump to the target loca- 
tion if the source 
is not equal 
to the mask/ 


compare value in MC. It otherwise operates iden- 
tically to JMCE. 


JBT (jump if bit true) tests a single bit in the 
source operand and jumps to the target if the bit 
is a I. The source must be a byte in memory or in 
an 110 device register. The bit-select value may 
range from a through 7, with a specifying the low- 
order bit. This instruction 
may be used to test a 


bit in an 8-bit device register. If the target is the 
JBT instruction 
itself, the operation 
effectively 


becomes "wait until bit is a." 


This instruction operates exactly like JBT, except 
that the transfer is made if the bit is not true, Le., 
if the bit is a. 


These instructions 
enable channel 
programs 
to 


control lOP hardware facilities such as the LOCK 
and SINTRI-2 
pins, logical bus width selection, 


and the initiation of a DMA transfer. 


Figure 3-38 illustrates the operation 
of the TSL 


(test and set while locked) instruction. TSL can be 
used to implement 
a semaphore 
variable 
that 


controls 
access 
to 
a 
shared 
resource 
in 
a 


multiprocessor 
system (see section 2.5). If the 


target operand 
specifies the address of the TSL 


instruction, 
the instruction 
is repetively executed 


until the semaphore (destination) is found to con- 
tain zero. Thus the channel 
program 
does not 


proceed until the resource is free. 


WID (set logical bus widths) alters bits a and 1 of 
the PSW, thus specifying logical bus widths for a 
DMA transfer. The operands may be specified as 


ASSIGN· 


SET·VALUETO 
DESTINA 
TION 


8 or 16 (bits), with the restriction that the logical 
width of a bus cannot exceed its physical width. 
The logical bus widths are undefined following a 
processor RESET; therefore the WID instruction 
must 
be 
executed 
before 
the 
first 
transfer. 
Thereafter 
the logical widths retain their values 
until 
the 
next 
WID 
instruction 
or 
processor 
RESET. 


XFER (enter DMA transfer mode after following 
instruction) 
prepares 
the channel 
for a DMA 
transfer 
operation. 
In a synchronized 
transfer, 


the instruction 
following XFER may ready the 
synchronizing device (e.g., send a "start" 
com- 
mand or the last of a series of parameters). 
Any 
instruction, including NOP and WID, may follow 
XFER, except an instruction 
that alters GA, GB 
or Gc. 


This instruction sets the interrupt service bit in the 
PSW and activates the channel's 
SINTR line if 
the interrupt control bit in the PSW is set. If the 


interrupt 
control bit is cleared (interrupts 
from 


this channel are disabled), the interrupt service bit 
is set, but SINTRI-2 
is not activated. A channel 


program 
may use this instruction 
to interrupt 
a 
CPU. 


This instruction 
consumes clock cycles but per- 


forms no operation. As such, it is useful in timing 
loops. 


This instruction 
concludes 
a channel 
program. 
The channel clears its BUSY flag and then idles. 


Table 
3-16 
lists 
every 
8089 
instruction 


alphabetically 
by its ASM-89 
mnemonic. 
The 


ASM-89 coding format is shown (see table 3-14 
for an explanation 
of operand identifiers) along 


with the instruction name. For every combination 
of operand 
types (see table 3-15 for key), the 


instruction's 
execution 
time and 
its length 
in 


bytes, and a coding example are provided. 


The instruction timing figures are the number of 
clock periods required to execute the instruction 
with 
the 
given combination 
of 
operands. 
At 


5 MHz, one clock period is 200 ns; at 8 MHz a 
clock period is 125 ns. Two timings are provided 
when an instruction operates on a memory word. 
The first (lower) figure indicates execution time 
when the word is aligned on an even address and 
is accessed over a 16-bit bus. The second figure is 
for odd-addressed 
words on 16-bit buses and any 


word accessed via an 8-bit bus. 


Instruction 
fetch time is shown in table 3-17 and 
should be added to the execution times shown in 
table 3-16 to determine how long a sequence of 
instructions will take to run. (Section 3.2 explains 
the effect 
of the instruction 
queue 
on 
16-bit 


instruction 
fetches.) External delays such as bus 


arbitration, 
wait states and activity on the other 


channel will increase the elapsed time over the 
figures shown 
in tables 
3-16 and 
3-17. These 


delays are application dependent. 


data transfer, 
arithmetic, 
bit manipulation 


~ata transfer, 
arithmetic, 
bit manipulation 


A register or memory location that may contain data operated on 
by the instruction, and which receives (is replaced by) the result 
of the operation. 


A register, memory location, or immediate value that is used in 
the operation, but is not altereaby the instruction. 


A 24-bit memory location where the address of the next sequen- 
tial instruction is to be saved. 


Specification of a bit location within a byte; O=least-significant 
(rightmost) bit, 7=most-significant (leftmost) bit. 


8089 INPUT/OUTPUT 
PROCESSOR 
EXPLANATION 


(no operands) 


register 


ptr-reg 


immed8 


immed16 


mem8 


mem16 


mem24 


mem32 


label 


short-label 


0-7 


8/16 


No operands are written 


Any general register 


A pointer register 


A constant in the range O-FFH 


A constant in the range O-FFFFH 


An 8-bit memory location (byte) 


A 16-bit memory location (word) 


A 24-bit memory location (physical address pointer) 


A 32-bit memory location (doubleword 
pointer) 


A label within -32,768 to +32,767 bytes of the end of the instruction 


A label within -128 to +127 bytes of the end of the instruction 


A constant in the range: 0-7 


The constant 8 or the constant 16 


ADD 
destination, 
source 
Add Word Variable 


Operands 
Clocks 
Bytes 
Coding Example 


register, mem16 
11/15 
2-3 
ADD BC, [GA).LENGTH 
mem16, register 
,- 
16/26 
2-3 
ADD [GB}, GC 


ADDS 
destination, 
source 
Add Byte Variable 


Operands 
Clocks 
Bytes 
Coding Example 


register, mem8 
11 
2-3 
ADDB GC, [GA).N_CHARS 
mem8, register 
16 
2-3 
ADDB [PP).ERRORS,MC 


ADDSI 
destination, 
source 
Add Byte Immediate 


Operands 
Clocks 
Bytes 
Coding Example 


register, immed8 
3 
3 
ADDBI MC,10 
mem8, immed8 
16 
3-4 
ADDBI [PP+IX+).RECORDS,2CH 


< 


ADDI 
destination, 
source 
Add Word Immediate 


Operands 
Clocks 
Bytes 
Coding Example 


register, immed16 
3 
4 
ADDI GB,OC25BH 
mem16, immed16 
16/26 
4-5 
ADDI [GB).POINTER,5899 
. 


AND 
destination, 
source 
Logical AND Word Variable 


Operands 
Clocks 
Bytes 
Coding 
Example 


register, mem16 
11/15 
2-3 
AND MC, [GA].FLAG_WORD 
mem16, register 
16/26 
2-3 
AND [GCl.STATUS, BC 


ANDB 
destination, 
source 
Logical AND Byte Variable 


Operands 
Clocks 
Bytes 
Coding 
Example 


register, mem8 
11 
2-3 
AND BC, [GCl 
mem8, register 
16 
2-3 
AND [GA+IX].RESULT, GA 
- 


ANDBI 
destination, 
source 
Logical AND Byte Immediate 


Operands 
Clocks 
Bytes 
Coding 
Example 


register, immed8 
3 
3 
GA,01100000B 
- 
mem8, immed8 
16 
3-4 
[GC+IX], 2CH 


ANDI 
destination, 
source 
Logical AND Word Immediate 


Operands 
Clocks 
Bytes 
Coding 
Example 


register, immed16 
3 
4 
IX,OH 
mem16, immed16 
16/26 
4-5 
[GB+IX].TAB,40H 


CALL 
TPsave, target 
Call 
I 


Operands 
Clocks 
Bytes 
Coding 
Example 


mem24, label 
17/23 
3-5 
CALL [GC+IX].SAVE, GET_N EXT 


CLR 
destination, 
bit select 
Clear Bit To Zero 


Operands 
Clocks 
Bytes 
Coding 
Example 


mem8,0-7 
16 
2-3 
CLR [GA], 3 
- 


DEC 
destination 
Decrement Word By 1 


Operands 
Clocks 
Bytes 
Coding 
Example 


register 
3 
2 
mem16 
16/26 
2-3 
DEC [PP].RETRY 


y 


DECB 
destination 
Decrement Byte By 1 


Operands 
Clocks 
Bytes 
Coding 
Example 


mem8 
... 
16 
2-3 
DECB [GA+IX+].TAB 


HLT 
(no operands) 
. 
Halt Channel Program 


Operands 
Clocks 
Bytes 
Coding 
Example 


(no operands) 
, 
11 
2 
HLT 
, 


INC 
- 


destination 
Increment Word by 1 


Operands 
Clocks 
Bytes 
Coding 
Example 


register 
3 
2 
INC GA 
mem16 
16/26 
2-3 
INC [GA).COUNT 


INCB 
destination 
Increment Byte by 1 


Operands 
- . 


Clocks 
Bytes 
Coding 
Example 
r 
" 


mem8 
, 
16 
2-3 
INCB [GB].POINTER 


JBT 
source, bit-select, target 
Jump if Bit True (1) 


Operands 
Clocks 
Bytes 
Coding 
Example 


mem8, 0-7, label 
14 
3-5 
JBT [GA].RESULLREG, 3, DATA_VALID 


JMCE 
source, target 
Jump if Masked Compare Equal 


Operands 
Clocks 
Bytes 
Coding 
Example 


mem8, label 
14 
3-5 
JMCE [GB].FLAG, STOP_SEARCH 


JMCNE 
source, target 
Jump if Masked Compare Not Equal 


Operands 
Clocks 
Bytes 
Coding 
Example 


mem8,Iabei 
14 
3-5 
JMCNE [GB+IXJ, NEXT_ITEM 


JMP 
- 
target 
Jump Unconditionally 


Operands 
Clocks 
Bytes 
Coding 
Example 


label 
3 
3-4 
JMP READ_SECTOR 


J N BT 
source, bit-select, 
target 


Operands 


mem8, 0-7, label 


J NZ 
source, target 


Operands 


register, label 
mem16, label 
5 
12/16 


JNZB 
source, target 


Operands 


JZ 
source, target 


Operands 


register, label 
mem16, label 
5 
12/16 


Jump if Bit Not True (Ot 


Bytes 
Coding Example 


3-5 
JNBT [GCl. 3, RE_READ 


Jump if Word Not Zero 


Bytes 
Coding Example 


3-4 
JNZ BC, WRITE_LINE 
3-5 
JNZ [PP].NUM_CHARS, 
PUT_BYTE 


Jump if Byte Not Zero 


Bytes 
Coding Example 


3-5 
JNZB [GAl. MORE_DATA 


Jump if Word is Zero 


Bytes 
Coding Example 


3-4 
JZ BC, NEXT_LINE 
3-5 
JZ [GC+IX].INDEX, BUF_EMPTY 


. 
. 


JZB 
source, target 
Jump if Byte Zero 


Operands 
. 
Clocks 
Bytes 
Coding Example 


mem8,Iabei 
12 
3-5 
JZB [PP]~L1NES_LEFT, RETURN 


LCALL 
TPsave, target 
Long Call 


Operands 
Clocks 
Bytes 
Coding Example 


mem24, label 
17/23 
4-5 - 
LCALL [GC].RETURN_SAVE,INIT_8279 


LJBT 
source, bit-select, target 
; 
Long Jump if Bit True (1) 


Operands 
Clocks 
Bytes 
Coding Example 


mem8, 0-7, label 
14 
4-5 
LJBT [GA].RESULT, 1, DATA_OK 


LJMCE 
source" target 
Long jump if Masked Compare Equal 


Operands 
- 
Clocks 
Bytes 
Coding Example 


mem8, label 
14 
4-5 
LJMCE [GBl. BYTE_FOUND 


LJMCNE 
source, target 
Long jump if Masked Compare Not Equal 


Operands 
Clocks 
Bytes 
Coding 
Example 


mem8, label 
14 
4-5 
LJMCNE [GC+IX+], SCAN_NEXT 


LJMP 
" 
target 
Long Jump Unconditional 


Operands 
Clocks 
Bytes 
Coding 
Example 


label 
3 
4 
LJMP GET_CURSOR 
~. 


LJNBT 
source, bit-select, target 
Long Jump if Bit Not True (0) 


Operands 
Clocks 
Bytes 
Coding 
Example 


mem8, 0-7, label 
14 
4-5 
LJNBT [GC], 6, CRCC_ERROR 


LJNZ 
source, target 
Long Jump if Word Not Zero 


Operands 
Clocks 
Bytes 
Coding 
Example 


register, label 
5 
4 
LJNZ BC, PARTIAL_XMIT 
mem16, label 
12/16 
4-5 
LJNZ [GA+IX].N_LEFT, 
PUT_DATA 


LJNZB 
source, target 
Long Jump if Byte Not Zero 


Operands 
Clocks 
Bytes 
Coding 
Example 


mem8, label 
12 
4-5 
LJNZB [GB+IX+].ITEM, BUMP_COUNT 


LJZ 
source, target 
Long Jump if Word Zero 


Operands 
Clocks 
Bytes 
Coding 
Example 


register, label 
5 
4 
LJZ IX, FIRST_ELEMENT 
mem16, label 
12/16 
4-5 
LJZ [GB].XMIT_COUNT, 
NO_DATA 
, 


LJZB 
source, target 
Long Jump if Byte Zero 


Operands 
Clocks 
Bytes 
Coding 
Example 


mem8, label 
12 
4-5 
LJZB [GA], RETURN_LINE 


LPD 
destination, 
source 
Load Pointer With Doubleword Variable 


Operands 
Clocks 
Bytes 
Coding 
Example 


ptr-reg, mem32 
20/28' 
2-3 
LPD GA, [PP].BUF _START 


LPDI 
destination, 
source 
Load Pointer With Doubleword Immediate 


Operands 
Clocks 
Bytes 
Coding Example 


ptr-reg, immed32 
12/16" 
6 
LPDI GB, DISK_ADDRESS 


MOV 
destination, 
source 
Move Word 


Operands 
Clocks 
Bytes 
Coding Example 


register, mem16 
8/12 
2-3 
MOV IX, [GCl 
mem16, register 
10/16 
2-3 
MOV [GA].COUNT, BC 
mem16, mem16 
18/28 
4-6 
MOV [GA].READING, [GB] 


MOVB 
destination, 
source 
Move Byte 


Operands 
Clocks 
Bytes 
Coding Example 


register, mem8 
8 
2-3 
MOVB BC, [PP].TRAN_COUNT 
mem8, register 
10 
2-3 
MOVB [PP].RETURN_CODE, 
GC 
mem8, mem8 
18 
4-6 
MOVB [GB+IX+], [GA+IX+] 


MOVBI 
destination, 
source 
Move Byte Immediate 


Operands 
Clocks 
Bytes 
Coding Example 


register, immed8 
3 
3 
MOVBI MC, 'A' 
mem8, immed8 
12 
3-4 
MOVBI [PP].RESUL T, 0 


MOVI 
destination, 
source 
Move Word Immediate 


Operands 
Clocks 
Bytes 
Coding Example 


register, immed16 
3 
4 
MOVI BC,O 
mem16, immed16 
12/18 
4-5 
MOVI [GB], OFFFFH 


MOVP 
destination, 
source 
Move Pointer 


Operands 
Clocks 
Bytes 
Coding Example 


ptr-reg, mem24 
19/27" 
2-3 
MOVP TP, [GC+IX] 
mem24, ptr-reg 
16/22" 
2-3 
MOVP [GB].SAVE_ADDR, 
GC 


NOP 


I 


(no operands) 
No Operation 


Operands 
Clocks 
Bytes 
Coding Example 


(no operands) 
4 
2 
NOP 


NOT 
destination /destination, 
sou rce 
Logical NOT Word 


Operands 
Clocks 
Bytes 
Coding Example 


register 
3 
2 
NOT MC 
mem16 
16/26 
2-3 
NOT [GA].PARM 
register, mem16 
11/15 
2-3 
NOT BC, [GA+IX].L1NES_LEFT 


NOTB 
destination /destination, 
source 
Logical NOT Byte 


Operands 
Clocks 
Bytes 
Coding Example 


mern8 
16 
2-3 
NOTB [GA].PARM_REG 
register, mem8 
11 
2-3 
NOTB IX, [GB].STATUS 


OR 
destination, 
source 
Logical OR Word 


Operands 
Clocks 
Bytes 
Coding Example 


register, mem16 
11/15 
2-3 
OR MC, [GC].MASK 
mem16, register 
( , 
16/26 
2-3 
OR [GCl. BC 


ORB 
destination, 
source 
Logical OR Byte 


Operands 
-- 
Clocks 
Bytes 
Coding Example 


register, mem8 
11 
2-3 
ORB IX, [PP].POINTER 
mem8, register 
16 
2-3 
ORB [GA+IX+l. GB 


( 


ORBI 
destination, 
source 
Logical OR Byte Immediate 


Operands 
Clocks 
Bytes 
Coding Example 


register, immed8 
3 
3 
ORBI IX, 00010001B 
mem8, immed8 
16 
3-4 
ORBI [GB].COMMAND,OCH 


ORI 
destination, 
source 
Logical OR Word Immediate 
-c 


Operands 
- 
Clocks 
Bytes 
Coding Example 


register, immed16 
3 
4 
ORI MC, OFFODH 
mem16,immed16 
16/26 
4-5 
ORI [GAl. 1000H 
-- 
- 


SETB 
destination, 
bit-select 
Set Bit to 1 


Operands 
Clocks 
Bytes 
Coding Example 


mem8,0-7 
16 
2-3 
SETB [GA].PARM_REG,2 


SINTR 
(no operands) 
Set Interrupt Service Bit 


Operands 
Clocks 
Bytes 
Coding Example 


(no operands) 
4 
2 
SINTR 


- 


TSL 


- 


destination, 
set-value, target 
Test and Set While Locked 


Operands 
Clocks 
Bytes 
Coding Example 


mem8, immed8, short-label 
14/16- 
4-5 
TSL [GAl.FLAG, OFFH, NOT_READY 


WID 
source-width, 
dest-width 
Set Logical Bus Widths 


Operands 
Clocks 
Bytes 
Coding Example 


8/16,8/16 
4 
2 
WID 8,8 


XFER 


, 


Enter DMA Transfer Mode After Next Instruction 
(no operands) 


Operands 
Clocks 
Bytes 
Coding Example 


(no operands) 
4 
2 
XFER 


Table 3-17. Instruction 
Fetch Timings 
(Clock Periods) 


BUSWIDTH 
INSTRUCTION 
LENGTH 
16 


(BYTES) 
8 
(1) 
(2) 


2 
14 
7 
11 
3 
18 
14 
11 
4 
22 
14 
15 
5 
26 
18 
15 


(1) 
First 
byte 
of 
instruction 
is 
on 
an 
even 
address. 


(2) 
First byte of instruction 
is on an odd address. 
Add 3 clocks if first byte is not in queue (e.g., 
first instruction 
following 
program transfer). 


8089 instruction operands may reside in registers, 
in the instruction 
itself or in the system or 1/0 


address spaces. Operands 
in the system and 1/0 


spaces may be either memory locations 
or 1/0 


device registers and may be addressed in four dif- 
ferent ways. This section describes how the chan- 


nel processes different types of operands and how 
it calculates addresses using its addressing modes. 
Section 3.9 describes 
the ASM-89 conventions 


that programmers 
use to specify these operands 


and addressing modes. 


Registers may be specified as source or destina- 
tion operands in many instructions. 
Instructions 


that 
operate 
on 
registers 
are 
generally 
both 
shorter and faster than instructions 
that specify 


immediate or memory operands. 


Immediate 
operands 
are 
data 
contained 
in 


instructions rather than in registers or in memory. 
The data may be either 8 or 16 bits in length. The 
limitations of immediate operands 
are that they 


may only serve as source operands and that they 
are constant values. 


Whereas the channel has direct access to register 
and immediate operands, 
operands in the system 


and 1/0 space must be transferred to or from the 
lOP 
over the bus. To do this, the lOP 
must 


calculate the address of the operand, 
called its 


effective 
address 
(EA). 
The programmer 
may 
specify that an operand's 
address be calculated in 
any of four different ways; these are the 8089's 
memory addressing modes. 


An operand in the system space has a 20-bit effec- 
tive address, and an operand in the I/O space has 
a 16-bit effective address. 
These addresses 
are 
unsigned numbers that represent the distance (in 
bytes) of the low-order byte of the operand from 
the beginning of the address space. Since the 8089 
does not "see" 
the segmented structure 
of the 
system space that it may share with an 8086 or 
8088, 8089 effective addresses are equivalent to 
8086/8088 
physical addresses. 


All memory addressing modes use the content of 
one of the pointer registers, and the state of that 
register's tag bit determines whether the operand 
lies in the system or the I/O space. If the operand 
is in the I/O space (tag = 1), bits 16-19 of the 
pointer 
register 
are 
ignored 
in 
the 
effective 
address calculation. 
Section 4.3 describes the two 
fields (AA and MM) in the encoded 
machine 
instruction that specify addressing mode and base 
(pointer) register. 


In based addressing 
(figure 3-39), the effective 
address is taken directly from the content of GA, 
GB, GC or PP. Using this addressing mode, one 
instruction 
may access different 
locations if the 
register is updated before the instruction executes. 
LPD,. MOV, MOVP or arithmetic 
instructions 
might be used to change the value of the base 
register. 


In this mode (figure 3-40) an 8-bit unsigned value 
contained in the instruction 
is added to the con- 
tent of a base register 
to form 
the effective 
address. The offset mode provides a convenient 
way 
to 
address 
elements 
in 
structures 
(a 
parameter 
block is a typical example of a struc- 
ture). As shown in figure 3-41, a base register can 
be pointed at the base (first element) in the struc- 
ture, and then different 
offsets can be used to 
access the 
elements 
within 
the 
structure. 
By 
changing the base address, the same structure can 
be relocated elsewhere in memory. 


An indexed address is formed by adding the con- 
tent of register IX (interpreted 
as an unsigned 
quantity) 
to a base register as shown in figure 
3-42. Indexed addressing is often used to access 


HIGH ADDRESSES 


ERROR 
I 
L1NECT 


BUFF_PTR 


POSITIONI 
CURSOR 


END_BUS 


LOW ADDRESSES 


r- 
I 
r---+O 
I 
I 
I 
I 
I 
I 
I 
I 


I 
EA 
I 
I 
I 
L 
~ 


array elements (see figure 3-43). A base register 
locates the beginning of the array and the value in 
IX selects one element, i.e., it acts as the array 
subscript. 
The ith element 
of a byte array 
is 
selected when IX contains (i - 1). To access the 
ith element of a word array, IX should contain 
«j-l)*2). 


In this variation of indexed addressing, the effec- 
tive address is formed by summing IX and a base 
register, 
and then IX is incremented 
automat- 
ically. (See figure 3-44.) The addition takes place 


after the EA is calculated. IX is incremented by 1 
for a byte operation, 
by 2 for a word operation 
and by 3 for a MOVP instruction. This addressing 


mode is very useful for "stepping 
through" 
suc- 
cessive elements of an array (e.g., a program loop 
that sums an array). 


GA 


OR 
GB 
+-1 
OR 
IX 
GC 


OR 
pp 


r 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
L..- 
_ 


1WORD 


LOW ADDRESSES 


GA 


OR 
GB r 


OR 
IX 
GC 


OR 
• 
pp 
I 
I 
I 


I 


I 
EA 
I 
I 


I 


I 


~ 


IX 
DELTA 


The compatibility 
of the 8089 with the 8086 and 
8088 extends 
beyond 
the 
hardware 
interface. 


Comparing figure 3-45, with figure 2-45, one can 
see that, except for the translate step, the software 
development 
process 
is 
identical 
for 
both 
8086/8088 
and 
8089 programs. 
The 
ASM-89 
assembler produces a relocatable 
object module 
that is compatible with the 8086 family software 
development utilities LIB-86, LINK-86, LOC-86 
and OH-86, described in section 2.9. All of these 
development 
tools run on an Intellec® 800 or 
Series II microcomputer 
development system. 


This section surveys the facilities of the ASM-89 
assembler 
and 
discusses 
how 
LINK-86 
and 
LOC-86 can be used in 8089 software develop- 
ment. 
For a complete 
description 
of the 8089 
assembly 
language, 
consult 
8089 
Assembly 


Language User's Guide, 
Order 
No. 
9800938, 


available from Intel's Literature Department. 


The ASM-89 assembler reads a disk file contain- 
ing 8089 assembly language statements, translates 
these statements into 8089 machine instructions, 
and writes the result into a second disk file. The 
assembly input is called a source module, and the 
principal output 
is a relocatable 
object module. 


The assembler also produces a file that lists the 
module and flags any errors detected during the 
assembly. 


Statements 
are the building blocks of ASM-89 


programs. Figure 3-46 shows several examples of 
ASM-89 statements. The ASM-89 assembler gives 
programmers 
considerable 
flexibility in format- 


ting program 
statements. 
Variable 
names 
and 


labels (identifiers) 
may be up to 31 characters 


long, the underscore (_) 
character may be used 


to improve the readability of longer names (e.g., 


WAIT_UNTIL_READY). 
The 
component 
parts of statements (fields) need not be located at 
particular 
"columns" 
of 
the 
statement. 
Any 
number of blank characters 
may separate fields 


and multiple identifiers within the operand field. 
Long statements may be continued onto the next 
link by coding an ampersand 
(&) as the first 


character of the continued line. 


; THIS STATEMENT 
CONTAINS 
A COMMENT 
FIELD ONLY 
ADDI 
BC,5 
; TYPICAL 
ASM89 INSTRUCTION 
ADD I 
BC, 
5 
; NO "COLUMN" 
REQUIREMENTS 
MOV 
[GA].STATUS, 


& 
6 
; A CONTINUED 
STATEMENT 
SOURCE 
EQU GA 
; A SIMPLE 
ASM89 DIRECTIVE 
L1NE_BUFFER_ADDRESS 
DO 
; A LONG IDENTIFIER 


semicolon is a comment 
statement. 
Comments 
have no affect on program execution and, in fact, 
are ignored 
by the ASM-89 assembler. 
Never- 


theless, carefully selected comments are included 
in all well written ASM-89 programs. They sum- 
marize, annotate and clarify the logic of the pro- 
gram 
where 
the 
instructions 
are 
too 


"microscopic" 
to make the operation of the pro- 


gram self-evident. 


An ASM-89 instruction 
statement 
(figure 3-47) 


directs the assembler to build an 8089 machine 
instruction. 
The optional 
label field assigns a 


symbolic 
identifier 
to 
the 
address 
where 
the 


instruction will be stored in memory. A labelled 
instruction 
can 
be 
the 
target 
of 
a 
program 


transfer; the transferring 
instruction specifies the 


label for its target operand. 
In figure 3-47 the 


labelled 
instruction 
conditionally 
transfers 
to 


itself; the program will loop on this one instruc- 


lGAJ.STATUS is not true. The mnemOnICheld ot 
an instruction statement specifies the type of 8089 
machine instruction that the assembler is to build. 


The operand 
field may contain no operands 
or 


one or more operands as required by the instruc- 
tion. Multiple operands are separated by commas 
and, optionally, by blanks. Any instmction state- 
ment may contain 
a comment 
field (comment 


fields are initiated by a semicolon). 


An ASM-89 directive statement (figure 3-48) does 
not produce an 8089 machine instruction. 
Rather, 


a directive gives the assembler information 
to use 


during the assembly. For example, the DS (define 
storage) directive in figure 3-48 tells the assembler 
to reserve 80 bytes of storage and to assign a sym- 
bolic identifier (INPUT_BUFFER) 
to the first 


(lowest-addressed) byte of this area. The ASM-89 
assembler accepts 14 directives; the more com- 
monly used directives are discussed in this section. 


IIGAI,STArS.,.DEMO! 
~TILREADY 
LCOMMENT 
(OPTIONALl 


OPERANDS 
(REQUIRED/PROHIBITED) 


MNEMONIC 
(REQUIRED) 


LABEL (OPTIONAL) 


COMMENT 
(OPTIONAL) 


OPERANDS 
(REQUIRED/PROHIBITED) 


MNEMONIC 
(REQUIRED) 


LABELINAME 
(REQUIRED/PROHIBITED) 


The first field in a directive may be a label or a 
name; individual directives may require or pro- 
hibit names, while labels are optional 
for direc- 


tives that accept them. A label ends in a colon like 
an instruction statement label. However, a direc- 
tive label cannot be specified as the target of a 
program transfer. A name does not have a colon. 
The second field is the directive mnemonic, and 
the assembler distinguishes 
between instructions 
and 
directives 
by 
this 
field. 
Any 
operands 


required by the directive are written next; multiple 
operands are separated by commas and, option- 
ally, by blanks. A comment may be included in 
any 
directive 
by 
beginning 
the 
text 
with 
a 


semicolon. 


Binary, decimal, octal and hexadecimal numeric 
constants (figure 3-49) may be written in ASM-89 
instructions 
and directives. 
The assembler 
can 


add and subtract 
constants 
at assembly 
time. 


Numeric 
constants, 
including 
the 
results 
of 


arithmetic operations, must be representable in 16 
bits. 
Positive 
numbers 
cannot 
exceed 
65,535 


(decimal); negative numbers, which the assembler 
represents in two's complement notation, 
cannot 


be "more negative" than -32,768 (decimal). 


Character 
constants are enclosed in single quote 


marks 
as 
shown 
in 
figure 
3-49. 
Strings 
of 


characters 
up to 255 bytes long may be. written 


when initializing storage. 
Instruction 
operands, 
however, can only be one or two characters long 
(for byte and word instructions respectively). 


As an aid to program clarity, The EQU (equate) 
directive may be used to give names to constants 
(e.g., DISK_STATUS 
EQU OFF20H). 


Four ASM-89 directives reserve space for memory 
variables 
in the ASM-89 
program 
(see figure 


3-50). The DB, DW and DD directives allocate 
units of bytes, words and doublewords, 
respec- 
tively, initialize the locations, and optionally label 
them so that they may be referred to by name in 
instruction 
statements. 
The label of a storage 


directive 
'always 
refers 
to 
the 
first 
(lowest- 


addressed)' 
byte 
of 
the 
area 
reserved 
by the 


directive. 


The DB and DW directives may be used to define 
byte- and word-constant 
scalars (individual data 


items) and arrays (sequences of the same type of 
item). For example, a character 
string constant 


could be defined as a byte array: 


The DD directive is typically used to define the 
address of a location in the system space, Le., a 
doubleword pointer variable. The address may be 
loaded 
into 
a pointer 
register 
with 
the 
LPD 


instruction. 


The DS directive reserves, and optionally names, 
storage in units of bytes, but does not initialize 
any of the reserved bytes. DS is typically used for 
RAM-based variables such as buffers. As there is 
no special directive for defining a physical address 
pointer, DS is typically used to reserve the three 
bytes used by the MOVP instruction. 


MOVBI 
GA 
'A' 
; CHARACTER 
MOVBI 
GA; 41H 
; HEXADECIMAL 
MOVBI 
GA,65 
; DECIMAL 
MOVBI 
GA,65D 
; DECIMAL 
ALTERNATIVE 
MOVBI 
GA,101Q 
; OCTAL 
MOVBI 
GA,1010 
; OCTAL 
ALTERNATIVE 
MOVBI 
GA, 01000001 B 
; BINARY 


; NEXT TWO STATEMENTS 
ARE EQUIVALENT 
AND 


. 
ILLUSTRATE 
TWO'S 
COMPLEMENT 
REPRESENTATION 


, 
OF NEGATIVE 
NUMBERS 
MOVBI 
GA,-5 
MOVBI 
GA,11111011B 


; ASM89 DIRECTIVE 
ALPHA: 
DB 
1 
DB 
-2 
DB 
'A' 
'B' 


BETA: 
OW 
l' 


OW 
-5 
OW 
'AB' 
OW 
400,500 
OW 
400H, 500H 
gamma: 
OW 
BETA 


DELTA 


ZETA: 


GAMMA 


80 


; MEMORY 
CONTENT 
(HEX) 
. 01 
: FE (TWO'S 
COMPLEMENT) 
; 4142 
. 


; 0100 
;FAFF 
; 4241 
; 2410F401 
; 00040005 
; OFFSET OF BETA ABOVE, 
; FROM BEGINNING 
OF PROGRAM 
; ADDRESS 
(SEGMENT 
& OFFSET) 
;OFGAMMA 
; 80 BYTES, UNINITIALIZED 


An ASM-89 structure is a map or template that 
gives names and relative locations to a collection 
of 
related 
variables 
that 
are 
called 
structure 


elements 
or 
members. 
Defining 
a 
structure, 


however, does not allocate storage. The structure 
is, in effect, 
overlaid 
on a particular 
area of 


memory when one of its elements is used as an 
instruction 
operand. 
Figure 3-51 shows how a 


structure representing a parameter block could be 
defined and then used in a channel program. The 


assembler uses the structure element name to pro- 
duce an offset value (structures are used with the 
offset addressing 
mode). 
Compared 
to "hard· 


coded" offsets, structures improve program clar- 
ity and simplify maintenance. 
If the layout of a 


memory block changes, only the structure defini- 
tion must be modified. 
When the program 
is 


reassembled, all symbolic references to the struc- 
ture are automatically 
adjusted. 
When multiple 


areas of memory are laid out identically, a single 
structure 
can be used to address 
any area by 


changing the content of the pointer (base) register 
that specifies the structure's 
"starting 
address." 


, 


BUFFER_LEN 


BUFFER_START 


COMMAND I 
RESULT 


TP_RESERVED 


, 
LOWER 
ADDRESSES 


PARM_BLOCK 
TP _RESERVED: 
COMMAND: 
RESULT: 
BUFFER_START: 
BUFFER_LEN: 


PARM_BLOCK 


STRUC 
OS 
4 
OS 
1 
OS 
1 
OS 
4 
OS 
2 
ENDS 


Table 3-18 summarizes the notation 
a program- 
mer uses to specify how the effective address of a 
memory operand is to be computed. Examples of 
typical ASM-89 coding for each addressing mode, 
as well as register and immediate operands, 
are 
provided in figure 3-52. Notice that a bracketed 
reference to a register indicates that the content of 
the register is to be used to form the effective 
address 
of 
a 
memory 
operand, 
while 
an 


unbracketed 
register reference specifies that the 


register itself is the operand. 


The 
following 
examples 
summarize 
how 
the 


memory addressing modes can be used to access 
simple variables, structures and arrays. 


• 
If GA contains 
the address 
of a memory 


operand, then [GA] refers to that operand. 


• 
If 
GA 
contains 
the 
base 
address 
of 
a 


structure, 
then 
[GA].DATA 
refers 
to the 
DAT A element (field) in that structure. 
If 
DATA is six bytes from the beginning of the 
structure, 
then 
[GA].6 refers to the same 
location. 


• 
If GA contains 
the starting 
address of an 


array, then [GA+ IX] addresses the array ele- 
ment indexed by IX. For example, if IX con- 
tains 
the value 4H, 
the effective 
address 
refers to the fifth element of a byte array, or 
the third element of a word array. [GA+IX+] 
selects the same element 
and additionally 
auto-increments 
IX by 1 (byte operation), 
2 


(word operation) or 3 (MOVP instruction) in 
anticipation 
of 
accessing 
the 
next 
array 


element. 


Note that any pointer 
register could have been 


substituted for GA in the previous examples. 


Table 3-18. ASM-89 Memory Addressing 
Mode Notation 


Notation 
Addressing 
Mode 


[ptr-reg] 
Based 
[ptr-reg ].offset 
Offset 
[ptr-reg + IX] 
Indexed 
[ptr-reg + IX +] 
Indexed Post Auto-increment 


ptr-reg 
offset 
= GA, GB, GC or PP 
= 8-bit signed 
value; 
may be struc- 


ture element 


As discussed 
in section 
3.7, program 
transfer 


instructions 
operate by adding a signed byte or 
word displacement to the task pointer. Table 3-19 
shows how the ASM-89 assembler determines the 
sign and size of the displacement value it places in 
a program 
transfer 
machine instruction. 
In the 


table, 
the terms 
"backward" 
and 
"forward" 


refer to the location 
of a label specified as a 


transfer target relative to the transfer instruction. 
"Backward" 
means the label physically precedes 


the instruction 
in the source module, 
and "for- 


ward" means the label follows the instruction in 
the source text. The distances are from the end of 
the 
transfer 
instruction; 
the 
distance 
to 
the 


instruction immediately following the transfer is 
o bytes. 


ADD I 
ADD 
ADDBI 
ADDB 
ADDB 
ADD 
ADDI 
ADDB 


GA,5 
; REGISTER, 
IMMEDIATE 
GC, [GB] 
; REGISTER, 
MEMORY 
(BASED) 
[PP],10 
; MEMORY 
(BASED), 
IMMEDIATE 
IX, [GB].5 
; REGISTER, 
MEMORY 
(OFFSET) 
BC, [GC].COUNT 
; REGISTER, 
MEMORY 
(OFFSET) 
[GC+ 
IX], BC 
; MEMORY 
(INDEXED), 
REGISTER 
[GA + IX +],5 
; MEMORY 
(INDEXED 
AUTO-INCREMENT), 
IMMED 
[PP].ERROR, [GAl; 
MEMORY 
(OFFSET), 
MEMORY 
(BASED) 


8089 INPUT /OUTPUT 
PROCESSOR 
transfer instruction; 
this should not prove restnc- 
same 
locatIOn, 
posslOly 
tive except in very large programs. 
Second, one 
address. 


byte can be saved in the assembled instruction by 
writing the short mnemonic 
when the target is 
known to be within -128 through +127 assembled 
bytes of the transfer. 


It is also important 
to note 
that 
a program 
transfer target must reside in the same module as 
the 
transferring 
instruction, 
i.e., 
the 
target 
address must be known at assembly time. 


An ASM-89 program may invoke an out-of-line 
procedure 
(subroutine) 
with the CALLILCALL 
instruction. 
The 
first 
instruction 
ope~and 
specifies a memory location where the content of 
TP will be stored as a physical address pointer 
before control 
is transferred 
to the procedure. 
The procedure 
may 
return 
to the 
instruction 
following 
the 
CALLILCALL 
by 
using 
the 
MOVP instruction 
to restore TP from the save 
area. Figure 3-53 illustrates one approach to pro- 
cedure linkage. 


A channel program may use the first two words of 
its parameter 
block (pointed to by PP) as a task 
pointer save area. However, 
this is not recom- 
mended if there is any chance that the CPU will 


As in any program 
transfer, 
the target 
of a 


CALL/LCALL 
instruction must be contained in 


the same m0dule and within 32k bytes of the 
instruction. 


The relocatable 
object module produced 
by the 


ASM-89 assembler consists of a single logical seg- 
ment. (A segment is a storage unit up to 64k bytes 
long; for a more complete description, 
refer to 


sections 2.3 and 2.7.) The ASM-89 SEGMENT 
and ENDS directives name the segment as shown 
in figure 3-54. Typically, all instructions and most 
directives are coded in between these directives. 
The 
END 
directive, 
which 
terminates 
the 


assembly, is an exception. 


The LOC-86 utility can assign this logical segment 
to any memory address that is a physical segment 
boundary 
(i.e., 
whose low-order 
four 
bits are 
0000). In a ROM-based 
system, 
variable 
data 


(which must be in RAM) 
can be "clustered" 


together at one "end" 
of the program as shown in 


figure 3-55. The ORG directive can then be used 
to force assembly of the variables to start at a 
given offset from the beginning of the segment 
(2,000 hexadecimal bytes in figure 3-55). As the 


Target Location 
J 


Mnemonic 
·Direction 
.Distance 
Displacement 
Form 
Sign Bytes 


Backward 
<;;;128 
- 
1 
Forward 
<;;;127 
+ 
1 
Short 
Backward 
<;;;32,768 
- 
2 
, 


(e.g., JMP) 
Forward 
<;;;32,767 
Error 
Backward 
>32,768 
Error 


.,·T 
Forward 
>32,767 
Error 
. 


Backward 
<;;;128 
- 
2 
Forward 
<;;;127 
+ 
2 
Long 
Backward 
<;;;32,768 
- 
2 
(e.g., LJMP) 
Forward 
<;;;32,767 
+ 
2 
Backward 
>32,768 
Error 
Forward 
>32,767 
Error 
. 


; SET UP TP SAVE AREA 
NOTE: EXAMPLE 
ASSUMES 
PROGRAM 
IS IN I/O SPACE. 
USE LPDI 
IF IN SYSTEM 
SPACE. 
, 
MOVI 
GC, CALLSAVE 
; LOAD ADDRESS 
TO GC 
; CALL IT. 
LCALL 
[GC],DEMO 


HLT 
; LOGICALENDOFPROGRAM 


; DEFINE THE PROCEDURE. 
DEMO: 
; PROCEDURE 
INSTRUCTIONS 
GO HERE. 
; NOTE: PROCEDURE 
MUST NOT UPDATEGC 
; 
AS IT POINTS TO THE RETURN 
ADDRESS. 


; RETURN 
TO CALLER. 
MOVP 
TP, [GCl 


ENDS 
END 
; END OF SEGMENT 
; END OF ASSEMBLY 


figure shows, the segment can then be located so 
that instructions and constants fall into the ROM 
portion of memory, while the variable part of the 
segment is located in RAM. The entire segment, 
including any "unused" 
portions, of course, can- 


not exceed 64k bytes. 


An 
ASM-89 
module 
can 
make 
some 
of 
its 


addresses available to other modules by defining 
symbols 
with 
the 
PUBLIC 
directive. 
At 
a 


minimum, 
a channel 
program 
must 
make the 


address of its first instruction 
available 
to the 


CPU module that starts the channel 
program. 


Figure 3-56 shows an ASM-89 module that con- 
tains three channel 
programs 
labelled 
READ, 


WRITE and DELETE. The example shows how a 
PLlM-86 
program 
and 
an 
ASM-86 
program 


could define these "entry 
points" 
as EXTER- 


NAL and EXTRN 
symbols respectively. 
When 


the modules are linked together, 
LINK-86 will 


match the externals with the publics, thus pro- 
viding the CPU programs with the addresses they 
need. 


(AVAILABL E) 
------ 
t 


RAM 
---- 
ROM 


HLT 
; ASM89 INSTRUCTIONS FOR "WRITE" OPERATION 


HLT 
; ASM89 INSTRUCTIONS FOR "DELETE" 
OPERATION 


PLlM-86 
MODULE 
USES "WRITE" 
SYMBOL 


DECLARE 
(READ,WRITE,DELETE) 
POINTER 
EXTERNAL; 


DECLARE 
PARM$BLOCK 
STRUCTURE 
(TP$ST ART 
POINTER, 
BUFFER$ADDR 
POINTER, 


BUFFER$LEN 
WORD); 


i*SET 
UP "WRITE" 
CHANNEL 
OPERATION* 
I 
PARM$BLOCK. 
TP$START 
= WRITE; 


READ_PTR 
DD 
READ 
WRITE_PTR 
DD 
WRITE 
DELETE_PTR 
DD 
DELETE 


; PARM_BLOCK 
EVEN 
TP_ST 
ART 
DD ? 
BU FFER_ADDRDD 
? 
BUFFER_LEN 
DW? 


; SET UP "READ" 
CHANNEL 
OPERATION 
MOV 
AX, WORD PTR READ_PTR 
MOV 
WORD PTR TP _ST 
ART, AX 
MOV 
AX, WORD PTR READ_PTR 
MOV 
WORD PTR TP _ST 
ART + 2, AX 


; 1ST WORD 


; 2ND WORD 


Conversely, 
an ASM-89 module can obtain the 


address of a public symbol in another module by 
defining it with the EXTRN directive. An external 
symbol, however, can only appear as the initial 
value operand of a DD directive (see figure 3-57). 
This effectively means that an ASM-89 program's 


use of external symbols is limited to obtaining the 
addresses of data located in the system space. 
Another 
way 
of 
doing 
this, 
which 
may 
be 


preferable in many cases, is to have the CPU pro- 
gram 
place 
system 
space 
addresses 
in 
the 


parameter block. 


Figure 
3-58 diagrams 
the 
logic 
of 
a 
simple 
ASM-89 program; 
the code is shown in figure 
3-59. The program reads one physical record (sec- 
tor) from a diskette drive controlled by an 8271 
Floppy 
Disk Controller. 
No particular 
system 
configuration 
is implied by the program, 
except 
that the 8271 resides in the lOP's 1/0 space. 


Hardware address decoding logic is assumed to be 
set up as follows: 


• 
reading 
location 
FFOOH selects 
the 
8271 


status register, 


• 
writing 
location 
FFOOH selects 
the 
8271 
command register, 


• 
reading 
location 
FFOIH 
selects 
the 
8271 
result register 


• 
writing 
location 
FFO1H 
selects 
the 
8271 
parameter register 


• 
decoding 
the address 
FF04H 
provides 
the 
8271 DACK (DMA acknowledge) signal. 


Register IX is used as a retry counter. 
If the 
transfer is not completed successfully (bit 3 of the 
8271 result register * 0), the program retries the 
transfer up to 10times. 


The 
program 
uses 
structures 
to 
address 
the 
parameter 
block and the 8271 registers. Register 
PP contains the address of the parameter block, 
and the program loads GC with FFOOH to point 
to the 8271 registers. The program's 
entry point 
(the label START) is defined as a PUBLIC sym- 
bol so that the CPU program can place its address 
in the parameter block when it starts the program. 


Since the 8271 automatically 
requests 
a DMA 
transfer upon receipt of the last parameter, 
this 


parameter 
is sent 
immediately 
following 
the 
XFER command. 


ISIS-II 
8089 ASSEMBLER 
V1.0 ASSEMBLY 
OF MODULE 
FLOPPY 


OBJECT 
MODULE 
PLACED 
IN 
:FO:FLOPPY.OBJ 


ASSEMBLER 
INVOKED 
BY ASM89 
FLOPPY.A89 


0000 
0004 
0008 
0009 
OOOA 
OOOB 


0000 
0001 
0002 


OOOC 
EABA 
00 FC 


0010 
OA4E 
00 
12 


0014 
0293 
08 02CE 
01 


FLOPPY 
SEGMENT 


j *-* 
,'•• 8089 
PROGRAM 
TO READ SECTOR 
FROM 
FLOPPY 
DISK 


j**. 


,'•• LAY OUT 
PARAMETER 
BLOCK. 
PARM 
BLOCK 
STRUC 


R"ESERVED TP: 
DS 
4 


BUFF 
PTR: 
DS 
4 


TRACK: 
DS 
1 


SECTOR: 
DS 
1 


RETURN 
CODE: 
DS 
1 


PARM BLOCK 
ENDS 


,···LAY 
OUT 
8271 


FLOPPY 
REGS 
COMMAND 
STAT: 
PARM 
RESULT: 


FLOPPY 
REGS 


DEVICE 
REGISTERS. 


STRUC 
DS 
DS 
ENDS 


,···8271 
ADDRESSES. 
FLOPPY 
REG ADDR 
EQU 
OFFOOH 


DACK 
8271 
EQU 
OFF04H 


,···MAKE 
PROGRAM 
ENTRY 
POINT 
ADDRESS 
AVAILABLE 
TO OTHER 
MODULES. 


START 


,···CLEAR 
RETURN 
CODE 
IN PARAMETER 
BLOCK. 


START: 
MOVBI 
[PPJ.RETURN 
CODE,O 


,.·.INITIALIZE 
RETRY 
COUNT. 
. 
MOVI 
IX,10 


j-*·SEND 
····WAIT 
RETRY: 
,···SEND 


COMMAND 
SEQUENCE 
TO 8271, 
HOLDING 
FINAL 
PARM. 


UNTIL 
8271 IS NOT BUSY. 
JNBT 
[GCj.COMMAND 
STAT,7,RETRY 
"READ SECTOR, 
DRIVE 
0" COMMAND. 


MOVBI 
[GCj.COMMAND 
STAT,012H 
TRACK 
ADDRESS 
PARAMETER. 
MOVB 
[GCj.PARM 
RESULT,[PPJ.TRACK 


····LOAD 
CHANNEL 
CONTROL 
REGISTER 
SPECIFYING: 


, 
FROM 
PORT TO MEMORY, 
SYNCHRONIZE 
ON SOURCE, 


GA 
POINTS 
TO SOURCE, 
TERMINATE 
ON EXT, 


TERMINATION 
OFFSET 
O. 
MOVI 
CC,08820H 


55 
001E 
AOOO 
56 
57 
~8 
0020 
238B 
04 
59 
0023 
1130 04FF 
60 
bl 
b2 
0027 
AABA 
00 FC 
63 
64 
b5 
002B 
6000 
b6 
67 


0293 r 09 
b8 
002D 
02CE 
01 
b9 
70 
71 
72 
73 
0033 
6ABE 
01 05 
74 
75 
76 
0037 
A03C 
77 
78 
79 
0039 
A840 DO 
80 
81 
~2 
003C 
EABA 
00 FC 
83 
84 
85 
0040 
OA4E 
00 2C 
86 
87 
88 
0044 
8ABA 
00 FC 
89 
90 
91 
0048 
0292 
01 02CF 
OA 
92 
93 
94 
004E 
4000 
95. 
96 
97 
0050 
2048 
0 
98 
99 
0052 
100 
101 


SYMBOL 
TABLE 
------------ 


DEFN 
VALUE 
TYPE 
NAME 


10 
0004 
SYM 
BUFF PTR 
18 
0000 
SYM 
COMMAND 
STAT 
24 
FF04 
SYM 
DACK_8271 
83 
003C 
SYM 
EXIT 
2 
0000 
SYM 
FLOPPY 
17 
0000 
STR 
FLOPPY 
REGS 
23 
FFOO 
SYM 
FLOPPCREG 
ADDR 
8 
0000 
STR 
PARM BrOCK 
19 
0001 
SYM 
PARM-RESULT 
9 
0000 
SYM 
RESERVED 
TP 
41 
OOOC 
SYM 
RETRY 
13 
OOOA 
SYM 
RETURN 
CODE 
12 
0009 
SYM 
SECTOR 
31 
0000 
PUB 
START 
11 
0008 
SYM 
TRACK 
63 
0027 
SYM 
WAIT 1 
89 
0044 
SYM 
WAIT2 


ASSEMBLY 
COMPLETE; 
NO ERRORS 
FOUND 


;***POINT 
GB AT DESTINATION, 
GA AT SOURCE. 


LPD 
uB,[PP1.BUFF 
PTR 
MOVI 
GA,DACK_8271- 


;***INSURE 
THAT 
8271 IS READY FOR 
LAST 
PARAMETER. 
WAIT1: 
JNBT 
[GC1.COMMAND_STAT,5,WAIT1 


;***P,REPARE FOR 
D~:ER 


;***START 
DMA BY SENDING 
FINAL 
PARAMETER 
TO 8271. 


MOVB 
[GC J.PARM_ RESULT, [PP]. SECTOR 


;***PROGRAM 
RESUMES 
HERE FOLLOWING 
EXT. 


;o**IF 
TRANSFER 
IS OK THEN 
E~IT, 
ELSE TRY AGAIN. 


JBT 
[GCJ.PARM_RESULT,3,EXIT 


;***DECREMENT 
RETRY 
COUNT. 
DEC 
IX 


;o**TRY AGAIN 
IF COUNT 
NOT EXHAUSTED. 
JNZ 
IX,RETRY 


;OO*POST 
RESULT 
IN PARAMETER 
BLOCK 
FOR CPU. 
MOVB 
[PP1.RETURN 
CODE,[GCJ.PARM_RESULT 


;***INTERRUPT 
CPU. 
SINTR 


,..",.." 
••••..••••. 
1".,...··--·-.::n."thTh~itt."lI:a'\.n~S-11VHl 
Ule 
/"\.01VI-0::11,,;11C11111t:l 
pru- 
gram when the two are linked 
together. 
(The 
ASM-89 program 
must, 
of course, 
define the 
symbol in a PUBLIC directive.) 
The LINK-86 utility program combines multiple 
relocatable 
object 
modules 
into 
a 
single 
relocatable module. The input modules may con- 
sist of modules produced by any of the 8086 fam- 
ily language 
translators: 
ASM-89, ASM-86, or 
PLlM-86. 
LINK-86's 
principal 
function 
is to 
satisfy external references made in the modules. 
Any symbol that 
is defined 
with the EXTRN 
directive in ASM-89 or ASM-86 or is declared 
EXTERNAL 
in 
PLlM-86 
is 
an 
external 
reference, 
i.e., 
a reference 
to an address 
con- 
tained 
in another 
module. 
Whenever 
LINK-86 
encounters an external reference, it searches the 
other modules for a PUBLIC symbol of the Same 
name. If it finds the matching symbol, it replaces 
the external 
reference 
with the address 
of the 
object. 


The most 
common 
occurrence 
of an external 
reference in a system that employs one or more 
8089s is the channel program 
address. 
In order 
for a CPU program to start a channel program, it 
must ensure that the address of the first channel 
program instruction 
is contained in the first two 
words of the parameter 
block. Since the channel 
program 
is assembled separately, 
the translator 
that processes the CPU program will not typically 
know its address. If this address is defined as an 


Other 
external 
references 
may arise when one 
module uses data (e.g., a buffer) that is contained 
in 
another 
module, 
and 
(in 
PLlM-86 
and 
ASM-86 modules) 
when one module 
executes 
another module, typically by a CALL statement 
or instruction. 


When an 8089 module 
(or modules) 
is to be 
located in the system space, 
it may be linked 
together with PLlM-86 
or ASM-86 modules as 
described 
above 
and 
shown 
in 
figure 
3-60. 
LINK-86 resolves external 
references and com- 
bines the input modules into a single relocatable 
object 
module. 
This 
module 
can be input 
to 
LOC-86 (LOC-86 assigns final absolute memory 
addresses to all of the instructions and data). This 
absolute 
object 
module 
may, 
in turn, 
be pro- 
cessed by the 
OH-86 
utility 
to 
translate 
the 
module into the hexadecimal format. This format 
makes the module readable (the records are writ- 
ten in ASCII characters) and is required by some 
PROM programmers 
and RAM loaders. 
Intel's 
Universal PROM Programmer 
(UPP) and iSBC 
957™ Execution Package (loader) use the hexa- 
decimal format. 


ferent 
technique 
is 
required 
since 
separate 
absolute object modules must be produced for the 
system and I/O spaces. Figure 3-61 shows how to 
link and locate wheq there are external references 
between 
I/O 
space modules 
and system space 
modules. 


The normal link and locate sequence is followed 
and culminates in the production 
of an absolute 
module in hexadecimal format. Since the recprds 
in this file are human-readable, 
the file can be 


edited using the ISIS-II text editor. The editing 
task involves finding the 8089 I/O space records 
in the file, writing them to one file, and then 
writing the 8086/8088 
records (destined for the 


system space) to another 
file: MCS-86 Absolute 
Object 
File 
Formats, 
Order 
No. 
9800921, 


available 
from 
Intel's 
Literature 
Department, 
describes the records in absolute (including hexa- 
decimal) object modules. 


When using the previous method, it is likely that 
LOC-86 
will 
issue 
messages 
warning 
that 


segments overlap. 
For example, 
the 8089 code 


would typically be located starting 
at absolute 


location 
OH of the 
I/O 
space. 
However, 
the 


8086/8088 
interrupt 
pointer table occupies these 
low memory addresses in the system space. Since 
LOC-86 has no way to know that the segment will 
ultimately be located in different address spaces, 
it will warn of the conflict; the warning may be 
ignored. 


An alternative 
to linking the modules together 


and then separating them is to link system space 
modules separately 
from I/O 
space modules as 


shown in figure 3-62. This approach 
avoids the 


manual edit of the absolute object module and the 


requires, however, that modules in the two spaces 
not use the EXTRN/PUBLIC 
mechanism to refer 


to each other. 
Modules in the same space can 


define external and public symbols, however. 


External references from I/O 
space modules to 
system space modules can be. eliminated 
if the 


CPU programs pass all system space addresses in 
parameter blocks. In other words, a channel pro- 
gram can obtain any address in the system space if 
the address is'in the parameter 
block~ Using this 


approach allows the system space addresses to be 
changed during execution. 
If the addresses are 


constant 
values, 
they 
may 
also be altered 
as 
system development 
proceeds without 
relinking 


the channel programs. 


External references from system space modules to 
addresses in the I/O space may be eliminated by 
assigning these addresses values that are known at 
assembly 
or 
compilation 
time. 
Figure 
3-63 


illustrates how the ASM-89 ORO directive can be 
used to force the first instruction (entry point) of 
a channel program to an absolute address. In the 
case of the example, 
one module contains 
two 


entry points labelled "READ" 
and "WRITE." 


Assuming 
the 
module 
is located 
at 
absolute 


address OH in the 1/0 space, the channel pro- 
grams will begin at 200H and 600H respectively. 
In the example, 
these values have been chosen 


arbitrarily; in a typical application they would be 
based on the length of the programs and the loca- 
tion of RAM and ROM areas. By starting the pro- 
grams at fixed addresses that are known to the 
CPU programs 
that activate them, the channel 


programs can be reassembled without needing to 
relink the CPU programs. 


Figure 3-61. Creating Separate Absolute 
Object Modules-External 
References in Relocatable 
Modules 


Figure 3-62. Creating Separate Absolute Object Modules-No 
External References in Relocatable 


Modules 


ORG 600H 
WRITE: 


READ_ADDR 
WRITE_ADDR 
DD 
200H 
DD 
600H 


DECLARE 
READ$ADDR 
POINTER; 


DECLARE 
WRITE$ADDR 
POINTER; 


READ$ADDR 
= 200H; 


WRITE$ADDR 
= 600H; 


Figure 3-63. Using Absolute Entry Point Addresses 


3.10 Programming Guidelines 
and Examples 


This section provides two types of 8089 program- 
ming information. 
A series of general guidelines, 
which apply to system and program 
design, is 
presented first. These guidelines are followed by 
specific coding examples that illustrate program- 
ming techniques that may be applied to many dif- 
ferent types of applications. 


The practices in this section are recommended to 
simplify system development 
and, 
particularly, 
for system maintenance 
and enhancement. 
Soft- 
ware that is designed in accordance 
with these 
guidelines 
will be adaptable 
to the changing 
environment 
in which 
most 
systems 
operate, 
and 
will 
be 
in 
the 
best 
position 
to 
take 
advantage 
of new intel hardware 
and software 
products. 


Although the lOP does not "see" 
the segmented 
organization of system memory, it should respect 
this 
logical 
structure. 
The 
lOP 
should 
only 


address the system space through pointers passed 
by the CPU in the parameter block. It should not 
perform arithmetic 
on these addresses or other- 


wise manipulate 
them except for the automatic 
incrementing that occurs during DMA transfers. 
It 
is the 
responsibility 
of 
the 
CPU 
to 
pass 
addresses such that transfer 
operations 
do not 


cross segment boundaries. 


Programs that alter their own instructions are dif- 
ficult to understand 
and modify, 
and preclude 
placing the code in ROM. They may also inhibit 
compatibility with future Intel hardware and soft- 
ware products. 


Note also that when the 8089 is on a 16-bit bus, its 
instruction 
fetch queue 
can interfere 
with the 
attempt 
of one instruction 
to modify the next 


sequential instruction. 
Although 
the instruction 
may be changed in memory, its unmodified 
first 
byte will be fetched from the queue rather than 


memory if it is on an odd address. The processor 
will thus execute a partially-modified 
instruction 


with unpredictable results. 


Section 2.10 notes that I/O 
systems should be 


designed 
hierarchically. 
Application 
programs 


"see" 
only the topmost level of the structure; all 
details pertaining 
to the physical characteristics 


and operation 
of I/O 
devices are relegated 
to 


lower levels. Figure 3-64 shows how this design 
approach might be employed in a system that uses 
an 8089 to perform I/O. The same concept can be 
expanded to larger systems with multiple lOPs. 


The application 
system is clearly separated from 


the I/O 
system. No application 
programs 
per- 


form I/O; instead they send an I/O request to the 
I/O 
supervisor. 
(In systems with 
file-oriented 


I/O, the request might be sent to a file system that 
would then invoke the I/O supervisor.) The 1/0 
request should be expressed in terms of a logical 
block of data-a 
record, a line, a message, etc. It 


should also be devoid of any device-dependent 
information 
such as device address, 
sector size, 


etc. 


The I/O 
supervisor 
transforms 
the application 


program's 
request for service into a parameter 


block and dispatches a channel program to carry 
out the operation. 
The I/O 
supervisor controls 


the 
channels; 
therefore, 
it 
knows 
the 
cor- 


respondence between channels and I/O devices, 
the locations of CBs and channel programs, 
and 


the format 
of all of the parameter 
blocks. The 


I/O 
supervisor 
also 
coordinates 
channel 


"events," 
monitoring 
BUSY flags and respond- 


ing to channel-generated 
interrupt 
requests. The 


I/O supervisor does not, however, communicate 
with I/O devices that are controlled by the chan- 
nels. If the CPU performs 
some I/O itself (this 


should be restricted to devices other than those 
run by the channels), the I/O supervisor invokes 
the equivalent of a channel program in the CPU 
to do the physical I/O. 
Note that although 
the 


I/O supervisor is drawn as a single box in figure 
3-64, it is likely to be structured 
as a hierarchy 


itself, with separate modules performing its many 
functions. 


The software interface 
between the CPU's 
I/O 


supervisor and an lOP channel program 
should 


be 
completely 
and 
explicitly 
defined 
in 
the 
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areas that 
the channel 
program 
will use. The 
I 
channel program should not be writt~n so that it 
"knows" 
any of these addresses, even if they are 
C 
. 
h' 
f 
. 
FIXED 
constants. 
oncentratmg 
t e mter ace mto one 
FORMAT 
place like this makes the system easier to under- 
1 
stand and reduces the likelihood of an undesirable 
side effect if it is modified. It also generalizes the 
design so that it may be used in other application 


::::s;_64 
shows a simple channel program run- 
1 
ning on channel I and a more complex program 
running on channel 2. Channel I's program per- 
forms a single function and is therefore designed 
VARIABLE 
as a simple program. The program on channel 2 
FORMAT 
performs three functions (e.g., "read," 
"write," 
1 
"delete") 
and is structured 
to separate its func- 
GOVERNED 
BY 
tions. The functions 
might be implemented 
as 
1 
FUNCTION 
CODE 
J 
procedures 
called by the "channel 
supervisor" 
depending on the content of the parameter block. 
Notice that to the I/O supervisor, both programs 
_ 
appear 
alike; in particular, 
both have a single 
entry point. 


In some channel 
programs, 
different 
functions 


will need different information 
passed to them in 
the 
parameter 
block. 
Figure 
3-65 shows 
one 


technique that accommodates 
different 
formats 


while still allowing 
the channel 
supervisor 
to 
determine which procedure to call from the PH. 
The parameter 
block is divided into fixed and 


variable portions, and a function code in the fixed 
area indicates the type of operation 
that is to be 
performed. 
Part of the fixed area has been set 
aside so that additional parameters can be added 
in the future. 


The first example in this section illustrates how a 
CPU can initialize a group 
of lOPs 
and then 
dispatch channel programs. 
This code is written 


in PLlM-86. 


The remaining 
examples, 
written 
in ASM-89, 
demonstrate the 8089 instruction set and address- 
ing modes in various commonly-encountered 
pro- 


gramming situations. These include: 


• 
memory-to-memory 
transfers 


• 
saving and restoring registers 


::I 
'1",I,;t1ANNI:L::i. 
A II: 
SAVE 
AREA 


FIXED 
PARM11 
FUNCTION 
CODE 


FIXED 
PARM2 


FIXED 
PARM3 


RESERVED 
FOR 
FUTURE 
USE 


VARIABLE 
PARAMETER 
, 
FORMAT 
AND SIZE 
,,", 


The PLlM-86 
code in figure 3-66 initializes two 
lOPs and dispatches 
two channel programs 
on 


one of the lOPs. The same general technique can 
be used to initialize any number of lOPs. 
The 


hypothetical system that this code runs on is con- 
figured as follows: 


• 
8086 CPU (16-bit system bus); 


• 
two remote lOPs share an 8-bit local I/O bus 
via 
the 
request/grant 
lines 
operating 
in 


mode 1; 


• 
8089 channel attentions are mapped into four 
port addresses in the CPU's I/O space; 


• 
channel 
programs 
reside in the 8089 I/O 


space; 


• 
one 
8089 controls 
a CRT 
terminal, 
one 
channel running the display, the other scan- 
ning 
the 
keyboard 
and 
building 
input 


messages; 


• 
the function of the second 8089 is not defined 
in the example. 


The code declares one CB (channel control block) 
for each 8089. The CBs are declared 
as two- 


element arrays, each element defining the struc- 
ture of one channel's portion of the CB. The SCB 
(system configuration 
block) and SCP (system 


configuration 
pointer) are also declared as struc- 
tures. The SCP is located at its dedicated system 
space address of FFFF6H. 
The other structures 
are not located at specific addresses since they are 
all 
linked 
together 
by 
a 
chain 
of 
pointers 
"anchored" 
at the SCP. 


Two simple parameter 
blocks define messages to 


be transmitted 
between the PL/M-86 
program 


and the CRT. Each PB contains a pointer to the 
beginning of the message area and the length of 
the message. In the case of the keyboard (input) 
message, the channel program builds the message 
in the buffer pointed to by the pointer in the PB 
and returns the length of the message in the PB. 


The code initializes one lOP at a time since the 
chain of control blocks read by the lOP during 
initialization must remain static until the process 
is complete. To initialize the first lOP, the code 
fills in the SYSBUS and sac fields and links the 
blocks 
to each 
other 
using 
the 
P-LlM-86 
@ 
(address) operator. 
It sets channell's 
BUSY flag 


to FFH so that it can monitor the flag to deter- 
mine when the initialization 
has been completed 


(the lOP 
clears 
the 
flag to OH when 
it has 
finished). 
Channel 
2's 
BUSY flag is cleared, 


although this could just as well have been done 
after the initialization 
(the lOP does not alter 


channel 2's BUSY flag during initialization). 
The 
code starts the lOP by issuing a channel attel\tion 
to channel 
1 to indicate that the lOP is a DUs 
master. PL/M-86's 
OUT function is used to select 


the port address to which the lOP's CA and SEL 
lines have been mapped. The data placed on the 
bus (OH) is ignored by the lOP. It then waits until 
the lOP clears the channell 
BUSY flag. 


The second lOP is initialized in the same manner, 
first changing the pointer in the SCB to point to 
the second lOP's 
channel control block. If this 


lOP were on a different I/O bus, the sac field 
would 
have 
been 
altered 
if 
a 
different 


request/grant 
mode were being used or if the lOP 


had a 16-bit I/O bus. The second lOP is a slave so 
its initialization is"Started by issuing a CA to chan- 
nel2 rather than channell. 


After both lOPs are ready, the code dispatches 
two channel programs (not coded in the example); 
one program is dispatched to each channel of one 
of the lOPs. 
To avoid external 
references, 
the 
system has been set up so that the PL/M-86 code 
"knows" 
the starting addresses of these channel 


programs 
(200H and 600H). The code uses the 


PLlM-86 LOCKSET function to: 


• 
lock the system bus; 


• 
read the BUSY flag; 


• 
set the BUSY flag to FFH if it is clear; 


• 
-unlock the system bus. 


This operation 
continues until the BUSY flag is 


found to be clear (indicating that the channel is 
available). Setting the flag immediately to FFH 
prevents another 
processor 
(or another 
task in 


this program activated as a result of an interrupt) 
from using the channel. 
The code fills in the 


parameter 
block with the address and length of 


the message to be displayed, sets the CCW and 
then links the channel program (task block) start 
address 
to the parameter 
block and links the 


parameter 
block to the CB. The channel is dis- 


patched 
with the OUT function 
that effects a 
channel attention for channell. 


A similar procedure is followed to start channel 2 
scanning the terminal keyboard. 
In this case, the 
code allows channel 2 to generate an interrupt 
request (which it might do to signal that a message 
h'!-s been 
assembled). 
An 
interrupt 
procedure 
would then handle the interrupt request. 


'"ASSIGN NAMES TO CONSTANTS"' 
DECLARE 
CHANNEL$BUSY 
L1TERALLY'OFFH'; 


DECLARE 
CHANNEL$CLEAR 
L1TERALLY'OH" 


DECLARE 
CR '"CARR. RET."' 
L1TERALLy'om-i'; 


DECLARE 
LF '"LINE FEED"' 
L1TERALLY'OAH'; 


DECLARE 
DISPLAY$TB 
LITERALLY '200H'; 
DECLARE 
KEYBD$TB 
LITERALLY '600H'; 


Figure 3-66. Initialization 
and Dispatch Example 


DECLARE 
'"lOP 
CHANNEL 
ATTENTION 
ADDRESSES"' 
IOP$A$CH1 
LITERALLY 
'OFFEOH', 
IOP$A$CH2 
LITERALLY 
'OFFE1 H', 


IOP$B$CH1 
LITERALLY 
'OFFE2H', 
IOP$B$CH2 
LITERALLY 
'OFFE3H'; 


DECLARE 
'"CHANNEL 
CONTROL 
BLOCK 
FOR 10P$A) 
CB$A(2) 
STRUCTURE 
(BUSY 
BYTE, 
CCW, 
BYTE, 
PB$PTR 
POINTER, 


RESERVED 
WORD); 


DECLARE 
'"CHANNEL 
CONTROL 
BLOCK 
FOR 10P$B"' 
CB$B(2) 
STRUCTURE- 


(BUSY 
BYTE, 


CCW 
BYTE, 
- 


PB$PTR 
POINTER, 


RESERVED 
WORD); 


DECLARE 
'"SYSTEM 
CONFIGURATION 
BLOCK"' 
SCB 
STRUCTURE 
(SOC 
BYT~, 
RESERVED 
BYTE: 


CB$PTR 
POINTER); 


DECLARE 
'"SYSTEM 
CONFIGURATION 
POINTER"' 
SCP 
STRUCTURE 
(SYSBUS 
BYTE, 


SCB$PTR 
POINTER) 
AT (OFFFFQH); 


DECLARE 
MESSAGE$PB 
STRUCTURE 
(TB$PTR 
POINTER, 


MSG$PTR 
POINTER, 


MSG$LENGTH 
WORD); 
. 
. 


DECLARE 
KEYBD$PBSTRUCTUE 
(TP$PTR 
POINTER, 


BUFF _PTR 
POINTER, 
MSG$SIZE 
WORD); 


DECLARE 
SIGN$ON 
BYTE (") DATA 
(CR, LF, 'PLEASE 
ENTER USER ID'); 


DECLARE 
KEYBD$BUFF 
BYTE (256); 


," 
"INITIALIZE 
10P$A, THEN 
10P$B 


"' 
'"PREPARE 
CONTROL 
BLOCKS 
FOR 10P$A"' 
SCP.SCB$PTR 
= @ SCB; 


SCP.SYSBUS 
= 01H; '"16-BIT 
SYSTEM 
BUS"' 
SCB.SOC 
= 02H; '"RQ'GT 
MODE1, 8-BIT I/O BUS"' 
SCB.CB$PTR 
= @ CB$A(O); 
CB$A(O).BUSY 
= CHANNEL$BUSY 
CB$A(1).BUSY 
= CHANNEL$CLEAR; 


'"ISSUE 
CA FOR CHANNEL1, 
INDICATING 
lOP IS MASTER"' 
OUT (I0P$A$CH1) 
= OH; 


- I 


'"WAIT 
UNTIL 
FINISHED"' 
DO WHILE 
CB$A(O).BUSY 
= CHANNEL$BUSY; 
END; 


'"PREPARE 
CONTROL 
BLOCKS 
FOR 10P$B"' 
SCB.CB$PTR 
= @CB$B(O); 
CB$B(O).BUSY 
= CHANNEL$BUSY; 
CB$B(1).BUSY 
= CHANNEL$CLEAR; 


'"ISSUE 
CA FOR CHANNEL2, 
INDICATING 
SLAVE STATUS"' 
OUT'(IOP$B$CH2) 
= OH; 


'"WAIT 
UNTIL 
lOP IS READY"' 
DO WHILE CB$B(O).BUSY 
= CHANNEL$BUSY; 
END; 


," 
"SEND 
SIGN ON MESSAGE 
TO CRT CONTROLLED 
"BY CHANNEL 
1 OF 10P$A 
"' 
'"WAIT 
UNTIL CHANNEL 
IS CLEAR, 
THEN SET TO BUSY"' 
DO WHILE 
LOCKSET 
(@CB$A(O).BUSY, 
CHANNEL$BUSY); 
END; 


'"SET 
CCW AS FOLLOWS: 
" 
PRIORITY 
= 1, 
NO BUS LOAD LIMIT, 
DISABLE 
INTERRUPTS, 
START CHANNEL 
PROGRAM 
IN I/O SPACE"' 
CB$A(O).CCW 
= 10011001B; 


,"LINK 
MESSAGE 
PARAMETER 
BLOCK TO CB"' 
CB$A(O).PB$PTR 
= @ MESSAGE$PB; 


'"FILL 
IN PARAMETER 
BLOCK"' 
MESSAGE$PB.TB$PTR 
= DISPLAY$TB; 
MESSAGE$PB.MSG$PTR 
= @SIGN$ON; 


MESSAGE$PB. 
MSB$LENGTH 
= LENGTH 
(SIGN$ON); 


'"DISPATCH 
THE CHANNEL"' 
OUT (IOP$A$CH1) 
= OH; 


," 
"DISPATCH 
CHANNEL 
2 OF 10P$A TO 
"CONTINUOUSLY 
SCAN KEYBOARD, 
INTERRUPTING 
"WHEN 
A COMPLETE 
MESSAGE 
IS READY 
"' 
'"WAIT 
UNTIL CHANNEL 
IS CLEAR, 
THEN SET TO BUSY"' 
DO WHILE 
LOCKSET 
(@ CB$A(1).BUSY, 
CHANNEL$BUSY); 
END; 


I*SET 
CCW AS FOllOWS: 
* 
PRIORITY = 0 
BUS lOAD 
LIMIT, 
ENABLE 
INTERRUPTS, 
START CHANNEL 
PROGRAM 
IN I/O SPACE* I 
CB$A(1).CCW 
= 00110001 B; 
1*L1NK KEYBOARD 
PARAMETER 
BLOCK TO CB* I 
CB$A(1).PB$PTR 
= @ KEYBD$PB; 
I*Flll 
IN PARAMETER 
BlOCK* 
I 
KEYBD$PB.TB$PTR 
= KEYBD$TB; 
KEYBD$PB.BUFF$PTR 
= @ KEYBD$BUFF; 
KEYBD$PB.MSG$SIZE 
= OH; 
I*DISPATCH 
THE CHANNEL 
* I 
OUT (IOP$A$CH2) 
= OH; 


Figure 3-67 shows a channel program 
that per- 


forms 
a memory-to-lJlemory 
block transfer 
in 


seven instructions. 
The program moves up to 64k 


bytes between any two locations 
in the system 


space. A 16-bit system bus is assumed, and the 
CPU is assumed to be monitoring 
the channel's 


BUSY flag to determine when- the program has 
finished. 


To attain maximum transfer speed, the program' 
locks the bus during 
each transfer 
cycle. This 


ensures that another 
processor does not acquire 


the bus in the interval between the DMA fetch 
and store operations. 
By setting this channel's 


priority bit in the CCW to I and the other chan- 
nel's to 0, the CPU could effectively prevent the 
other channel from running during the transfer. 
Byte count 
termination 
is selected so that the 


transfer 
will stop 
when the number 
of bytes 


specified by the CPU has been moved. Since there 
is only a single termination 
condition, a termina- 


tion offset of 0 is specified. The transfer begins 
after the WID instruction, 
and the HL T instruc- 


tion is executed immediately upon termination. 


A CPU program can "interrupt" 
a channel pro- 


gram by issuing a "suspend" 
channel command. 


The channel responds to this command by saving 
the task pointer and PSW in the first two words 
of the parameter 
block. The suspended program 


can be restarted by issuing a "resume" 
command 


that loads TP and the PSW from the save area. 


If the CPU wants to execute another channel pro- 
gram between the suspend 
and resume opera- 


tions, 
the 
suspended 
program's 
registers 
will 


usually have to be saved first. If the "interrupt- 
ing" program "knows" 
that the registers must be 


saved, 
it can perform 
the operation 
and also 


restore the registers before it halts. 


A more general solution is shown in figure 3-68. 
This is a program that does nothing but save the 
contents of the channel registers. The registers are 
saved in the parameter 
block because PP is the 


only register that is known to point to an available 
area of memory. A similar program could be writ- 
ten to restore registers from the same parameter 
block. 


Using this approach, 
the CPU would "interrupt" 


a running program as follows: 


• 
suspend the running program, 


• 
run the register save program, 


• 
run the "interrupting" 
program, 


• 
run the register restore program, 


• 
resume the suspended program. 


· _ 
8089INP!J;1;(9jJTPUT 
PRoc!=~snA 
TP_RESERVED: 
DS 
4 
FROM_ADDR: 
DS 
4 
TO_ADDR: 
DS 
4 
SIZE: 
DS 
2 
PB 
ENDS 


;POINT GA AT SOURCE, GB AT DESTINATION. 


LPD 
GA, [PPJ.FROM_ADDR 


LPD 
GB, [PP .TO_ADDR 
;LOAD BYTE COUNT INTO BC. 


MOV 
BC, [PP].SIZE 


;LOAD CC SPECIFYING: 
; 
MEMORY TO MEMORY, 
; 
NO TRANSLATE, 
; 
UNSYNCHRONIZED, 
; 
GA POINTS TO SOURCE, 
; 
LOCK BUS DURING TRANSFER, 


; 
NO CHAINING, 
; 
TERMINATING ON BYTE COUNT,OFFSET = o. 
MOV 
CC, OC208H 


;PREPARE CHANNEL FOR TRANSFER. 


XFER 


;SET LOGICAL BUS WIDTH. 


WID 
16,16 


;STOP EXECUTION AFTER DMA. 


HLT 
MEMEXAMP 
ENDS 
END 


SAVEREGS 
SEGMENT 


;SAVE ANOTHER CHANNEL'S REGISTERS IN PB 
PB 
'STRUC 
TP_RESERVED: 
DS 
4 
GA_SAVE: 
DS 
3 
GB_SAVE: 
DS 
3 


(3C_SAVE: 
DS 
3 
IX_SAVE: 
DS 
2 
BC_SAVE: 
DS 
2 
MC_SAVE: 
DS 
2 
CC_SAVE: 
DS 
2 
PB 
ENDS 


MOVP 
MOVP 
MOVP 
MOV 
MOV 
MOV 
MOV 
HLT 
ENDS 
!=ND 


PP .GA_SAVE, 
GA 
PP .GB_SAVE, 
GB 
PP .GC_SAVE, GC 
PP .IX_SAVE, IX 
PP .BC_SAVE, 
BC 
PP .MC_SAVE, 
MC 
PP .CC_SAVE, CC 


CHAPTER 4 
HARDWARE REFERENCE 
INFORMATION 


This chapter presents specific hardware informa- 
tion regarding the operation and functions of the 
8086 family processors: the 8086 and 8088 Central 
Processing Units (CPUs) and the 8089 I/O Pro- 
cessor (lOP). 
Abbreviated 
descriptions 
of the 


8086 family 
support 
circuits 
and 
their 
circuit 


functions 
appear 
where appropriate 
within the 


processor 
descriptions. 
For 
more 
specific 
information 
on any of the 8086 family support 


circuits, refer to the corresponding 
data sheets in 


Appendix B. 


The 8086 and 8088 CPUs are characterized 
by a 


20-bit (I megabyte) address bus and an identical 
instruction/function 
format, and differ essential- 


ly from one another by their respective data bus 
widths (the 8086 uses a 16-bit data bus, and the 
8088 uses an 
8-bit 
data 
bus). 
Except 
where 
expressly 
noted, 
the 
ensuing 
descriptions 
are 


applicable to both CPUs. 


Both the 8086 and 8088 feature a combined or 
"time-multiplexed" 
address 
and data bus that 


permits a number of the pins to serve dual func- 
tions and consequently allows the complete CPU 
to be incorporated 
into a single, 40-pin package. 
As explained later in this chapter, a number of the 
CPU's control pins are defined according to the 
strapping of a single input pin (the MN/MX pin). 
In the "minimum 
mode," 
the CPU is configured 
for small, single-processor systems, and the CPU 
itself provides all control signals. In the "max- 
imum mode," 
an Intel® 8288 Bus Controller, 
rather than the CPU, provides the control signal 
outputs 
and allows a number 
of the pins pre- 


viously delegated to these control functions to be 
redefined 
in order 
to support 
multiprocessing 
applications. 
Figures 4-1 and 4-2 describe the pin 


assignments and signal definitions 
for the 8086 


and 8088, respectively. 


As shown in figures 4-3 and 4-4, both 
CPUs 
incorporate 
two separate 
processing 
units: 
the 


Execution Unit or "EU" 
and the Bus Interface 


Unit or "BIU." 
The EU for each processor 
is 


identical. The BIU for the 8086 incorporates a 16- 
bit data 
bus and 
a 6-byte 
instruction 
queue 


whereas the 8088 incorporates 
an 8-bit data bus 


and a 4-byte instruction queue. 


The EU is responsible 
for the execution 
of all 


instructions, 
for providing data and addresses to 


the 
BIU, 
and 
for 
manipulating 
the 
general 


registers and the flag register. Except for a few 
control pins, the EU is completely isolated from 
the "outside 
world." 
The BIU is responsible for 


executing all external bus cycles and consists of 
the segment and communications 
registers, 
the 


instruction 
pointer 
and 
the 
instruction 
object 


code queue. The BIU combines segment and off- 
set values in its dedicated adder to derive 20-bit 
addresses, transfers data to and from the EU on 
the ALU data 
bus and loads or "prefetches" 


instructions 
into the queue from which they are 


fetched by the EU. 


The EU, when it is ready to execute an instruc- 
tion, fetches the instruction object code byte from 
the BIU's instruction queue and then executes the 
instruction. 
If the queue is empty when the EU is 


ready to fetch an instruction 
byte, the EU waits 


for the instruction 
byte to be fetched. 
In the 
course of instruction execution, if a memory loca- 
tion 
or 
I/O 
port 
must 
be accessed, 
the 
EU 


requests the BIU to perform 
the required 
bus 


cycle. 


The two processing sections of the CPU operate 
independently. 
In the 8086 CPU, 
when two or 


more bytes of the 6-byte instruction 
queue are 


empty and the EU does not require the BIU to 
perform a bus cycle, the BIU executes instruction 
fetch cycles to refill the queue. In the 8088 CPU, 
when one byte of the 4-byte instruction 
queue is 


empty, 
the BIU executes an instruction 
fetch 


cycle. Note that the 8086 CPU, since it has a 16- 
bit data bus, can access two instruction 
object 


code bytes in a single bus cycle, while the 8088 
CPU, since it has an 8-bit data bus, accesses one 
instruction object code byte per bus cycle. If the 
EU issues a request for bus access while the BIU is 
in the process of an instruction 
fetch bus cycle, 


the BIU completes the cycle before honoring the 
EU's request. 


~, 


Common Signals 


Name 
Function 
Type 


AD15-ADO 
AddresslData 
Bus 
Bidirectional, 


3-State 


A19/S6- 
Address/Status 
Output, 


A16/S3 
3-State 


BHE/S7 
Bus High 
Enable/ 
Output, 


Status 
3-State 


MN/MX 
Minimum/Maximum 
Input 
Mode Control 
, 


Output, 
RD 
Read Control 
3-State 


TEST 
Wait On Test Control 
Input 


READY 
Wait State 
Control 
Input 


RESET 
System 
Reset 
~ 
Input 


NMI 
Non-Maskable 
Input 
Interrupt 
Request 


INTR 
Interrupt 
Request 
Input 


I' 
CLK 
System 
Clock 
Input 


VCC 
+5V 
Input 


GND 
Ground 


Minimum Mode Signals (MN/MX = Vcd 


Name 
Function 
Type 


HOLD 
Hold Request 
Input 


HLDA 
Hold Acknowledge 
Output 


WR 
Write 
Control 
Output, 
3-State 


M/IO 
Memory/lO 
Control 
Output, 
3-State 


DT/R 
Data Transmit! 
Output, 


Receive 
3-State 


DEN 
Data Enable 
Output, 
3-State 


ALE 
Address 
Latch 
Output 
Enable 
INTA 
Interrupt 
Acknowledge 
Output 


'Maximum Mode Signals (MN/MX = GND) 


Name 
Function 
Type 


RQ/GT1,O 
Request/Grant 
Bus 
Bidirectional 
Access 
Control 
IOCR 
Bus Priority 
Lock 
Output, 


Contml 
3-State 


S2-S0 
Bus Cycle Status 
Output, 
3-State 


QS1, QSO 
Instruction 
Queue 
Output 
Status 


GND 
vcc 


AD14 
AD1S 


AD13 
A16/53 


AD12 
A17/54 


AD11 
A18/5S 


AD10 
A19/56 


AD9 
BHE/57 


AD8 
MN/MX 


AD7 
RD 


AD6 
8086 
HOLD 
(RO/GTO) 
CPU 
ADS 
HLDA 
(RD/GT1) 


AD4 
WR 
(LOCK) 


AD3 
M/iO 
(52) 


AD2 
DTIR 
(51) 


AD1 
DEN 
(So) 


ADO 
ALE 
(050) 


NMI 
INTA 
(051) 


INTR 
TE5T 


CLK 
READY 


GND 
RE5ET 


-_ .•.... _ .. --;11.---- 


Name 
Function 
Type 


AD7-ADO 
AddresslData 
Bus 
Bidirectional, 
3·State 


A15-A8 
Address 
Bus 
Output, 
3-State 


A19/S6- 
Address/Status 
Output, 
A16/S3 
3-State 


MN/MX 
Minimum/Maximum 
Input 
Mode Control 


RD 
Read Control 
Output, 
3·State 


TEST 
Wait On Test Control 
Input 


READY 
Wait State 
Control 
Input 


RESET 
System 
Reset 
Input 


NMI 
Non·Maskable 
Input 
Interrupt 
Request 


INTR 
Interrupt 
Request 
Input 


CLK 
System 
Clock 
Input 


VCC 
+5V 
Input 
GND 
Ground 


Minimum 
Mode Signals 
(MN/MX 
= VCC> 


Name 
Function 
Type 


HOLD 
Hold 
Request 
Input 
HLDA 
Hold 
Acknowledge 
Output 


WR 
Write 
Control 
Output, 
3-State 
- 
IO/Memory 
Control 
Output, 
IO/M 
3-State 


DT/R 
Data Transmit/ 
Output, 


Receive 
3-State 


DEN 
Data Enable 
Output, 
3·State 


ALE 
Address 
Latch 
Output 
Enable 


TN'TA 
Interrupt 
Acknowledge 
Output 


SSO 
SO Status 
Output, 
3-State 


Maximum 
Mode Signals 
(MN/MX 
= GND) 


Name 
Function 
Type 


RQ/GT1,0 
Request/Grant 
Bus 
Bidirectional 
Access 
Control 


LOCK 
Bus Priority 
Lock 
Output, 


Control 
3·State 


S2-S0 
Bus Cycle 
Status 
Output, 
3-$t/lte 


QS1, QSO 
Instruction 
Queue 
Output 
Status 


GND 


A14 


A13 


A12 


All 


Al0 
A19/S6 


A9 
SSO 
(HIGH) 


AS 
MN/MX 


AD7 
RD 


AD6 
8088 
HOLD 
(RQ/GTO) 


CPU 
HLDA 
(RO/GT1) 


WR 
(LOCK) 


101M 
(52) 


DT/R 
(51) 


DEN 
(SO) 


ALE 
(050) 


INTA 
(051) 


TEST 


READY 


RESET 


To explain the operation of the time-multiplexed 
bus, 
the BIV's 
bus cycle must 
be examined. 
Essentially, a bus cycle is an asynchronous 
event 


in which the address 
of an I/O 
peripheral 
or 
memory location is presented, followed by either 
a read control signal (to capture or "read" 
the 
data from the addressed device) or a write control 
signal and the associated 
data 
(to transmit 
or 


"write" 
the data to the addressed device). The 


selected 
device 
(memory 
or 
I/O 
peripheral) 


accepts the data on the bus during a write cycle or 
places the requested data on the bus during a read 
cycle. On termination 
of the cycle, the device 


latches the data written or removes the data read. 


As shown in figure 4-5, all bus cycles consist of a 
minimum of four clock cycles or "T -states" iden- 
tified as T}, T2, T3 and T4' The CPU places the 
address of the memory location or I/O device on 
the bus during state T}. During a write bus cycle, 
the CPU places the data on the bus from state T2 
until state T4' During a read bus cycle, the CPU 
accepts the data present on the bus in states T3 


and T4' and the multiplexed address/data 
bus is 


floated in state T2 to allow the CPU to change 
from the write mode (output address) to the read 
mode (input data). 


It is important to note that the BIU executes a bus 
cycle only when a bus cycle is requested by the EU 
as part of instruction 
execution or when it must 


fill the instruction 
queue. 
Consequently, 
clock 


periods in which there is no BIU activity can 
occur between bus cycles. These inactive clock 
periods are referred to as idle states (TI)' While 
idle clock states result from several conditions 
(e.g., bus access granted to a coprocessor), 
as an 


example, consider the case of the execution of a 
"long" 
instruction. 
In the following example, an 


8-bit register multiply (MUL) instruction 
(which 


requires between 70 and 77 clock cycles) is exe- 
cuted by the 8086. Assuming that the multiplica- 
tion routine is entered as a result of a program 
jump (which causes the instruction 
queue to be 


reinitialized when the jump is executed) and, as 
will be explained later in this chapter, 
that the 


object code bytes are aligned on even-byte bound- 
aries, the BIU's bus cycle sequence would appear 
as shown in figure 4-6. 


2 
3 
4 
5 
6 
1 
8 
9 
10 
11 
EU FETCHES 
THE FIRST TWO BYTES FROM THE QUEUE (THE MUL INSTRUCTION) 
AND 


COMPLETES 
INSTRUCTION 
EXECUTION 
IN 70 TO 77 CLOCK· CYCLES. 


AS A RESULT OF THE JMP 
INSTRUCTION, 
THE EU 
REINITIALlZES 
THE QUEUE 
DURING 
EXECUTION 
OF 
THE JUMP. 


SINCE THE QUEUE IS 
BIU FETCHES 
TWO OBJECT 
EMPTY, THE BIU FETCHES 
coDe 
BYTES. QUEUE 
TWO OBJECT 
CODE BYTES 
AGAIN CONTAINS 
FOUR 


(THE MUL INSTRUCTION) IN 
BYTES. 


ONE BUS CYCLE AND 
I 


COMPLETES 
A SECOND 
BUS CYCLE. THE QUEUE 
CONTAINS 
FOUR BYTES. 


81U FETCHES 
TWO MORE 
OBJECT 
CODE BYTES. 


QUEUE IS NOW FULL (SIX 
BYTES). 


EU FETCHES 
THE NEXT 


OBJECT 
CODE BYTES 
FROM THE QUEUE AND 
BEGINS 
EXECUTING 
THE 


NEXT INSTRUCTION. 


BIU FETCHES 
TWO OBJECT 


CODE BYTES TO REFIll 
THE QUEUE. 
THE QUEUE IS 


AGAIN 
FULL. 


In addition to the idle state previously described, 
both 
the 
8086 
and 
8088 
CPUs 
include 
a 
mechanism for inserting additional T-states in the 
bus cycle to compensate 
for devices (memory or 


I/O) that cannot transfer 
data at the maximum 


rate. These extra T-states are called wait states 
(TW) and, when required, 
are inserted between 


states T3 and T4' During a wait state, the data on 
the bus remains unchanged. 
When the device can 


complete the transfer (present or accept the data), 
it signals the CPU to exit the wait state and to 
enter state T4' 


As shown in the following timing diagrams, 
the 


actual bus cycle timing differs between a read and 
a write bus cycle and varies between the two 
CPUs. Note that the timing diagrams illustrated 
are for the minimum 
mode. 
(Maximum 
mode 


timing is described later in this chapter.) 


Referring to figures 4-7 and 4-8, the 8086 CPU 
places 
a 
20-bit 
address 
on 
the 
multiplexed 


address/ data bus during state T l' During state 
T2. the CPU removes the address from the bus 
and 
either 
three-states 
(floats) 
the 
lower 
16 


address/data 
lines in preparation 
for a read cycle 


(figure 4-7) or places write data on these lines 


(figure 4-8). At this time, 
bus cycle status 
is 


available on the address/status 
lines. During state 


T3' 
bus 
cycle 
status 
is 
maintained 
on 
the 


address/status 
lines and either the write data is 


maintained or read data is sampled on the lower 
16 address/data 
lines. The bus cycle is terminated 


in state T4 (control 
lines are disabled 
and the 


addressed device deselects from the bus). 


The 8088 CPU, 
like the 8086, places a 20-bit 


address on the multiplexed address/data 
bus dur- 


ing state T 1 as shown in figures 4-9 and 4-10. 
Unlike the 8086, the 8088 maintains the address 
on the address lines (AIS-A8) for the entire bus 
cycle. During 
state T2' 
the CPU 
removes the 


address on the address/data 
lines (ADTADO) and 


either floats these lines in preparation 
for a read 


cycle (figure 4-9) or places write data on these 
lines (figure 4-10). At this time, bus cycle status is 
available on the address/status 
lines. During state 


T3, 
bus 
cycle 
status 
is 
maintained 
on 
the 


address/status 
lines and either write data is main- 


tained 
or 
read 
data 
is 
sampled 
on 
the 


address/data 
lines. The bus cycle is terminated in 
state 
T4 (control 
lines 
are 
disabled 
and 
the 


addressed device deselects from the bus). 
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A majority of system memories and peripherals 
require a stable address for the duration 
of the 


bus cycle (certain 
MCS-85™ 
components 
can 


operate 
with a multiplexed 
address/data 
bus). 
During state T 1 of every bus cycle, the ALE 
(Address Latch Enable) control signal is output 
(either directly from the microprocessor 
in the 


minimum mode or indirectly through an 8288 Bus 
Controller 
in the maximum mode) to permit the 


address to be latched (the address is valid on the 
trailing-edge of ALE). This "demultiplexing" 
of 
the address/data 
bus can be done remotely at 
each device in the 'system or locally at the CPU 
and 
distributed 
throughout 
the 
system 
as 
a 


separate address bus. For optimum 
system per- 


formance 
and 
for 
compatibility 
with 
multi- 
processor 
systems 
or with 
the 
Intel 
Multibus 
architecture, 
the 
locally-demultiplexed 
addres.s 


bus is recommended. To latch the address, Intel ® 
8282 (non-inverting) 
or 8283 (inverting) 
Octal 


Latches are offered as part of the 8086 product 
family and are implemented 
as shown in figure 


4-11. These circuits, in addition to providing the 
desired latch function, 
provide increased current 
drive capability and capacitive load immunity. 


The data bus cannot be demultiplexed due to the 
timing differences between read and write cycles 
and 
the 
various 
read 
response 
times 
among 


peripherals 
and 
memories. 
Consequently, 
the 


multiplexed data bus either can be buffered 
or 


used directly. When memory and I/O peripherals 
are connected directly to an unbuffered 
bus, it is 


essential that during 
a read cycle, a device is 


prevented from corrupting the address present on 
the bus during 
state 
T 1. To 
ensure 
that 
the 
address is not corrupted, a device's output drivers 
should be enabled by an output enable function 
(rather than the device's chip select function) con- 
trolled by the CPU's 
read signal. (The MCS-86 


family processors guarantee 
that the read signal 
will not be valid until after the address has been 
latched 
by 
ALE.) 
Many 
Intel 
peripheral, 


ROM/EPROM, 
and RAM circuits provide 
an 


output enable function 
to allow interface to an 


unbuffered 
multiplexed 
address/data 
bus. The 


alternative of using a buffered data bus should be 


_ considered 
since 
it 
simplifies 
the 
interfacing 


requirements and offers both increased drive cur- 
rent capability and capacitive load immunity. The 
Intel® 8286 (non-inverting) 
and 8287 (inverting) 
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Octal Bus Transceivers, 
shown in figure 4-12, are 


expressly designed to buffer the data bus. These 
transceivers use the CPU's 
DEN (Data Enable) 
and 
DTIi{ (Data 
Transmit/Receive) 
control 


signals to enable and control the direction of data 
on the bus. These signals provide the proper tim- 
ing relationship 
to guarantee 
isolation 
of the 
address that is present 
on the multiplexed 
bus 
during state T 1. 


Except where noted, 
all subsequent 
discussions 


and examples in this chapter 
assume a locally 


demultiplexed 
address bus and a buffered 
data 


bus. The resultant address and data buses from 
the address latches and data transceivers to the 
memory and 1/0 devices will be referred to collec- 
tively as the "system" 
bus. 
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To establish the bus cycle time, the CPU requires 
an external clock signal. As an integral part of the 
8086 
family, 
Intel 
offers 
the 
8284 
Clock 
Generator/Driver 
for this purpose. In addition to 
providing the primary (system) clock signal, this 
device provides both the hardware reset interface 
and the mechanism for the insertion of wait states 
in the bus cycle. 


The clock generator/driver 
requires an external 
series-resonant crystal input (or external frequen- 
cy source) at three times the required system clock 
frequency (i.e., to operate the CPU at 5 MHz, a 
IS 
MHz 
fundamental 
frequency 
source 
is 
required). 
The 
divided-by-three 
output 
(CLK) 
from the 8284 is routed 
directly to the CPU's 
CLK input. The clock generator/driver 
provides a 
second clock output 
called PCLK 
(Peripheral 
Clock) at one half the frequency of the CLK out- 
put and a buffered 
TTL level OSC (oscillator) 
output 
at the applied 
crystal 
input 
frequency. 
These outputs 
are available 
for use by system 
devices. 


The 8284's hardware 
reset function 
is accom- 
plished with an internal 
Schmitt trigger circuit 
that is activated by the RES (Reset) input. When 
this input is pulled low (i.e., a contact closure to 
ground), 
the RESET 
output 
is activated 
syn- 
chronously with the CLK signal. This signal must 
be active for four clock cycles and causes the CPU 
to fetch and execute the instruction 
at location 
FFFFOH. An external RC circuit is connected to 
the RES input to provide the power-on reset func- 
tion (on power-on, the m input must be active 
for 
50 microseconds). 
The 
RESET 
output 
is 
coupled directly to the RESET input of the CPU 
as well as being available to system peripherals as 
the system reset signal. 


The insertion of wait states in the CPU's bus cycle 
is accomplished by deactivating one of the 8284's 
RDY inputs (RDYI or RDY2). Either of these 
inputs, when enabled by its corresponding 
AENI 
or AEN2 input, can be deactivated directly by a 
peripheral device when it must extend the CPU's 
bus cycle (when it is not ready to present or accept 
data) or by a "wait 
state generator" 
circuit (a 
logic circuit that holds the RDY input inactive for 
a given number of clock cycles). 


The READY output, which is sYJ1chronizedto the 
CLK signal is coupled 
directly 
to the CPU's 


. READY input. As shown in figure 4-13, when the 
addressed device needs to insert one or more wait 
states in abus cycle, it deactivates the 8284's RDY 
input prior to the end of state T2 which causes the 
READY output to be deactivated 
at the end of 
state T2' The resultant wait state (TW) is inserted 
between states T3 and T4' To exit the wait state, 
the device activates the 8284's RDY input which 
causes the READY input to the CPU to go active 
at the end of the current wait state and allows the 
CPU to enter state T4' 


A unique feature of the 8086 and 8088 CPUs is 
the ability of a user to define a subset of the 
CPU's control signal outputs in order to tailor the 
CPU to its intended 
system environment. 
This 
"system tailoring" 
is accor!!J2lished by the strap- 


ping of the CPU's 
MN/MX 
(minimum/max- 
imum) input pin. Table 4-1 defines the 8086 and 
8088 pin assignments in both the minimum and 
maximum modes. 


ROY INPUT 
_ 


~ 
~ 
\_-_/ 


8086 
8088 


Mode 
Mode 
Pin 
Pin 


Minimum 
Maximum 
Minimum 
Maximum 


31 
HOLD 
RQ/GTO 
31 
HOLD 
RQ/GTO 


30 
HLDA 
RQ/GT1 
30 
HLDA 
RQ/GT1 


29 
WR 
LOCK 
29 
WR 
LOCK 


28 
MilO 
S2 
28 
101M 
S2 


27 
DT/R 
§1 
27 
DTiR' 
S1 


26 
DEN 
SO 
26 
DEN 
SO 
25 
ALE 
QSO 
25 
ALE 
QSO 
24 
INTA 
QS1 
24 
INTA 
QS1 


34 
SSO 
High State 


In the minimum mode (MN/MX 
pin strapped to 


+5V), the CPU supports small, single-processor 
systems that consist of a few devices and that use 
the 
system 
bus 
rather 
than 
support 
the 


Multibus™ architecture. 
In the minimum mode, 


the 
CPU 
itse!L...&enerates 
all 
bus 
control 


signals (DT/R, 
DEN, ALE and either MilO 
or 


101M) and the command output signal (RD, WR 
or 
INTA), 
and 
provides 
a 
mechanism 
for 


requesting 
bus access (HOLD/HLDA) 
that 
is 


compatible with bus master type controllers (e.g., 
the Intel® 8237 and 8257 DMA Controllers). 


In the 
minimum 
mode, 
when 
a 
bus 
master 


requires bus access, it activates the HOLD input 
to the CPU (through its request logic). The CPU, 
in response 
to 
the 
"hold" 
request, 
activates 
HLDA as an acknowledgement 
to the bus master 


requesting the bus and simultaneously 
floats the 


system bus and control lines. Since a bus request 
is asynchronous, 
the CPU samples the HOLD 


input 
on the positive 
transition 
of each CLK 


signal and, 
as shown in figure 4-14, activates 


HLDA at the end of either the current bus cycle 
(if a bus cycle is in progress) or idle clock period. 
The hold state is maintained until the bus master 
inactivates 
the HOLD 
input at which time the 


CPU regains control of the system bus. Note that 
during a "hold" 
state, the CPU will continue to 


execute instructions until a bus cycle is required. 


Note that in the minimum mode, the I/O-memory 
control line for the 8088 CPU is the converse of 
the corresponding 
control line for the 8086 CPU 


(MilO on the 8086 and 101M on the 8088). This 
was done to provide the 8088 CPU, since it is an 


8-bit 
device, 
compatibility 
with 
existing 


MCS-85™ systems and specific MCS-85™ family 
devices (e.g., the Intel® 8155/56). 


Maximum 
Mode 


In the maximum mode (MN/MX 
pin strapped to 


ground), an Intel® 8288 Bus Controller is added 
to provide a sophisticated 
bus control 
function 


and compatibility with the Multibus architecture 
(combining an Intel® 8289 Arbiter with the 8288 
permits the CPU to support multiple processors 
on the system bus). As shown in figure 4-15, the 
bus controller, 
rather than the CPU, provides all 


bus control and command outputs, and allows the 
pins previously delegated to these functions to be 
redefined to support multiprocessing functions. 


Referring"..!p1!.gure4-15, the 8288 Bus Controller 
uses the S2, SI and SOstatus bit outputs from the 
CPU (and the 8089 lOP) to generate all bus con- 
trol and command 
output signals required for a 


bus cycle. The status bit outputs are decoded as 
outlined in table 4-2. (For a detailed cjescription 
of the operation of the 8288 Bus Controller, 
refer 


to the associated data sheet in Appendix B.) 


The 8088 CPU, in the minimum mode, provides 
an SSOstatus output. This output is equivalent to 
SO in the maximum 
mode and can be decoded 


with 
DTiR: and 
101M 
(inverted), 
which 
are 


equivalent to Si and S2 respectively, to provide 
the same CPU cycle status information 
defined in 


table 4-2. This type of decoding could be used in a 
minimum 
mode 
8088-based 
system 
to 
allow 


dynamic RAM refresh during passive CPU cycles. 
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Figure 4-15. Elementary 
Maximum Mode System 


Table 4-2. Status Bit Decoding 


Status Inputs 
CPU Cycle 
8288 Command 
52 
51 
SO 


0 
0 
0 
Int~rrupt Acknowledge 
INTA 
0 
0 
1 
Read I/O Port 
---!O~ 


0 
1 
0 
Write I/O Port 
10WC,AIOWC 
0 
1 
1 
Halt 
None 
1 
0 
0 
Instruction 
Fetch 
MRDC 
1 
0 
1 
Read Memory 
MRDC 
1 
1 
0 
Write Memory 
MWTC,AMWC 
1 
1 
1 
Passive 
None 


The Request/Grant 
signal lines (RQ/GTO 
and 


RQ/GTl) 
provide 
the 
CPU's 
bus 
access 


mechanism in the maximum mode (replacing the 
HOLD/HLDA 
function 
available, 
in 
the 


minimum mode) and are designed expressly for 
multiprocessor 
applications 
using the 8089I10 


Processor 
in its local mode or other processors 


that can support 
this function. 
These lines are 


unique 
in that 
the 
request/grant 
function 
is 
acc0I!!E.lished over 
a 
single 
line 
(RQ/GTO 


or 
RQ/GTl) 
rather 
than 
the 
two-line 


HOLD/HLDA 
function. 


As 
shown 
in 
figure 
4-16, 
the 
request/grant 


sequence is a three-phase cycle: request, 'grant and 
release. The sequence is initiated by another proc 
cessor on the s~em 
bus when it outputs a pulse 


on one of the RQ/GT 
lines to request bus access 


(request phase). In response, the CPU outputs a 
pulse (on the same line) at the end of either the 
current bus cycle (if a bus cycle is in progress) or 
idle clock period to indicate to the requesting pro- 
cessor that it has floated the system bus and that it 
will logically disconnect from the bus controller 
on the next clock cycle (grant phase) and enter a 


"hold" 
state. Note that the CPU's execution unit 


(EU) continues to execute the instructions 
in the 


queue until an instruction 
requiring bus access is 


encountered 
or until the queue is empty. In the 


third 
(release) phase" the requesi~ 
processor 


again outputs a pulse on the RQ/GT 
line. This 


pulse alerts the CPU that the processor is ready to 
release the bus. The CPU regains bus access on its 
next clock cycle. Note that the exchange of pulses 
is synchronized and, accordingly, 
both the CPU 


and requesting processor must be referenced to 
the same clock signal. 


The 
request/grant 
lines 
are 
prioritized 
with 


RQ/GTO taking precedence over, RQ/GTl. 
If a 


request arrives...Q!lboth lines simultaneously, 
the 


processor 
o.!!.-RQ/GTO 
is granted 
the bus (the 


request on RQ/GTl 
is granted when the bus is 


released by the first processor following '!...2.ne...QI 
two clock channel transfer delay). Both RQ/GT 
lines (and the HOLD line in minimum mode) have 
a higher priority than a pending interrupt. 


Request/grant 
latency (the time interval between 


the receipt of a request pulse and the return of a 
grant pulse) for several conditions is given in table 
4-3. 
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COPROCESSOR REQUESTS 
CPU GRANTS BUS 
COPROCESSOR RELEASES 


BUS ACCESS 
TO COPROCESSOR' 
BUS 


i 
Request/Grant 
Delay 
Operating Condition 
8086 
8088 


Normal Instruction Processing~LOCK 
inactive 
3~6(10*)clocks 
3-10clocks 


INTACycle Executing-LOCK 
active 
.. 


I 
15clocks 
15clocks 


Locked XCHGInstruction Processing-LOCK 
active 
24-31(39*)clocks 
24-39clocks 


*The number of clocks in parentheses 
applies when the instruction being executed 
references 
a word 
operand at an Oddaddress boundary. 


Latency 
during 
normal 
instruction 
processing 


(LOCK inactive) can be as short as three clock 
cycles (e.g., during execution 
of an instruction 


that does not reference 
memory) and no more 


than ten clock cycles. Whenever the LOCK out- 
put is active (LOCK is activated during an inter- 
rupt acknowledge cycle or during execution of an 
instruction 
with 
a 
Lock 
prefix), 
latency 
is 


increased. In the case of the execution of a locked 
XCHG 
instruction 
(used 
during 
semaphore 


examination), 
maximum latency is limited to 39 


clock cycles. 
Greater 
latencies 
occur 
when 
a 


"long" 
instruction 
is locked. This, however, is 


neither necessary nor recommended. 


At the end of processor 
activity, 
the 8086 or 


8088 will not redirve its control and data buses 
until two clock cycles following receipt of the 
release pulse (or two clock cycles after HOLO 
goes inactive in the minimum mode). 


A Hold request is honored immediately following 
CPU reset if the HOLO line is active when the 
RESET line goes inactive. This action facilitates 
the 
downloading 
of 
programs 
and, 
more 


specifically, 
the 
setting 
of 
memory 
location 


FFFFOH prior to CPU activation. 
Note that the 


same result can be effected in the maximum mode 
through the RQ/GT line by generating the request 
pulse in the first 
or second clock cycle after 


RESET goes inactive. 


The LOCK output is used in conjunction with an 
Intel 8289® Bus Arbiter 
to guarantee 
exclusive 


access of a shared system bus for the duration of 
an instruction. This output is software controlled 
and 
is effected 
by preceding 
the 
instruction 


requiring exclusive access with a one byte "lock" 
prefix (see instruction 
set description in Chapter 


2). 


When the lock prefix is decoded by the EU, the 
EU informs the BIU to activate the LOCK output 
during the next clock cycle. This signal remains 
active until one clock cycle after the executi6n of 
the associated instruction is concluded. 


The QSI and QSO (Queue Status) outputs permit 
ex.ternal 
monitoring 
of 
the 
CPU's 
internal 


instruction 
queue to allow instruction 
set exten- 


sion 
processing 
by 
a 
coprocessor. 
(The 


corresponding 
Intel ICE modules use these status 


bits during "trace" 
operations.) 
The encoding of 


the QSI and QSObits is shown in table 4-4. 


Table 4-4. Queue Status Bit Decoding 


QS1 
QSO 
Queue 
Status 


o (low) 
0 
No Operation. 
During 
the 
last 
clock cycle, 
nothing 
was taken 
from the queue. 


0 
1 
First Byte. The byte taken from the 
queue 
was the first byte of the 
instruction. 


1 (high) 
0 
Queue 
Empty. 
The 
queue 
has 
been reinitialized as a result of the 
execution of a transfer instruction. 


1 
1 
Subs~quent 
Byte. The byte taken 
from the queue was a subsequent 
byte of the instruction. 


The queue status is valid during the clock cycle 
after the indicated activity has occurred. 


The 8086 and 8088 CPUs have a 20-bit address 
-bus and are capable of accessing one megabyte of 
memory address space. 


The 8086 memory 
address 
space consists of a 
sequence of up to one million individual bytes in 
which any two consecutive bytes can be accessed 
as a 16-bit data word. As shown in figure 4-17, 
the memory address space is physically divided 
into two banks of up to 512k bytes each. 


One bank is associated with the lower half of the 
CPU's 16-bit data bus (data bits 07-00), 
and the 


dther bank is associated with the upper half of the 
data bus (data bits 015-08). 
Address bits A19 


through Al are used to simultaneously address a 
specific byte location in both the upper and lower 
banks, 
and the AO address 
bit is not used in 


memory 
addressing. 
Instead, 
AO is 
used 
in 


memory bank selection. The lower bank, which 


~r=r='" ~"I t:'.J: JA. rJ:'I,p.AAA 
TI('\ M 
_ 


operation 
(a single bus .cycle). As with the byte 


transfers previously described, 
address bits Al9 


through Al address both banks, except that now 
BHE is active (selecting the upper bank) and AOis 
inactive (selecting the lower bank) to access both 
bytes. 
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LOWER 
HALF 
OF 
DATA 
BUS 


contains 
even-address 
bytes, 
is selected 
when 


AO=O, The upper bank, containing 
odd address 
bytes (AO=I), is selected by a separate signal, Bus 
High 
Enable 
(BHE). 
Table 
4-5 
defines 
the 


BHE-AO bank selection mechanism. 


BHE 
AO 
Byte Transferred 


o (low) 
0 
Both bytes 


0 
1 
Upper byte to/from odd address 


1(high) 
0 
Lower byte to/from even address 


1 
1 
None 


When accessing a data byte at an even address, 
the byte is transferred 
to or from the lower bank 


on the lower half of the data bus (07-00). 
In this 
case, the inactive 
level of the AO address 
bit 


enables the addressed byte in the lower"bank, and 
the inactive level of the BHE signal disables the 
addressed 
byte in the upper bank. 
Conversely, 
when performing a byte access at an odd address, 
the data byte is transferred 
to or from the upper 


bank on the upper half of the data bus (DI5-D8). 
The active level of the BHE signal enables the 
upper bank, and the active level of the AOaddress 
bit disables the lower bank. 


As indicated in table 4-5, the 8086 can access a 
byte 
in 
both 
the 
upper 
and 
lower 
banks 
simultaneously 
as a 16-bit word. When the low- 


order byte of the word to be accessed is on an 
even address boundary 
(that is, when the low- 


When 
the low-order 
byte of the word 
to be 


accessed is on an odd address boundary (when the 
low-order byte is in the upper bank), the word is 
"not aligned" 
and must be accessed in two bus 


cycles. During the first cycle, the low-order byte 
of the word is transferred 
to or from the upper 


bank as described 
for a byte access at an odd 


address 
(AO and 
BHE 
active). 
The 
memory 


address is then incremented, 
which causes AO to 
shift 
to an inactive 
level (selecting 
the lower 


bank), and a byte access at an even address is per- 
formed during the next bus cycle to transfer the 
word's high-order byte to or from the lower bank. 
The above sequence is initiated automatically 
by 


the 
8086 whenever 
a word 
access at an odd 


address is performed. 
Also, the directing of the 


high- and low-order bytes of the 8086's internal 
word registers to the appropriate 
halves of the 


data bus is performed 
automatically 
and, except 


for the additional 
four clock cycles required to 


execute the second bus cycle, the entire operation 
is transparent to the program. 


The 
8088 memory 
address 
space 
is logically 


organized as a linear array of up to one million 
bytes. Since the 8088 uses an 8-bit-wide data bus, 
memory consists of a single bank. Address bit AO 
is used to address memory, and a BHE signal is 
not provided. 


Word (16-bit) operands can be located at odd- or 
even-address boundaries. 
The low-order byte of 


the word is stored in the lower-valued 
address 


location, and the high-order byte is stored in the 
next, higher-valued 
address location. 
The 8088 


automatically 
executes 
two 
bus 
cycles 
when 


accessing word operands. 


The 8086 and 
8088 CPUs 
support 
both 
I/O 


mapped 
I/O 
and 
memory 
mapped 
I/O. 
I/O 


mapped I/O permits an I/O device to reside in a 
separate 
address 
space 
(first 
64k 
of 
address 


space), and the standard 
I/O 
instruction 
set is 


available 
for device communications. 
Memory 
mapped 
I/O 
permits 
an I/O 
device to reside 
anywhere 
in memory 
and allows the complete 


CPU 
instruction 
set 
to 
be 
used 
for 
I/O 
operations. 


The 8086 supports 
both 
8-bit and 
16-bit I/O 
devices. An 8-bit I/O device may be associated 
with either the upper or lower half of the data 
bus. (Assigning an equal number 
of devices to 


each half of the data bus distributes bus loading.) 
When an I/O device is assigned to the lower half 
of the bus (D7-DO), all I/O 
addresses must be 


even (AO equal "0"), 
and when an I/O device is 


assigned to the upper half of the bus, all I/O 
addresses must be odd (AOequal" 
I"). Note that 


since AO always will be either a "I" 
or a "0" for 


a specific device, it cannot be used as an address 
input to select registers within the I/O 
device. 


When an I/O device on the upper half of the bus 
and an I/O device on the lower half of the bus are 
assigned addresses that differ only by the state of 
AO (adjacent 
odd and even addresses), 
AO and 


BHE both must be conditions of device selection 
to prevent a write operation 
to one device from 


overwriting data in the other device. 


To permit data transfers to 16-bit I/O devices to 
be performed 
in a single bus cycle, the device is 


assigned an even address. To ensure that the I/O 
device is selected only for word transfers, AOand 
BHE both must be conditions of device selection. 


The 8088, since its data bus is eight bits wide, is 
designed to support 8-bit I/O devices and places 
no restrictions on odd or even addresses. 


When the 8086 or the 8088 is operated 
in the 


minimum mode, the CPU's read and write com- 
mands (RD and WR) are common 
for memory 


and I/O devices. If the memory and I/O address 
spaces overlap, device selection must be qualified 
by M/IO (8086) or 10/M 
(8088) to determine if 


the device is memory or I/O. This restriction does 
not apply to systems in which I/O and memory 
addresses do not overlap or to systems that use 
memory-mapped 
I/O 
exclusively. 
In the max- 
imum mode, the CPU generates (through the bus 
controller) separate memory read/write 
and I/O 
read/write 
commands 
in place of the M/IO 
or 


10/M signal. In a maximum mode system, an 1/0 
device is assigned 
to an I/O 
address 
or to a 


memory address (memory mapped I/O) by con- 
necting either the memory or I/O read/write com- 
mand 
lines to 
the 
device's 
command 
inputs. 


When 
the 
I/O 
and 
memory 
address 
spaces 


overlap, 
device selection 
is determined 
by the 


appropriate read/write command set. 


CPU 
interrupts 
can be software 
or hardware 


initiated. 
Software 
interrupts 
originate 
directly 


from 
program 
execution 
(i.e., 
execution 
of a 


breakpointed 
instruction) 
or indirectly 
through 
program logic (i.e., attempting to divide by zero). 
Hardware interrupts originate from external logic 
and 
are 
classified 
as either 
non-mask able 
or 


maskable. 
All interrupts, 
whether 
software 
or 


hardware initiated, result in the transfer of con- 
trol to a new program location. A 256-entry vec- 
tor table, which. contains address pointers to the 
interrupt routines, resides in absolute locations 0 
through 3FFH. Each entry in this table consists of 
two 16-bit address values (four bytes) that are 
loaded 
into 
the 
code 
segment 
(CS) 
and 
the 


instruction pointer (IP) registers as the interrupt 
routine address when an interrupt 
is accepted. 


Figure 4-18 illustrates the organization of the 256- 
entry vector table. 


Memory 
Table 
Vector 
Address 
Entry 
Definition 


::: I=======~=:=::=:=====::::I} 
Vector255,. 


I 
I 


I 
Reserved 


CS5 


'P5 


CS' 


,P. 


CS3 
L 


'P3 


CS2 


IP2 


CS' 


'P1 


CS Value 
- 
Vector 
0 (CS 0) 


IP Value 
- 
Vector 
0 (lP 0) 


As shown in figure 4-18, the first five interrupt 
vectors are associated with the software-initiated 
interrupts and the hardware non-maskable 
inter- 


rupt (NMI). The next 27 interrupt 
vectors are 
reserved by Intel and should not be used if com- 
patibility with future Intel products is to be main- 
tained. The remaining interrupt 
vectors (vectors 


32 thorugh 255) are available for user interrupt 
routines. 


The non-mask able interrupt 
(NMI) occurs as a 


result of a positive transition 
at the CPU's NMI 


input pin. This input is asynchronous 
and, 
in 


order to ensure that it is recognized, is required to 
have a minimum 
duration 
of two clock cycles. 


NMI is typically used with power fail circuitry, 
error correcting memory or bus parity detection 
logic to allow fast response to these fault condi- 
tions. When NMI is activated, 
control is trans- 


ferred to the interrupt 
service routine pointed to 


by vector 2 following execution 
of the current 


instruction. 
When a non-mask able interrupt 
is 


acknowledged, 
the current contents of the flags 


register 
are pushed 
onto 
the stack 
(the stack 


pointer 
is decremented 
by two), 
the interrupt 


enable 
and trap 
bits in the flags register 
are 


cleared (disabling mask able and single-step inter- 
rupts), 
and 
the vector 
2 CS and 
IP address 


pointers are loaded into the CS and IP registers as 
the interrupt service routine address. 


The CPU provides a single interrupt request input 
(INTR) that can be software masked by clearing 
the 
interrupt 
enable 
bit 
in the 
flags 
register 


through the execution of a CLi instruction. 
The 


INTR input is level triggered and is synchronized 
internally to the positive transition 
of the CLK 


signal. In order to be accepted before the next 
instruction, INTR must be active during the clock 
period preceding the end of the current instruc- 
tion (and the interrupt enable bit must be set). 


As shown in figure 4-19, when a maskable inter- 
rupt 
is acknowledged, 
the CPU 
executes two 


interrupt acknowledge bus cycles. 


During the first bus cycle, the CPU floats the 
address/ data bus and activates the INTA (Inter- 
rupt 
Acknowledge) 
command 
output 
during 


states T2 through T4' In the minimum mode, the 
CPU 
will not 
recognize 
a hold 
request 
from 


another 
bus 
master 
until 
the 
full 
interrupt 


acknowledge sequence is completed. 
In the max- 


imum mode, the CPU activates the LOCK output 
from state T2 of the first bus cycle until state T2 
of the second bus cycle to signal all 8289 Bus 
Arbiters in the system that the bus should not be 
accessed by any 
other 
processor. 
During 
the 


second bus cycle, the CPU 
again activates 
its 


INTA command 
output. 
In 
response 
to 
the 


ALE F\ 
~n 
~r 


'LOCK 
\ 
/ 


\ 
---J1 
\~_~I 


AD7-ADO 
----------------------- 
•••••( 
VECTOR 
TYPE 
)•••--- 


'MAXIMUM 
MODE 
ONLY 
"SEVERAL 
(3 TYPICAL) 
IDLE 
CLOCK 
STATES 
OCCUR 
BETWEEN 
THE 
FIRST 
AND 
SECOND 
INTERRUPT 
ACKNOWLEDGE 
BUS CYCLES 
IN THE 
8086 CPU (DURING 
THIS 
INtERVAL 
THE 
BUS 
IS DRIVEN). 
INTERRUPT 
ACKNOWLEDGE 
BUS 
CYCLES 
OCCUR 
BACK·TO·BACK 
IN 
THE 
8088 CPU. 


second INTA, the external interrupt system (e.g., 
an Intei® 8259A Programmable 
Interrupt 
Con- 
troller) places a byte on the data bus that iden- 
tifies the 
source 
of 
the 
interrupt 
(the vector 
number or'vector 
"type"). 
This byte is read by 
the CPU and then multiplied 
by four with the 
resultant value used as a pointer into the interrupt 
vector table. 
Before calling the corresponding 
interrupt 
routine, 
the CPU 
saves the machine 
status by pushing the current contents of the flags 
register onto the stack. The CPU then clears the 
interrupt enable and trap bits in the flags register 
to prevent subsequent 
maskable 
and sing\e-step 
interrupts, 
"and establishes the interrupt 
routine 
return linkage by pushing the current CS and IP 
register contents onto the stack before loading the 
new CS and IP register values from the vector 
table. 


The four classes of interrupts are prioritized with 
software-initiated 
interrupts 
having the highest 
priority and with maskable and single-step inter- 
rupts sharing the lowest priority (see section 2.6). 
Since the CPU disables maskable and single-step 
interrupts 
when acknowledging 
any interrupt, 
if 
recognition of maskable interrupts 
or single-~tep 
operation 
is required 
as part 
of tht; interrupt 
routine, the routine first must. set these bits. 


The processing times for the various classes of 
interrupts are given in table 4-6. (These times also 
are included with the 8086/8088 instruction times 
cited in section 2.7.) 


Interrupt 
Class 
Processing Time 


External Maskable Interrupt 
(INTR) 
61clocks 


Non-Maskable Interrupt (NMI) 
. 50clocks 


INT (with vector) 
51clocks 
INTType 3 
52clocks 
INTO 
53clocks 


Single Step 
50clocks 


Note that the times shown in table 4-6 represent 
only the time required 
to process the interrupt 
request after it has been recognized. To determine 
interrupt 
latency (the time interval between the 
posting of the interrupt request and the execution 
of 
"useful" 
instructions 
within 
the 
interrupt 


routine), additional time must be included for the 
completion on an instruction being executed when 
the interrupt 
is posted (interrupts 
are generally 
processed 
only at instruction 
boundaries), 
for 
saving the contents 
of any additional 
registers 
prior 
to 
interrupt" 
processing 
(interrupts 
automatically save only CS, IP and Flags) and for 
any wait states that may be incurred during inter- 
rupt processing. 


Machine 
Instruction 
Encoding 
and 
Decoding 


Writing a MOV instruction 
in ASM-86 in the 
form: 


will cause the assembler to generate 1 of 28 pos- 
sible forms of the MOV machine instruction. 
A 
programmer 
rarely'needs 
to know, the details of 
machine 
instruction 
formats 
or encoding. 
An 
exception may occur during'debugging 
when it 
may be necessary to monitor instructions fetched 
on the bus, read unformatted 
memory dumps, 
etc, 
This 
section 
provides 
the 
information 
necessary to translate or decode an 8086 or 8088 
machine instruction. 


To pack instructions 
into memory as densely as 
possible, the 8086 and 8088 CPUs utilize an effi- 
cient coding technique. Machine instructions vary 
from one to six bytes in length. One-byte instruc- 
tions, which generally operate on single registers 
or flags, 
are simple to identify. 
The keys to 
decoding longer instructions 
are in the first two 
bytes. The format 
of these bytes can vary, but 
most instructions 
follow the format 
shown in 
figure 4-20. 


The 
first 
six bits 
of 
a multibyte 
instruction 
generally contain 
an opcode that identifies the 
basic instruction 
type: 
ADD, 
XOR, 
etc. The 
following 
bit, 
dilled 
the 
0 
field, 
generally 
specifies the "direction" 
of the operation: 
I = the 
REG 
field 
in the 
second 
byte 
identifies 
the 
destination operand, 0 = the REG field identifies 
the source operand. 
The W field distinguishes 
between byte and word operations: 0 = byte, 1 = 
word. 


One of three additional 
single-bit fields, S, V or 
Z, appears in some instruction formats. S is used 
in conjunction 
with W to indicate sign extension 


of immediate fields in arithmetic instructions. 
V 
distinguishes 
between 
single- 
and 
variable-bit 


shifts and rotates. Z is used as a compare bit with 


the zero 
flag in conditional 
repeat 
and 
loop 


instructions. 
All single-bit field settings are sum- 
marized in table 4-7. 


BYTE 3 
BYTE 4 
BYTE 5 
BYTE 6 
------~-----r-----~------l 
I 
I 
I 
I 
LOW DISP/DATA 
IHIGH DISP/DATA I 
LOW DATA 
I 
HIGH DATA 
I 
I 
I 
I 
I 
____________ 
-l. 
' 
J 
llli 


REGiSTER OPERAND/REGISTERS 
TO USE IN EA CALCULATION 


REGISTER OPERAND/EXTENSION 
OF OPCODE 


REGISTER MODE/MEMORY 
MODE WITH DISPLACEMENT LENGTH 


WORD/BYTE OPERATiON 


DIRECTiON is TO REGISTER/DiRECTION 
is FROM REGiSTER 


OPERATION (INSTRUCTION) CODE 


Field 
Value 
Function 


S 
0 
No sign extension 
1 
Sign extend 8-bit immediate data to 16bits if W=1 


W 
0 
Instruction operates on byte data 
1 
Instruction operates on word data 


0 
0 
Instruction source is specified in REGfield 
1 
Instruction destination is specified in REGfield 


V 
0 
Shift/rotate count is one 
1 
Shift/ rotate count is specified in CL register 


Z 
0 
Repeatlloop while zero flag is clear 
1 
Repeat/loop while zero flag is set 
, 


The second byte of the instruction 
usually iden- 


tifies 
the 
instruction's 
operands. 
The 
MOD 


(mode) 
field 
indicates 
whether 
one 
of 
the 


operands is in memory or whether both operands 
are registers (see table 4-8). The REG (register) 
field identifies a register that is one of the instruc- 
tion operands 
(see table 4-9). In a number 
of 
instructions, 
chiefly 
the 
immediate-to-memory 
variety, 
REG 
is used as an extension 
of the 


opcode to identify 
the type of operation. 
The 
encoding of the R/M (register/memory) 
field (see 


table 4-10) depends on how the mode field is set. 
If MOD = 11 (register-to-register 
mode), 
then 


R/M 
identifies the second register operand. 
If 


MOD selects memory mode, then R/M indicates 
how the effective address of the memory operand 
is to be calculated. 
Effective address calculation 


is covered in detail in section 2.8. 


Bytes 3 through 6 of an instruction 
are optional 


fields that usually contain the displacement value 
of a memory operand and/or 
the actual value of 


an immediate constant operand. 


CODE 
EXPLANATION 


00 
Memory Mode, no displacement 
follows* 


01 
Memory Mode, 8-bit 
displacement 
follows 


10 
Memory Mode, 16-bit 
displacement 
follows 


11 
Register Mode (no 
displacement) 


*Except when RIM = 110,then 16-bit 
displacement 
follows 


REG 
W=O 
W=1 


000 
AL 
AX 


001 
CL 
CX 


010 
OL 
OX 


011 
BL 
BX 


100 
AH 
SP 


101 
CH 
BP 


110 
OH 
SI 


111 
BH 
01 


There may be one or two displacement bytes; the 
language translators 
generate one byte whenever 


possible. 
The MOD field indicates 
how many 


displacement 
bytes are present. 
Following 
Intel 
convention, 
if the displacement 
is two bytes, the 


most-significant 
byte 
is stored 
second 
in the 


instruction. 
If the displacement 
is only a single 


byte, the 8086 or 8088 automatically 
sign-extends 


this quantity to 16-bits before using the informa- 
tion in further 
address calculations. 
Immediate 
values always follow any displacement values that 
may be present. The second byte of a two-byte 
immediate value is the most significant. 


Table 4-12 lists the instruction 
encodings for all 


8086/8088 instructions. 
This table can be used to 


predict the machine 
encoding 
of any ASM-86 


instruction. 
Table 
4-13 
lists 
the 
8086/8088 


machine instructions in order by the binary value 
of their first byte. 
This table can be used to 


decode any machine instruction 
from its binary 


representation. 
Table 
4-11 
is 
a 
key 
to 
the 


abbreviations 
used in tables 4-12 and 4-13. Table 


4-14 is a more 
compact 
instruction 
decoding 


guide. 


MOD=11 
EFFECTIVE ADDRESS CALCULATION 


RIM 
W=O 
W=1 
RIM 
MOD=OO 
MOD=01 
MOD=1~ 


000 
AL 
AX 
000 
(BX)+(SI) 
(BX) + (SI) + 08 
(BX) + (SI) + 016 


001 
CL 
CX 
001 
(BX)+(OI) 
(BX) + (01)+ 08 
(BX)+(01)+016 


010 
OL 
OX 
010 
(BP)+(SI) 
(BP)+(SI)+08 
(BP) + (SI) + 016 


011 
BL 
BX 
011 
(BP)+(OI) 
(BP) + (01)+ 08 
(BP) + (01)+ 016 


100 
AH 
SP 
100 
(SI) 
(SI)+08 
(SI)+016 
101 
CH 
BP 
101 
(01) 
(01)+08 
(01)+016 
110 
OH 
SI 
110 
DIRECT ADDRESS 
(BP)+08 
(BP)+016 


111 
BH 
01 
111 
(BX) 
(BX)+08 
(BX)+016 


MOD 


REG 


R!M 


SR 


W,S,D,V,Z 


DATA-B 


DATA-SX 


Mode field; described 
in this chapter. 


Register field; described 
in this chapter. 


Register! Memory field; described in this chapter. 


Segment register code: OO=ES,01=CS, 10=SS, 11=DS. 


Single-bit instruction 
fields; described in this chapter. 


B-bit immediate constant. 


B-bit immediate 
value that is automatically 
sign-extended 
to 16-bits 
before use. 


Low-order byte of 16-bit immediate constant. 


High-order byte of 16-bit immediate constant. 


Low-order 
byte of optional B- or 16-bit unsigned displacement; 
MOD 
indicates if present. 
r 


High-order 
byte 
of optional 
16-bit unsigned 
displacement; 
MOD 
indicates if present. 


Low-order byte of new IP value. 


High-order byte of new IP value 


Low-order byte of new CS value. 


High-order byte of new CS value. 


B-bit signed increment to instruction 
pointer. 


Low-order byte of signed 16-bit instruction 
pointer increment. 


High-order byte of signed 16-bitinstruction 
pointer increment. 


Low-order 
byte of direct address (offset) of memory operand; EA not 
calculated. 


High-order byte of direct address (offset) of memory operand; EA not 
calculated. 


Bits may contain any value. 


First 3 bits of ESC opcode. 


Second 3 bits of ESC opcode. 


8-bit general register operand. 


16-bit general register operand. 


8-bit memory operand (any addre,ssing mode). 


16-bit memory operand (any addressing mode). 


8-bit immediate operand. 


16-bit immediate operand. 


Segment register operand. 


Byte string addressed by 01. 


DATA-LO 


DATA-HI 


(DISP-LO) 


IP-LO 


IP-HI 


CS-LO 


CS-HI 


IP-INC8 


IP-INC-LO 


IP-INC-HI 


ADDR-LO 


xxx 


YYY 


REG8 


REG16 


MEM8 


MEM16 


IMMED8 


IMMED16 


SEGREG 


DEST-STR8 


IDENTIFIER 
EXPLANATION 
, 


SRC-STR8 
Byte string addressed by SI. 
I 


OEST-STR16 
Word string addressed by 01. 


SRC-STR16 
Word string addressed by SI. 


SHORT-LABEL 
Label within ±127 bytes of instruction. 


NEAR-PROC 
' Procedure in current code segment. 


FAR-PROC 
Procedure in another code segment. 


NEAR-LABEL 
Label in current 
code segment 
but farther than -128 to +127 bytes 


from instruction. 


FAR-LABEL 
Label in another code segment. 


SOURCE-TABLE 
XLAT translation table addressed by BX. 


OPCOOE 
ESCopcode operand. 


SOURCE 
ESC register or memory operand. 


100010dw 
mod 
reO 
rIm 
(OISP-LO) 
(OISP-HI) 
I 


1 1 0 0 0 1 1 w 
mod 
o 0 0 
rim 
(OISP-LOI 
(OISP-HI) 
I 
data 
I 
datalfw-' 
I 


1 0 1 1 w 
reg 
da1a 
dataUw-l 


1010000w 
addr-Io 
addr-hl 


1010001w 
addr-Io 
addr-hi 


1 0 0 0 1 1 1 0 
mod 
o 
SA 
rIm 
IOISP-LO) 
IOISP-HI) 
I 


10001100 
mod 
o 
SA 
rIm 
IOISP-LO) 
(OISP-HI) 
I 


1 1 1 0 0 1 , 
w 
DATA-8 


1 1 1 0 1 1 1 w 


1 1 0 1 0 1 11 


1 0 0 0 1 1 o 1 
mod 
reg 
rim 
(DISP-LO) 
(DISP-HII 


1 1 0 0 0 1 o 1 
mod 
reg 
rim 
(DISP-LO) 
(DISP-HII 


1 1 0 0 0 1 o 0 
mod 
reg 
rim 
(DISP-LOI 
(DISP-HI) 


1 0 0 1 11 
, 
1 


1 0 0 1 11 
1 0 


1 0 0 1 1 1 0 0 


1 0 0 1 1 1 0 , 


OOOOOOdw 
mod 
reg 
rim 
(DISP-LO) 
(DISP-Hl) 
I 


100000sw 
mod 
o 0 0 
rim 
(DISP-LO) 
(DISP-HII 
I 
data 
I 
data if s: w-01 
I 


0OOOO10w 
data 
dataifw-1 


0OO100dw 
mod 
reg 
rim 
(DISP-LO) 
(DISP-HI) 
I 


100000sw 
mod 
o 1 0 
rim 
(DISP-LO) 
(DISP-HI) 
T 
data 
T 
data If s: w-01 
I 


0OO1010w 
data 
data if w-1 


1111111 
w 
mod 
0 
0 
0 
rIm I 
(DISP-LO) 
I 


(DISP-HI) l 


01000reg 


o 0 1 , 
0 1 


" 
o 0 1 o 0 1 " 


SUB,. Subtr.ct: 


Reg/memory 
and register to either 


Immediate 
from reglsterfmemory 


Reg/memory 
and register 
to either 


Immediate 
from reglsterfmemory 


Immediate 
from accumulator 


DEC Decrement: 


Reglsterlmemory 


Register 


NEG Change 
sign 


CMP = Compar.: 


Register/memory 
and register 


Immediate 
with reglsterfmemory 


Immediate 
with accumulator 


DAS Decimal 
adjust 
for subtract 


MUL Multiply 
(unsigned) 


IMUL Integer 
multiply 
(signed) 


AAM ASCII adjust 
for multiply 


DIY Divide 
(unsigned) 


IDlY 
Integer 
divide 
(signed) 


AAD ASCII adjust 
for divide 


CBW Convert 
byle to word 


CWO Convert 
word 10 double 
word 


NOT Invert 


SHL/SAL 
Shift logicalfarithmetic 
left 


$HR Shilt logical 
right 


$AR Shift arithmetic 
right 


ROL Rotate 
left 


001010dw 
mod 
reg 
rim 
IDISP-LO) 
IDISP-HI) 


100000sw 
mod 
1 
0 
1 
rim 
IDISP-LO) 
IDISP-HI) 
dala 
I 
data if s: w=01 
I 


o 0 1 0 1 lOw 
data 
data if w-1 


o 0 0 1 1 0 d 
w 
mod 
reg 
rim 


1 0 0 0 0 0 s w 
mod 
0 
1 
1 
rIm 


IDISP-LO) 


(DISP-LO) 


IDISP-HI) 


(DISP-HI) 


1 1 1 1 1 1 1 w 
mod 
0 
0 
1 
rIm 


01001reg 


1 1 1 1 0 1 1 w 
mod 
0 
1 
1 
rIm 


001110dw 
mod 
reg 
rim 
IDISP-LOI 
IDISP-HI) 


100000sw 
mod , 1 , rim 
IDISP-LO) 
(DISP-HI) 
data 
I 


dala if s: w=1 
I 


o 0 1 1 1 1 0 w 
dala 


o 0 1 11 ',1 


o 0 1 o 1 1 11 


" 
1 101 
, 
w 
mod 
1 
o 
0 
rim 
IDISP-LO) 
IDISP-HI) 


1 1 1 1 0 1 , 
w 
mod 
1 
o 
, 
rim 
IDISP-LO) 
IDISP-HI) 


1 1 0 
1 0 
1 0 0 
00001010 
(DISP-LO) 
IDISP-HI) 


1 1 1 1 0 1 1 w 
mod 
1 
1 o 
rim 
IDISP-LO) 
(DISP·HI) 


1 1 1 1 0 1 1 w 
mod 
1 
1 , rim 
IDISP-LO) 
IDISP-HII 


1 1 0 1 o 
1 0 1 
00001010 
IDISP-LO) 
(DISP-HII 


1 0 0 1 1 0 0 0 


1 0 0 1 1 0 0 1 


1 1 1 1 0 1 1 w 
mod 
0 , 0 
rim 
(DISP-LO) 
(D1SP·Hlj 


1 1 0 1 0 0 v 
w 
mod , o 
0 
rim 
IDISP-LOI 
(DISP·Hl) 


1 1 0 1 0 0 v w 
mod , 0 
1 
rim 
IDISP-LOI 
(DISP·HI) 


110100vw 
mod 
1 
1 
1 
rim 
IDISP-LO) 
IDISP-HI) 


110100vw 
mod 
o 
0 
0 
rim 
IDISP-LO) 
IDISP-HI) 


1 1 0 1 00 
v 
w 
mod 
o 
0 
, 
rim 
IDISP-LO) 
IDISP-HI) 


11 
o 1 o 0 
••• w 
mod 
o 
1 o 
rim 
IDISP-LO) 
IDISP-HI) 


110100vw 
mod 
0 
1 
1 
rim 
IDISP-LO) 
IDISP-HI) 


0OOO10dw 
mod 
reg 
rim 
IDlSP-LOI 
IDISP-HII 
I 


1000000w 
mod 
o 
0 
, 
rim 
(DISP-LOI 
(DISP-HII 
I 
data 
I 
data if w-1 
1 


0OOO110w 
data 
data ifw-' 


001100dw 
mod 
reg 
rim 
(DISP-LOI 
{DISP-HII 
1 


o 0 , 
1 0 1 0 w 
data 
(DISP-LOI 
(DISP-HII 
T 
data 
T 
data Itw-1 
1 


o 0 , 
1 0 1 0 w 
data 
data if w-1 


1 1 1 1 0 0 1 z 


1 0 1 0 0 1 0 w 


1 0 1 0 0 1 1 w 


1 0 1 0 1 " 
w 


1 0 1 0 1 , 
0 w 


1 0 1 0 1 0 1 w 


, , 
, 0 , o 0 0 
IP-INC-lO 
tP-INC-HI 


111 
11 
111 
mod 
o 
1 
0 
rim 
(OISP-lOI 
(DISP-HI) 
I 


, 0 0 , 
, o 1 0 
lP-to 
IP-hi 


CS-Io 
CS-hi 


""", 
, 
mod 
01 
1 
rim 
(DISP-lO) 
(DISP-HI) 
I 


11 
1 0 1 o 0 1 
IP-INC-lO 
IP·INe-HI 


, 
1 
, 
0 
, 
011 
IP-tNC8 


"" 
, 1 1 1 
mod 
1 o 
0 
rim 
(DISP-lO) 
(DISP-HI) 
I 


, 
, 
, 
0 
, o 
1 0 
IP-Io 
IP-hi 


CS-l0 
CS-h; 


""", 
, 
mod 
1 
0 
1 
rim 
(DISP-lOI 
(DISP-HI) 
I 


11 
o 0 0 0 , 
, 


'10000'0 
data-to 
data-hi 
I 


, , 0 0.' 
011 


, 


1 , 0 0 1 o 1 0 
data·lo 
data-hi 
I 


o 1 1 1 0 1 o 0 
IP-INCB 


o 1 , 
, , 
, 0 0 
Ip·INC8 


o 1 11 
, 1 1 0 
Ip·INC8 


o 1 11 
o 0 , 0 
IP-INC8 


o 
1 1 1 0 1 i 0 
IP·INC8 


o 
1 1 , 
, 0 , 0 
IP·INC8 


o 
1 1 , 0 0 0 0 
IP-INCB 


o 1 1 , 
, 0 0 0 
tP-tNC8 


o 1 , 
, 0 , o 
1 
lP-INC8 


o 1 11 
, 
, 0 , 
IP-INC8 


o 
1 1 "" 
, 
IP-INCB 


o 1 , 
, 0 0 , 
, 
lP-INCS 


o , 
1 , 0 , 
, 
, 
tP~INC8 


o 1 1 , , 
0 , 
, 
tP-INC8 


o 
1 1 , 0 0 0 , 
Ip·INC8 


o 1 1 1 1 0 0 1 
IP-INCS 


11 
1 0 0 0 1 0 
IP-INCS 


11 
1 0 0 0 0 1 
IP-INC8 


1 1 1 0 0 0 0 0 
IP-INC8 


1 1 1 0 0 0 
1 1 
IP-INC8 


1 1 0 0 1 1 0 1 
OATA-8 
I 


1 1 0 0 
1 1 0 0 


1 1 0 0 1 1 1 0 


1 1 0 0 1 1 11 


1ST BYTE 
2ND BYTE 
BYTES 3, 4, 5, 6 
ASM-86 
INSTRUCTION 
FORMAT 
HEX 
BINARY 


00 
0000 
0000 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
ADD 
REG8/MEM8,REG8 
01 
0000 
0001 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
ADD 
REG16/MEM16,REG16 
02 
0000 
0010 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
ADD 
REG8,REG8/MEM8 


! 
, 


03 
0000 
0011 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
ADD 
REG16,REG16/MEM16 
04 
0000 
0100 
DATA-8 
ADD 
AL,IMMED8 


05 
0000 
0101 
DATA-LO 
DATA-HI 
ADD 
AX,IMMED16 


06 
0000 
0110 
PUSH 
ES 


07 
0000 
0111 
POP 
ES 


•., I DJ • t:. 
2ND BYTE 
BYTES 3,4,5,6 
ASM-86 
INSTRUCTION 
FORMAT 
HEX 
BINARY 


08 
0000 
1000 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
OR 
REG8/MEM8,REG8 


09 
0000 
1001 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
OR 
REG16/MEM16,REG16 


OA 
0000 
1010 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
OR 
REG8,REG8/MEM8 


OB 
0000 
1011 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
OR 
REG16,REG16/MEM16 


OC 
0000 
1100 
DATA-8 
OR 
AL,IMMED8 


OD 
0000 
1101 
DATA-LO 
DATA-HI 
OR 
AX,IMMED16 


OE 
0000 
1110 
PUSH 
CS 


OF 
0000 
1111 
(not used) 


10 
0001 
0000 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
ADC 
REG8/MEM8,REG8 


11 
0001 
0001 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
ADC 
REG16/MEM16,REG16 


12 
0001 
0010 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
ADC 
REG8,REG8/MEM8 


13 
0001 
0011 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
ADC 
REG16,REG16/MEM16 


14 
0001 
0100 
DATA-8 
ADC 
AL,IMMED8 


15 
0001 
0101 
DATA-LO 
DATA-HI 
ADC 
AX,IMMED16 


16 
0001 
0110 
PUSH 
SS 


17 
0001 
0111 
POP 
SS 
18 
0001 
1000 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
SBB 
REG8/MEM8,REG8 


19 
0001 
1001 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
SBB 
REG161 M EM16, REG16 


1A 
0001 
1010 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
SBB 
REG8,REG8/MEM8 


1B 
0001 
1011 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
SBB 
REG16, REG161 M EM16 


1C 
0001 
1100 
DATA-8 
SBB 
AL,IMMED8 


10 
0001 
1101 
DATA-LO 
DATA-HI 
SBB 
AX,IMMED16 


1E 
0001 
1110 
PUSH 
DS 
1F 
0001 
1111 
POP 
DS 
20 
0010 
0000 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
AND 
REG81 M EM8, REG8 


21 
0010 
0001 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
AND 
REG16/MEM16,REG16 


22 
0010 
0010 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
AND 
REG8,REG8/MEM8 
23 
0010 
0011 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
AND 
REG16,REG16/MEM16 
24 
0010 
0100 
DATA-8 
AND 
AL,IMMED8 


25 
0010 
0101 
DATA-LO 
DATA-HI 
AND 
AX,IMMED16 
26 
0010 
0110 
ES: 
(segment override 
prefix) 
27 
0010 
0111 
DAA 
28 
0010 
1000 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
SUB 
REG8/MEM8,REG8 
29 
0010 
1001 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
SUB 
REG16/MEM16,REG16 
2A 
0010 
1010 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
SUB 
REG8,REG8/MEM8 
2B 
0010 
1011 
MOD REG RIM 
(DISP-LO,(DISP-HI) 
SUB 
REG16,REG16/MEM16 
2C 
0010 
1100 
DATA-8 
SUB 
AL,IMMED8 
2D 
0010 
1101 
DATA-LO 
DATA-HI 
SUB 
AX,IMMED16 
2E 
0010 
1110 
., 
CS: 
(segment override 
prefix) 
2F 
0010 
1111 
DAS 
30 
0011 
0000 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
XOR 
REG8/MEM8,REG8 
31 
0011 
0001 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
XOR 
REG161 MEM16, REG16 
32 
0011 
0010 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
XOR 
REG8,REG8/MEM8 
33 
0011 
0011 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
XOR 
REG16,REG16/MEM16 
34 
0011 
0100 
DATA-8 
XOR 
AL,IMMED8 
35 
0011 
0101 
DATA-LO 
DATA-HI 
XOR 
AX,IMMED16 
36 
0011 
0110 
SS: 
(segment override 
prefix) 


1ST BYTE 
2ND BYTE 
BYTES 3,4,5,6 
ASM-86 
INSTRUCTION 
FORMAT 
HEX 
BINARY 


37 
0011 
0110 
AAA 


38 
0011 
1000 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
CMP 
REG8/MEM8,REG8 


39 
0011 
1001 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
CMP 
REG16/MEM16,REG16 


3A 
0011 
1010 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
CMP 
REG8, REG81 M EM8 


3B 
0011 
1011 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
CMP 
REG16,REG16/MEM16 
3C 
0011 
1100 
DATA-8 
CMP 
AL,IMMED8 
3D 
0011 
1101 
DATA-La 
DATA-HI 
CMP 
AX,IMMED16 
3E 
0011 
1110 
DS: 
(segment 
override 
prefix) 
3F 
0011 
1111 
AAS 
40 
0100 
0000 
INC 
AX 
41 
0100 
0001 
INC 
CX 
42 
0100 
0010 
INC 
DX 
43 
0100 
0011 
INC 
BX 
44 
0100 
0100 
INC 
SP 
45 
0100 
0101 
INC 
BP 
46 
0100 
0110 
INC 
SI 
47 
0100 
0111 
INC 
DI 
48 
0100 
1000 
DEC 
AX 
49 
0100 
1001 
DEC 
CX 
4A 
0100 
1010 
DEC 
DX 
4B 
0100 
1011 
DEC 
BX 
4C 
0100 
1100 
DEC 
SP 
4D 
0100 
1101 
DEC 
BP 
4E 
0100 
1110 
DEC 
SI 
4F 
0100 
1111 
DEC 
DI 
50 
0101 
0000 
PUSH 
AX 
51 
0101 
0001 
PUSH 
CX 
52 
0101 
0010 
PUSH 
DX 
53 
0101 
0011 
PUSH 
BX 
54 
0101 
0100 
PUSH 
SP 
55 
0101 
0101 
PUSH 
BP 
56 
0101 
0110 
PUSH 
SI 
57 
0101 
0111 
PUSH 
DI 
58 
0101 
1000 
POP 
AX 
59 
0101 
1001 
POP 
CX 
5A 
0101 
1010 
POP' 
DX 
5B 
0101 
1011 
POP 
BX 
5C 
0101 
1100 
POP 
SP 
5D 
0101 
1101 
POP 
BP 
5E 
0101 
1110 
POP 
SI 
5F 
0101 
1111 
POP 
DI 
60 
0110 
0000 
(not used) 
61 
0110 
0001 
(not used) 
62 
0110 
0010 
(not used) 
63 
0110 
0011 
(not used) 
64 
0110 
0100 
(not used) 
65 
0110 
0101 
(not used) 
66 
0110 
0110 
(not used) 
67 
0110 
0111 
(not used) 


4-29 
Mnemonics 
© Intel, 1978 


1ST BYTE 
2ND BYTE 
BYTES 3,4,5,6 
ASM-86 
INSTRUCTION 
FORMAT 


HEX 
BINARY 


68 
0110 
1000 
(not used) 


69 
0110 
1001 
(not used) 
6A 
0110 
1010 
(not used) 


6B 
0110 
1011 
(not used) 


6C 
0110 
1100 
. 
(not used) 


60 
0110 
1101 
(not used) 


6E 
0110 
1110 
(not used) 
6F 
0110 
1111 
(not used) 
70 
0111 
0000 
IP-INC8 
JO 
SHORT-LABEL 
71 
0111 
0001 
IP-INC8 
JNO 
SHORT-LABEL 
72 
0111 
0010 
IP-INC8 
JB/JNAEI 
SHORT-LABEL 
JC 
73 
0111 
0011 
IP-INC8 
JNB/JAEI 
SHORT-LABEL 


JNC 


74 
0111 
0100 
IP-INC8 
JE/JZ 
SHORT-LABEL 


75 
0111 
0101 
IP-INC8 
JNE/JNZ 
SHORT-LABEL 
76 
0111 
0110 
IP-INC8 
JBE/JNA 
SHORT-LABEL 
77 
0111 
0111 
IP-INC8 
JNBE/JA 
SHORT-LABEL 
78 
0111 
1000 
IP-INC8 
JS 
SHORT-LABEL 
79 
0111 
1001 
IP-INC8 
JNS 
SHORT-LABEL 
7A 
0111 
1010 
IP-INC8 
JP/JPE 
SHORT-LABEL 
7B 
0111 
1011 
IP-INC8 
JNP/JPO 
SHORT-LABEL 
7C 
0111 
1100 
IP-INC8 
JLlJNGE 
SHORT-LABEL 
70 
0111 
1101 
IP-INC8 
JNLlJGE 
SHORT-LABEL 
7E 
0111 
1110 
IP-INC8 
JLE/JNG 
SHORT-LABEL 
7F 
0111 
1111 
IP-INC8 
JNLE/JG 
SHORT-LABEL 
80 
1000 
0000 
MOD 000 RIM 
(DISP-LO),(DISP-HI). 
ADD 
REG8/MEM8,IMMED8 
DATA-8 
80 
1000 
0000 
MOD 001 RIM 
(DISP-LO),(DISP-HI). 
OR 
REG8/MEM8,IMMED8 
DATA-8 
80 
1000 
0000 
MOD010 
RIM 
(DISP-LO),(DISP-HI). 
ADC 
REG8/MEM8,IMMED8 
DATA-8 
80 
1000 
0000 
MOD011 
RIM 
(DISP-LO),(DISP-HI), 
SBB 
REG8/MEM8,IMMED8 


. DATA-8 
80 
1000 
0000 
MOD 100 RIM 
(DISP-LO),(DISP-HI), 
AND 
REG8/MEM8,IMMED8 
DATA-8 
80 
1000 
0000 
MOD101 
RIM 
(DISP-LO),(DISP-HI), 
SUB 
REG8/MEM8,IMMED8 
DATA-8 
80 
1000 
0000 
MOD 110 RIM 
(DISP-LO),(DISP-HI), 
XOR 
REG8/MEM8,IMMED8 
DATA-8 
80 
1000 
0000 
MOD111 
RIM 
(DISP-LO),(DISP-HI), 
CMP 
REG8/MEM8,IMMED8 
DATA-8 
81 
1000 
0001 
MOD 000 RIM 
(DISP-LO),(DISP-HI), 
ADD 
REG16/MEM16,IMMED16 


DATA-LO,DATA-HI 
81 
1000 
0001 
MOD 001 RIM 
(DISP-LO),(DISP-HI), 
OR 
REG16/MEM16,IMMED16 


DATA-LO,DATA-HI 
81 
1000 
0001 
MOD 010 RIM 
(DISP-LO),(DISP-HI), 
ADC 
REG16/MEM16,IMMED16 


DATA-LO,DATA-HI 
81 
1000 
0001 
MOD011 
RIM 
(DISP-LO),(DISP-HI), 
SBB 
REG16/MEM16,IMMED16 
DATA-LO,DATA-HI 


••.•"~n 
•..•A ~."' 
•.• 


1ST BYTE 
2ND BYTE 
BYTES 3,4,5,6 
ASM-86 
INSTRUCTION 
FORMAT 
HEX 
BINARY 


81 
1000 
0001 
MOD100 
RIM 
(DISP-LO),(DISP-HI), 
AND 
REG16/MEM16,IMMED16 
, 


DATA-LO,DATA-HI 
, 


81 
1000 
0001 
MOD101 
RIM 
(DISP-LO),(DISP-HI), 
SUB 
REG16/MEM16,IMMED16 
DATA-LO,DATA-HI 


81 
1000 
0001 
MOD 110 RIM 
(DISP-LO),(DISP-HI), 
XOR 
REG16/MEM16,IMMED16 
DATA-LO,DATA-HI 
81 
1000 
0001 
MOD111 
RIM 
(DISP-LO),(DISP-HI), 
CMP 
REG16/MEM16,IMMED16 
DATA-LO,DATA-HI 
82 
1000 
0010 
MOD 000 RIM 
(DISP-LO),(DISP-HI), 
ADD 
REG8/MEM8,IMMED8 
DATA-8 
82 
1000 
0010 
MOD001 
RIM 
(not used) 
82 
1000 
0010 
MOD010 
RIM 
(DISP-LO),(DISP-HI), 
ADC 
REG8/MEM8,IMMED8 
DATA-8 
82 
1000 
0010 
MOD011 
RIM 
(DISP-LO),(DISP-HI), 
SBB 
REG8/MEM8,IMMED8 
DATA-8 
82 
1000 
0010 
MOD 100 RIM 
(not used) 
82 
1000 
0010 
MOD101 
RIM 
(DISP-LO),(DISP-HI), 
SUB 
REG8/MEM8,IMMED8 
DATA-8 
82 
1000 
0010 
MOD 110 RIM 
(not used) 
82 
1000 
0010 
MOD 111 RIM 
(DISP-LO),(DISP-HI), 
CMP 
REG8/MEM8,IMMED8 
DATA-8 
83 
1000 
0011 
MOD 000 RIM 
(DISP-LO),(DISP-HI), 
ADD 
REG16/MEM16,IMMED8 
DATA-SX 
83 
1000 
0011 
MOD 001 RIM 
(not used) 
83 
1000 
0011 
MOD 010 RIM 
(DISP-LO), 
(DISP-HI), 
ADC 
REG161 M EM16, IMM ED8 
DATA-SX 
83 
1000 
0011 
MOD011 
RIM 
(DISP-LO),(DISP-HI), 
SBB 
REG161 MEM16,IMMED8 
DATA-SX 
I 


83 
1000 
0011 
MOD100 
RIM 
(not used) 
83 
1000 
0011 
MOD 101 RIM 
(DISP-LO),(DISP-HI), 
SUB 
REG161 M EM16,IM MED8 
DATA-SX 
, 


83 
1000 
0011 
MOD 110 RIM 
(not used) 
83 
1000 
0011 
MOD 111 RIM 
(DISP-LO),(DISP-HI), 
CMP 
REG16/MEM16,IMMED8 
DATA-SX 
84 
1000 
0100 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
TEST 
REG8/MEM8,REG8 
85 
1000 
0101 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
TEST 
REG161 M EM16, REG16 
86 
1000 
0110 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
XCHG 
REG8,REG8/MEM8 
87 
1000 
0111 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
XCHG 
REG16, REG161 M EM16 
88 
1000 
1000 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
MOV 
REG81 M EM8, REG8 
89 
1000 
1001 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
MOV 
REG161 M EM161 REG16 
8A 
1000 
1010 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
MOV 
REG8, REG81 M EM8 
8B 
1000 
1011 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
MOV 
REG16,REG16/MEM16 
8C 
1000 
1100 
MODOSR 
RIM 
(DISP-LO),(DISP-HI) 
MOV 
REG161 MEM16,SEGREG 
, 


8C 
1000 
1100 
MOD 1-- RIM 
(not used) 
, 


8D 
1000 
1101 
MOD REG RIM 
(DISP-LO),(DISP-HI) 
LEA 
REG16,MEM16 
8E 
1000 
1110 
MODOSR 
RIM 
(DISP-LO),(DISP-HI) 
MOV 
SEGREG,REG16/MEM16 
8E 
1000 
1110 
MOD1-- 
RIM 
(not used) 
8F 
1000 
1111 
MODOOOR/M 
(DISP-LO),(DISP-HI) 
POP 
REG16/MEM16 
8F 
1000 
1111 
MOD001 
RIM 
(not used) 
8F 
1000 
1111 
MOD010 
RIM· 
(not used) 
\ 
• r' J> 


HARDWARE 
REFERENCE 
INFORMATION 


Table 4-13. Machine Instruction Decoding Guide (Cont'd.) 


1ST BYTE 
2ND BYTE 
BYTES 3,4,5,6 
ASM·86 
INSTRUCTION 
FORMAT 
HEX 
BINARY 


8F 
1000 
1111 
MOD011 
RIM 
(not used) 


BF 
1000 
1111 
MOD100 
RIM 
(not used) 


BF 
1000 
1111 
MOD101 
RIM 
(not used) 


BF 
1000 
1111 
MOD110 
RIM 
(not used) 


BF 
1000 
1111 
MOD111 
RIM 
(not used) 


90 
1001 
0000 
NOP 
(exchange 
AX,AX) 


91 
1001 
0001 
XCHG 
AX,CX 


92 
1001 
0010 
XCHG 
AX,DX 


93 
1001 
0011 
XCHG 
AX,BX 


94 
1001 
0100 
XCHG 
AX,SP 


95 
1001 
0101 
XCHG 
AX,BP 


96 
1001 
0110 
XCHG 
AX,SI 


97 
1001 
0111 
XCHG 
AX,DI 


9B 
1001 
1000 
CBW 
99 
1001 
1001 
CWD 
9A 
1001 
1010 
DISP-LO 
DISP-HI,SEG-LO, 
CALL 
FAR_PROC 
SEG-HI 
9B 
1001 
1011 
WAIT 
9C 
1001 
1100 
PUSHF 


9D 
1001 
1101 
POPF 
9E 
1001 
1110 
SAHF 
9F 
1001 
1111 
LAHF 
AO 
1010 
0000 
ADDR-LO 
ADDR-HI 
MOV 
AL,MEMB 
A1 
1010 
0001 
ADDR-LO 
ADDR-HI 
MOV 
AX,MEM16 
A2 
1010 
0010 
ADDR-LO 
ADDR-HI 
MOV 
MEMB,AL 
A3 
1010 
0011 
ADDR-LO 
ADDR-HI 
MOV 
MEM16,AL 
A4 
1010 
0100 
MOVS 
DEST-STRB,SRC-STRB 
A5 
1010 
0101 
MOVS 
DEST-STR16,SRC-STR16 
A6 
1010 
0110 
CMPS 
DEST-STRB,SRC-STRB 
A7 
1010 
0111 
CMPS 
DEST-STR16,SRC-STR16 
A8 
1010 
1000 
DATA-8 
TEST 
AL,IMMEDB 
A9 
1010 
1001 
DATA-LO 
DATA-HI 
TEST 
AX,IMMED16 
AA 
1010 
1010 
STOS 
DEST-STRB 
AB 
1010 
1011 
STOS 
DEST-STR16 
AC 
1010 
1100 
LODS 
SRC-STRB 
AD 
1010 
1101 
LODS 
SRC-STR16 
AE 
1010 
1110 
SCAS 
DEST-STR8 
AF 
1010 
1111 
SCAS 
DEST-STR16 
BO 
1011 
0000 
DATA-8 
MOV 
AL,IMMEDB 
B1 
1011 
0001 
DATA-8 
MOV 
CL,IMMED8 
B2 
1011 
0010 
DATA-8 
MOV 
DL,IMMEDB 
B3 
1011 
1011 
DATA-8 
MOV 
BL,IMMEDB 
B4 
1011 
0100 
DATA-8 
MOV 
AH,IMMEDB 
B5 
1011 
0101 
DATA-B 
MOV 
CH,IMMEDB 
B6 
1011 
0110 
DATA-8 
MOV 
DH,IMMEDB 
B7 
1011 
0111 
DATA-B 
MOV 
BH,IMMEDB 
BB 
1011 
1000 
DATA-LO 
DATA-HI 
MOV 
AX,IMMED16 
B9 
1011 
1001 
DATA-LO 
DATA-HI 
MOV 
CX,IMMED16 
BA 
1011 
1010 
DATA-LO 
DATA-HI 
MOV 
DX,IMMED16 
BB 
1011 
1011 
DATA-LO 
DATA-HI 
MOV 
BX,IMMED16 
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Table 4-13. Machine Instruction Decoding Guide (Cont'd.) 


1ST BYTE 
2ND BYTE 
BYTES 3,4,5,6 
ASM-86 
INSTRUCTION 
FORMAT 
HEX 
BINARY 


BC 
1011 
1100 
DATA-LO 
DATA-HI 
MOV 
SP,IMMED16 
BD 
1011 
1101 
DATA-LO 
DATA-HI 
MOV 
BP,IMMED16 
BE 
1011 
1110 
DATA-LO 
DATA-HI 
MOV 
SI,IMMED16 
BF 
1011 
1111 
DATA-LO 
DATA-HI 
MOV 
DI,IMMED16 
CO 
1100 
0000 
(not used) 
C1 
1100 
0001 
(not used) 
C2 
1100 
0010 
DATA-LO 
DATA-HI 
RET 
IMMED16 (intraseg) 


C3 
1100 
0011 
RET 
(intrasegment) 
C4 
1100 
0100 
MOD REG RIM 
(DISP-LO),{DISP-HI) 
LES 
REG16,MEM16 
C5 
1100 
0101 
MOD REG RIM 
(DISP-LO),{DISP-H 
I) 
LDS 
REG16,MEM16 
C6 
1100 
0110 
MODOOOR/M 
(DISP-LO),{DISP-HI), 
MOV 
MEM8,IMMED8 
DATA-8 
C6 
1100 
0110 
MOD 001 RIM 
(not used) 
C6 
1100 
0110 
MOD 010 RIM 
(not used) 
C6 
1100 
0110 
MOD011 
RIM 
(not used) 
C6 
1100 
0110 
MOD 100 RIM 
(not used) 
C6 
1100 
0110 
MOD 101 RIM 
(not used) 
C6 
1100 
0110 
MOD 110 RIM 
(not used) 
C6 
1100 
0110 
MOD111 
RIM 
(not used) 
C7 
1100 
0111 
MOD 000 RIM 
(DISP-LO),{DISP-HI), 
MOV 
MEM16,IMMED16 
DATA-LO,DATA-HI 
C7 
1100 
0111 
MOD 001 RIM 
(not used) 
C7 
1100 
0111 
MOD010 
RIM 
(not used). 


C7 
1100 
0111 
MOD011 
RIM 
(not used) 
C7 
1100 
0111 
MOD 100 RIM 
(not used) 
C7 
1100 
0111 
MOD101 
RIM 
(not used) 
C7 
1100 
0111 
MOD 110 RIM 
(not used) 
C7 
1100 
0111 
MOD 111 RIM 
{not used 
C8 
1100 
1000 
(not used) 
C9 
1100 
1001 
(not used) 
CA 
1100 
1010 
DATA-LO 
DATA-HI 
RET 
IMMED16 (intersegment) 
CB 
1100 
1011 
RET 
(intersegment) 


CC 
1100 
1100 
INT 
3 
CD 
1100 
1101 
DATA-8 
INT 
IMMED8 
CE 
1100 
1110 
INTO 
CF 
1100 
1111 
IRET 
DO 
1101 
0000 
MOD 000 RIM 
(DISP-LO),{DISP-HI) 
ROL 
REG8/MEM8,1 
DO 
1101 
0000 
MOD 001 RIM 
(DISP-LO),{DISP-HI) 
ROR 
REG8/MEM8,1 
DO 
1101 
0000 
MOD 010 RIM 
(DISP-LO),{DISP-HI) 
RCL 
REG8/MEM8,1 
DO 
1101 
0000 
MOD011 
RIM 
(DISP-LO),{DISP-HI) 
RCR 
REG8/MEM8,1 
DO 
1101 
0000 
MOD 100 RIM 
(DISP-LO),{DISP-HI) 
SALISHL 
REG8/MEM8,1 
DO 
1101 
0000 
MOD 101 RIM 
(DISP-LO),{DISP-HI) 
SHR 
REG8/MEM8,1 
DO 
1101 
0000 
MOD110R/M 
(not used) 
DO 
1101 
0000 
MOD111 
RIM 
(DISP-LO),{DISP-HI) 
SAR 
REG8/MEM8,1 
D1 
1101 
0001 
MODOOOR/M 
(DISP-LO),{DISP-HI) 
ROL 
REG16/MEM16,1 
D1 
1101 
0001 
MOD 001 RIM 
(DISP-LO),{DISP-HI) 
ROR 
REG16/MEM16,1 
D1 
1101 
0001 
MOD 010 RIM 
(DISP-LO),{DISP-HI) 
RCL 
REG16/MEM16,1 
D1 
1101 
0001 
MOD011 
RIM 
(DISP-LO),{DISP-HI) 
RCR 
REG16/MEM16,1 
D1 
1101 
0001 
MOD 100 RIM 
(DISP-LO),{DISP-HI) 
SALISHL 
REG16/MEM16,1 
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Table 4-13. Machine Instruction Decoding Guide (Cont'd.) 


1ST BYTE 
2ND BYTE 
BYTES 3,4,5,6 
ASM-86 
INSTRUCTION 
FORMAT 
HEX 
BINARY 


01 
1101 
0001 
MOD 101 RIM 
(OISP-LO),(OISP-HI) 
SHR 
REG16/MEM16,1 


01 
1101 
0001 
MOD 110 RIM 
(not used) 


01 
1101 
0001 
MOD 111 RIM 
(OISP-LO),(OISP-HI) 
SAR 
REG16/MEM16,1 
02 
1101 
0010 
MOOOOO RIM 
(OISP-LO),(OISP-HI) 
ROL 
REG8/MEM8,CL 


02 
1101 
0010 
MOD 001 RIM 
(OISP-LO),(OISP-HI) 
ROR 
REG8/MEM8,CL 


02 
1101 
0010 
MOO010R/M 
(OISP-LO),(OISP-HI) 
RCL 
REG8/MEM8,CL 


02 
1101 
0010 
M00011 
RIM 
(OISP-LO),(OISP-HI) 
RCR 
REG8/MEM8,CL 


02 
1101 
0010 
MOD 100 RIM 
(OISP-LO),(OISP-HI) 
SALISHL 
REG8/MEM8,CL 


02 
1101 
0010 
M00101 
RIM 
(OISP-LO),(OISP-HI) 
SHR 
REG8/MEM8,CL 


02 
1101 
0010 
MOD 110 RIM 
(not used) 


02 
1101 
0010 
MOD 111 RIM 
(OISP-LO),(OISP-HI) 
SAR 
REG8/MEM8,CL 


03 
1101 
0011 
MOOOOOR/M 
(OISP-LO),(OISP-HI) 
ROL 
REG16/MEM16,CL 


03 
1101 
0011 
MOD 001 RIM 
(OISP-LO),(OISP-HI) 
ROR 
REG16/MEM16,CL 


03 
1101 
0011 
MOD 010 RIM 
(OISP-LO),(OISP-HI) 
RCL 
REG16/MEM16,CL 


03 
1101 
0011 
M00011 
RIM 
(OISP-LO),(OISP-HI) 
RCR 
REG16/MEM16,CL 


03 
1101 
0011 
MOD 100 RIM 
(OISP-LO),(OISP-HI) 
SALISHL 
REG16/MEM16,CL 


03 
1101 
0011 
MOD 101 RIM 
(OISP-LO),(OISP-HI) 
SHR 
REG16/MEM16,CL 
03 
1101 
0011 
MOD 110 RIM 
(not used) 
03 
1101 
0011 
MOD 111 RIM 
(OISP-LO),(OISP-HI) 
SAR 
REG16/MEM16,CL 
04 
1101 
0100 
00001010 
AAM 
05 
1101 
0101 
00001010 
AAO 
06 
1101 
0110 
(not used) 
07 
1101 
0111 
XLAT 
SOURCE-TABLE 
08 
1101 
1000 
MOD 000 RIM 
1XXX 
MOD YYY RIM 
(DISP-LO), 
(DISP-HI) 
ESC 
OPCOOE,SOURCE 
OF 
1101 
1111 
MOD111 
RIM 
EO 
1110 
0000 
IP-INC-8 
LOOPNEI 
SHORT~ABEL 
LOOPNZ 
E1 
1110 
0001 
IP-INC-8 
LOOPEI 
SHORT-LABEL 
LOOPZ 
E2 
1110 
0010 
IP-INC-8 
LOOP 
SHORT-LABEL 
E3 
1110 
0011 
IP-INC-8 
JCXZ 
SHORT-LABEL 
E4 
1110 
0100 
OATA-8 
IN 
AL,IMMED8 
E5 
1110 
0101 
OATA-8 
IN 
AX,IMME08 
E6 
1110 
0110 
OATA-8 
OUT 
AL,IMME08 
E7 
1110 
0111 
DATA-8 
OUT 
AX,IMMED8 
E8 
1110 
1000 
IP-INC-LO 
IP-INC-HI 
CALL 
NEAR-PROC 
E9 
1110 
1001 
IP-INC-LO 
IP-INC-HI 
JMP 
NEAR-LABEL 
EA 
1110 
1010 
IP-LO 
IP-H I,CS-LO, CS-H I 
JMP 
FAR-LABEL 
EB 
1110 
1011 
IP-INC8 
JMP 
SHORT-LABEL 
EC 
1110 
1100 
IN 
AL,OX 
ED 
1110 
1101 
IN 
AX,DX 
EE 
1110 
1110 
OUT 
AL,DX 
EF 
1110 
1111 
OUT 
AX,OX 
FO 
1111 
0000 
LOCK 
(prefix) 
F1 
1111 
0001 
(not used) 
F2 
1111 
0010 
REPNE/REPNZ 
F3 
1111 
0011 
REP/REPE/REPZ 
F4 
1111 
0100 
HLT 
F5 
1111 
0101 
CMC 
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1ST BYTE 
2ND BYTE 
BYTES 3,4,5,6 
ASM·86 
INSTRUCTION 
FORMAT 
HEX 
BINARY 


F6 
1111 
0110 
MODOOO RIM 
(DISP-LO),(DISP-HI), 
TEST 
REG8/MEM8,IMMED8 
DATA-8 
F6 
1111 
0110 
MOD 001 RIM 
(not used) 
F6 
1111 
0110 
MOD010R/M 
(DISP-LO),(DISP-HI) 
NOT 
REG8/MEM8 
F6 
1111 
0110 
MOD011 
RIM 
(DISP-LO),(DISP-HI) 
NEG 
REG8/MEM8 
F6 
1111 
0110 
MOD100 
RIM 
(DISP-LO),(DISP-HI) 
MUL 
REG8/MEM8 
F6 
1111 
0110 
MOD101 
RIM 
(DISP-LO),(DISP-HI) 
IMUL 
REG8/MEM8 
F6 
1111 
0110 
MOD 110 RIM 
(DISP-LO),(DISP-HI) 
DIV 
REG8/MEM8 
F6 
1111 
0110 
MOD111 
RIM 
(DISP-LO),(DISP-HI) 
IDIV 
REG8/MEM8 
F7 
1111 
0111 
MODOOOR/M 
(DISP-LO),(DISP-HI), 
TEST 
REG16/MEM16,IMMED16 
DATA-LO,DATA-HI 
F7 
1111 
0111 
MOD 001 RIM 
(not used) 
F7 
1111 
0111 
MOD010R/M 
(DISP-LO),(DISP-HI) 
NOT 
REG16/MEM16 
F7 
1111 
0111 
MOD011 
RIM 
(DISP-LO),(DISP-HI) 
NEG 
REG16/MEM16 
F7 
1111 
0111 
MOD100 
RIM 
(DISP-LO),(DISP-HI) 
MUL 
REG16/MEM16 
F7 
1111 
0111 
MOD101 
RIM 
(DISP-LO),(DISP-HI) 
IMUL 
REG16/MEM16 
F7 
1111 
0111 
MOD110 
RIM 
(DISP-LO),(DISP-HI) 
DIV 
REG16/MEM16 
F7 
1111 
0111 
MOD111 
RIM 
(DISP-LO),(DISP-HI) 
IDIV 
REG16/MEM16 
F8 
1111 
1000 
CLC 
F9 
1111 
1001 
STC 
.. 


FA 
1111 
1010 
CLI 
- 


FB 
1111 
1011 
STI 
FC 
1111 
1100 
CLD 
FD 
1111 
1101 
STD 
FE 
1111 
1110 
MODOOOR/M 
(DISP-LO),(DISP-HI) 
INC 
REG8/MEM8 
FE 
1111 
1110 
MOD 001 RIM 
(DISP-LO),(DISP-HI) 
DEC 
REG8/MEM8 
FE 
1111 
1110 
MOD010R/M 
(not used) 
FE 
1111 
1110 
MOD011 
RIM 
(not used) 
FE 
1111 
1110 
MOD 100 RIM 
(not used) 
FE 
1111 
1110 
MOD 101 RIM 
(not used) 
FE 
1111 
1110 
MOD110 
RIM 
(not used) 
FE 
1111 
1110 
MOD111 
RIM 
(not used) 
FF 
1111 
1111 
MOD 000 RIM 
(DISP-LO),(DISP-HI) 
INC 
MEM16 
FF 
1111 
1111 
MOD 001 RIM 
(DISP-LO),(DISP-HI) 
DEC 
MEM16 
FF 
1111 
1111 
MOD010 
RIM 
(DISP-LO),(DISP-HI) 
CALL 
REG16/MEM16 
(intra) 
FF 
1111 
1111 
MOD011 
RIM 
(DISP-LO),(DISP-HI) 
CALL 
MEM16 (intersegment) 
FF 
1111 
1111 
MOD 100 RIM 
(DISP-LO),(DISP-HI) 
JMP 
REG16/MEM16 
(intra) 
FF 
1111 
1111 
MOD101 
RIM 
(DISP-LO),(DISP-HI) 
JMP 
MEM16 (intersegment) 
FF 
1111 
1111 
MOD110 
RIM 
(DISP-LO),(DISP-HI) 
PUSH 
MEM16 
FF 
1111 
1111 
MOD111 
RIM 
(not used) 


Lo 
0 
1 
2 
3 
4 
5 
6 
7 
8 
9 
A 
8 
C 
0 
E 
f 


0 
ADD 
ADD 
ADD 
ADD 
ADD 
ADD 
PUSH 
POP 
DR 
OR 
OR 
OR 
OR 
OR 
PUSH 
b.f.r/m 
w.f.r/m 
b.t.r/m 
w.t.r/m 
b. ia 
w,la 
ES 
ES 
b.f.r/m 
w.f,rJm' 
b.t.r/m 
w.l.rfm 
b.i 
w.i 
CS 


1 
ADC 
ADC 
ADC 
ADC 
ADC 
ADC 
PUSH 
POP 
SBB 
SBB 
SBB 
SBB 
SBB 
SBB 
PUSH 
POP 


b.t.r/m 
w,f,r/m 
b.t.r/m 
w.t.r/m 
b.i 
W.I 
SS 
SS 
b.t.r/m 
w,f,r/m 
b.t.r/m 
W.t.r/m 
b.1 
W.I 
OS 
OS 


2 
AND 
AND 
AND 
AND 
AND 
AND 
SEG 
DAA 
SUB 
SUB 
SUB 
SUB 
SUB 
SUB 
SEG 
DAS 
b.t.r/m 
w.f.r/m 
b.t.r/m 
w.t.r/m 
b.i 
w,i 
~ES 
b.t.r/m 
w.f.r/m 
b.t.r.'m 
w.t.r/m 
b.i 
W.I 
CS 


3 
XOR 
XOR 
XOR 
XOR 
XOR 
XOR 
SEG 
AAA 
CMP 
CMP 
CMP 
CMP 
CMP 
CMP 
SEG 
AAS 
b.f.r/m 
w.f.r/m 
b.t.r/m 
w.t.r/m 
b.i 
W.I 
~SS 
b.t.r/m 
w.t.r/m 
b.t.r/m 
w,t.rfm 
b.1 
w .• 
OS 


4 
INC 
INC 
INC 
INC 
INC 
INC 
INC 
INC 
DEC 
DEC 
DEC 
DEC 
DEC 
DEC 
DEC 
DEC 


AX 
CX 
OX 
BX 
SP 
BP 
SI 
01 
AX 
CX 
OX 
BX 
SP 
BP 
SI 
01 


5 
PUSH 
PUSH 
PUSH 
PUSH 
PUSH 
PUSH 
PUSH 
PUSH 
POP 
POP 
POP 
POP 
POP 
POP 
POP 
POP 


AX 
CX 
OX 
BX 
SP 
BP 
SI 
01 
AX 
CX 
OX 
BX 
SP 
BP 
SI 
01 


8 


7 
JO 
JNO 
JBI 
JNBI 
JEI 
JNEI 
JBEI 
JNBEI 
JS 
JNS 
JPI 
JNPI 
JlI 
JNlI 
JLEI 
JNLEI 


JNAE 
JAE 
JZ 
JNZ 
JNA 
JA 
JPE 
JPO 
JNGE 
JGE 
JNG 
JG 


8 
Immed 
Immed 
Immed 
Immed 
TEST 
TEST 
XCHG 
XCHG 
MOV 
MOV 
MOV 
MOV 
MOV 
LEA 
MOV 
POP 


b.r/m 
w.r/m 
b.r/m 
is.r/m 
b.r/m 
w,r/m 
b.r/m 
w.r/m 
b.t.r/m 
w,f,rlm 
b.t.r/m 
w.t.r/m 
sr,l,r/m 
sr,t,r/m 
rim 


9 
XCHG 
XCHG 
XCHG 
XCHG 
XCHG 
XCHG 
XCHG 
XCHG 
CBW 
CWO 
CALL 
WAIT 
PUSHf 
POPf 
SAHf 
LAHf 
AX 
CX 
OX 
BX 
SP 
BP 
SI 
01 
I.d 


A 
MOV 
MOV 
MOV 
MOV 
MOVS 
MOVS 
CMPS 
CMPS 
TeST 
TeST 
STOS 
STOS 
LODS 
LODS 
SCAS 
SCAS 
m 
- 
AL 
m 
- 
AX 
AL 
- 
m 
AX 
- 
m 
b,I,S 
w,l,a 


B 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 
MOV 


1_ 
AL 
i - 
CL 
i - 
DL 
i - 
BL 
i _ 
AH 
i - 
CH 
i - 
OH 
1- 
BH 
I 
- 
AX 
1- 
CX 
i-OX 
i - 
BX 
i - 
SP 
i - 
BP 
I 
- 
SI 
1-01 


C 
RET. 
RET 
LES 
LOS 
MOV 
MOV 
RET. 
RET 
INT 
INT 
INTO 
IRET 
(i·SP) 
b.i.rlm 
w,i.r/m 
1.(i·SP) 
I 
Type 
3 
(Any) 


0 
Shift 
Shift 
Shift 
Shift 
AAM 
AAD 
XLAT 
ESC 
ESC 
ESC 
ESC 
ESC 
ESC 
ESC 
ESC 


b 
w 
b.v 
w.v 
0 
1 
2 
3 
4 
5 
6 
7 


E 
LOOPNZI 
LOOPZI 
LOOP 
JCXZ 
IN 
IN 
OUT 
OUT 
CALL 
JMP 
JMP 
JMP 
IN 
IN 
OUT 
OUT 
LOOPNE 
LOOPE 
b 
w 
b 
w 
d 
d 
I.d 
si.d 
v,b 
v,w 
v,b 
v,w 


f 
LOCK 
REP 
REP 
HLT 
CMC 
Grp 1 
Grp 1 


CLC 
STC 
CLI 
STI 
CLD 
STD 
Grp 2 
Grp 2 


l 
b.r/m 
w.r/m 
b.r/]11 
w,r/m 


modOr/m 
000 
001 
010 
011 
100 
101 
110 
111 


Immed 
ADD 
OR 
ADC 
SBB 
AND 
SUB 
XOR 
CMP 
Stull 
ROL 
ROR 
RCL 
RCR 
SHlISAl 
SHR 
- 
SAR 


Grp 1 
TEST 
- 
NOT 
NEG 
MUL 
IMUL 
DIV 
IDIV 


Grp2 
INC 
DEC 
CALL 
CALL 
JMP 
JMP 
PUSH 
- 


id 
1.ld 
,d 
I.•d 


b = byte 
operation 
d = direct 
I = Irom 
CPU reg 


i = immediate 
ia =- immed. to accum. 
id 
= indirecf 
is 
= immed. 
byte. 
sign 
ext. 


I = long 
Ie. intersegment 


m = memory 
rim 
= EA is second 
byte 
si 
= short 
intrasegment 
sr = segment 
register 
t = to CPU reg 
v = variable 
w = word operation 
z = zero 


Figure 
4-22 illustrates 
the internal 
operation 
and 
bus activity 
that occur 
as an 8086 CPU 
executes 
a 
sequence 
of instructions. 
This 
figure 
presents 
the 
signals 
and 
timing 
relationships 
that 
are impor- 
tant in understanding 
8086 operation. 
The follow- 
ing discussion 
is intended 
to help in the interpreta- 
tion of the figure. 


Figure 
4-22 shows 
the repeated 
execution 
of an 
instruction 
loop. 
This 
loop 
is defined 
in both 
machine 
code 
and 
assembly 
language 
by figure 
4-21. A loop 
was chosen 
both 
to demonstrate 
the 
effects 
of a program 
jump 
on the queue 
and 
to 
make the instruction 
sequence 
easy to follow. 
The 
program 
sequence 
shown 
was selected 
for several 
reasons. 
First, 
consisting 
of 
seven 
instructions 
and 
16 bytes, 
the sequence 
is typical 
of the tight 
loops 
found 
in 
many 
application 
programs. 
Second, 
this particular 
sequence 
contains 
several 
short, 
fast-executing 
instructions 
that 
demonstrate 
both 
the effect 
of the queue 
on CPU 
performance 
and the interaction 
between 
the exe- 
cution 
unit 
(EU) 
fetching 
code 
from 
the 
queue 
and the bus interface 
unit (BIU) 
filling the queue 
and 
performing 
the 
requested 
bus 
cycles. 
Last, 
for 
the purpose 
of this 
discussion, 
code, 
stack, 
and memory 
data 
references 
were arranged 
to be 
aligned 
on even word boundaries. 


ASSEMBLY 
LANGUAGE 


MOV AX, OF802H 
PUSH AX 
MOVCX, 
BX 
MOV OX, CX 
ADD AX, [SI] 
ADD SI, 8086H 
JMP$-14 


MACHINE 
CODE 


B802F8 
50 
8BCB 
8BD1 
0304 
81C68680 
EBFO 


Figure 
4-22 
can 
be more 
easily 
interpreted 
by 
keeping 
the following 
guidelines 
in mind. 


• 
The queue status 
lines (QSO, QSI) are the key 
indicators 
of EU activity. 


• 
Status 
lines 
S2 
through 
SO are 
the 
main 
indicators 
of 8086/8088 
bus activity. 


• 
Interaction 
of 
the 
BIU 
and 
EU 
is via 
the 
queue 
for pre fetched 
opcodes 
and via the EU 
for requested 
bus cycles for data operands. 


Keeping 
these guidelines 
in mind, 
the instruction 


sequence 
depicted 
in figure 
4-22 can be described 


as follows. 
Starting 
the loop 
arbitrarily 
in clock 


cycle I with the queue 
reinitialization 
that 
occurs 


as part 
of the JMP 
instruction, 
JMP 
instruction 


execution 
is completed 
by the EU, while the BIU 


performs 
an opcode 
fetch 
to begin 
refilling 
the 


queue. 
(Note 
that 
a shorthand 
notation 
has been 


used 
in the 
figure 
to 
represent 
the 
two 
queue 


status 
lines 
and 
the 
three 
status 
lines-active 


periods 
on any 
of these 
lines 
are noted 
and 
the 


binary 
value 
of the lines is indicated 
above 
each 


active region.) 


In clock 
cycle 
8, the queue 
status 
lines 
indicate 


that the first byte of the MOY immediate 
instruc- 


tion has been removed 
from 
the queue 
(one clock 


cycle after 
it was placed 
there 
by the BIU fetch) 


and that 
execution 
of this instruction 
has begun. 


The second 
byte of this instruction 
is taken 
from 


the 
queue 
in clock 
cycle 
10 and 
then, 
in clock 


cycle 12, the EU pauses 
to wait one clock cycle for 


the BIU's 
second 
opcode 
fetch 
to be completed 


and 
for 
the 
third 
byte 
of the 
MOY 
immediate 


instruction 
to 
be 
available 
for 
execution 


(remember 
the queue 
status 
lines indicate 
queue 


activity 
that 
has 
occurred 
in the 
previous 
clock 


cycle) . 


Clock cycle 13 begins 
the execution 
of the PUSH 


AX 
instruction, 
and 
in clock 
cycle 
IS, the BIU 


begins the fourth 
opcode 
fetch. 
The BIU finishes 


the fourth 
fetch in clock cycle 18 and prepares 
for 


another 
fetch when 
it receives 
a request 
from 
the 


EU for a memory 
write 
(the stack 
push). 
Instead 


of completing 
the opcode 
fetch 
and 
forcing 
the 


EU to wait four 
additional 
clock 
cycles, 
the BIU 


immediately 
aborts 
the 
fetch 
cycle 
(resulting 
in 


two idle clock 
cycles 
(TI) in clock 
cycles 
19 and 


20) and performs 
the required 
memory 
write. This 


interaction 
between 
the EU and 
BIU results 
in a 


single clock extension 
to the execution 
time of the 


PUSH 
AX instruction, 
the maximum 
delay ·that 


can occur in response 
to an EU bus cycle request. 


Execution 
continues 
in clock 
cycle 
24 with 
the 


execution 
of 
back-to-back, 
register-to-register 


MOY instructions. 
The first of these instructions 


takes 
full advantage 
of the prefetched 
opcode 
to 


complete 
this operation 
in two clock 
cycles. 
The 


second 
MOY 
instruction, 
however, 
depletes 
the 


queue 
and 
requires 
two 
additional 
clock 
cycles 


(clock cycles 28 and 29). 
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In clock cycle 30, the ADD memory indirect to 
AX instruction 
begins. In the time required to 
execute this instruction, 
the BIU completes two 
opcode 
fetch 
cycles and 
a memory 
read 
and 
begins a fourth opcode fetch cycle. Note that in 
the case of the memory read, the EU's request for 
a bus cycle occurs at a point in the BIU fetch cycle 
where it can be incorporated 
directly (idle states 
are not required and no EU delay is imposed). 


In clock 
cycle 44, 
the 
EU 
begins 
the 
ADD 
immediate instruction, 
taking four bytes from the 
queue and completing 
instruction 
execution 
in 
four clock cycles. Also during this time, the BIU 
senses a full queue in clock cycle 45 and enters a 
series of bus idle states (five or six bytes constitute 
a full queue in the 8086; the BIU waits until it can 
fetch a full word of opcode before accessing the 
bus). 


At clock cycle 47, the BIU again begins a bus 
cycle sequence, 
one that 
is destined 
to be an 
"overfetch" 
since the EU is executing a JMP 
instruction. 
As part of the JMP instruction, 
the 
queue reinitialization 
(which began the instruc- 
tion sequence) occurs. 


The entire sequence of instructions 
has taken 55 
clock cycles. Eighteen opcode bytes were fetched, 
one word memory read occurred, and one word 
stack write was performed. 


This example was, by design, partially bus limited 
and indicates the types of EU and BIU interaction 
that can occur in this situation. Most application 


code sequences, however, use a higher proportion 
of more complex, 
longer-executing 
instructions 
and addressing modes, and therefore tend to be 
execution 
limited. 
In 
this 
case, 
less BIU-EU 
interaction 
is required, 
the queue more often is 
full, and more idle states occur on the bus. 


The previous 
example 
sequence 
can be easily 
extended to incorporate 
wait states in the bus 
access cycles. In the case of a single wait state, 
each bus cycle would be lengthened to five clock 
cycles with a wait state (Tw) inserted between 
every T3 and T4 state of the bus cycle. As a first 
approximation, 
the instruction sequence exection 
time would appear to be lengthened by 10 clock 
cycles, one cycle for each useful read or write bus 
cycle that occurs. Actually, 
this approximation 
for the number of wait states inserted is incorrect 
since the queue can compensate for wait states by 
making use of previously idle bus time. For the 
example sequence, this compensation 
reduced the 
actual execution time by one wait state, and the 
sequence was completed 
in 64 clock cycles, one 
less than the approximated 
65 clock cycles. 


The Intel® 8089 I/O Processor 
(lOP) combines 
the functions of a DMA controller with the pro- 
cessing capabilities of a microprocessor. 
In addi- 
tion to the normal DMA function of transferring 
data, 
the 
8089 
is 
capable 
of 
dynamically 
translating 
and 
comparing 
the 
data 
as 
it 
is 
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transferred 
and of supporting 
a number of ter- 
minate conditions 
including byte count expired, 


data compare or miscompare and the occurrence 
of an external 
event. 
The 
8089 contains 
two 
separate 
DMA 
channels, 
each 
with 
its 
own 
register 
set. 
Depending 
on 
the 
established 
priorities 
(both 
inherent 
and 
program 
deter- 
mined), 
the 
two 
channels 
can 
alternate 
(interleave) their respective operations. 


Designed expressly to relieve the 8086 or 8088 
CPU of the overhead associated with I/O opera- 
tions, the 8089, when configured 
in the remote 
mode, can perform a complete I/O task while the 
CPU is performing 
data processing tasks. The 
8089, when it has completed its I/O task, can then 
interrupt the CPU. 


Transfer 
flexibility 
is an integral 
part 
of the 
8089's design. 
In addition 
to routine 
transfers 
between an I/O peripheral and memory, transfers 
can be performed 
between two I/O 
devices or 
between two areas of memory. Transfers between 
dissimilar bus widths are automatically 
handled 
by the 8089. When data is transferred 
from an 
8-bit peripheral bus to a l6-bit memory bus, the 
8089 
reads 
two 
bytes 
from 
the 
peripheral, 
assembles the bytes into a 16-bit word and then 
writes the single word to the addressed memory 
location. Also, both 8- and 16-bit peripherals can 
reside on the same (l6-bit) bus; byte transfers are 
performed 
with the 8-bit peripheral, 
and word 
transfers 
are 
performed 
with 
the 
16-bit 
peripheral. 


The 8089 can be implemented 
in one of two 


system configurations: 
a "local" 
mode in which 


the 8089 shares the system bus with an 8086 or 
8088 CPU and a "remote" 
mode in which the 


8089 has exclusive access to its own dedicated bus 
as well as access to the system bus. Note that in 
either the local or remote mode, 
the 8089 can 


address a full megabyte of system memory and 
64k bytes of I/O space. 


In the local mode, the 8089 acts as a slave to an 
8086 or 8088 CPU that is operating in the max- 
imum 
mode. 
In 
this configuration, 
the 
8089 


shares 
the 
system 
address 
latches, 
data 


transceivers and bus controller with the CPU as 
shown in figure 4-23. 


Since the lOP and CPU share the system bus, 
either the lOP or the CPU will have access to the 
bus at anyone 
time. When one processor is using 


the 
bus, 
the 
other 
processor 
floats 
its 


address/data 
and 
control 
lines. 
Bus 
access 


between the lOP and CPU is determined through 
the request/grant 
function. 
Recalling the CPU's 


request/grant 
sequence, the lOP requests the bus 


from the CPU, the CPU grants the bus to the 
lOP, and the lOP relinquishes the bus to the CPU 
when its operation 
is complete. 
Remember that 


the CPU cannot 
request the bus from the lOP 


(the CPU is only capable of granting the bus and 
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must wait for the lOP to release the bus). Also, 
since the request/grant 
pulse exchange must be 


synchronized, 
both the CPU and lOP must be 


referenced to the same clock signal. 


The 8089 lOP, when used in the local mode, can 
be added to an 8086 or 8088 maximum mode con- 
figuration with little affect on component 
count 


(channel attention decoding logic as required) and 
offers 
the 
benefits 
of 
intelligent 
DMA 


(scan/match, 
translate, variable termination con- 


ditions), 
modular 
programming 
in 
a 
full 


megabyte of memory address space and a set of 
optimized I/O instructions that are unavailable to 
the 8086 and 8088 CPUs. The major disadvantage 
to the local configuration 
is that since the system 


bus 
is shared, 
bus 
contention 
always 
exists 


between the CPU and lOP. The use of the bus 
load limit field in the channel control word can 
help reduce lOP bus access during task block pro- 
gram execution (bus load limiting has no affect on 
DMA 
transfers) 
although, 
for 
I/O 
intensive 


systems, the remote mode should be considered. 


The 8089, when used in the remote mode, pro- 
vides a multiprocessor 
system with true parallel 
processing. In this mode, the 8089 has a separate 
(local) bus and memory for I/O peripheral com- 
munications, 
and the system bus is completely 


isolated from the I/O peripheral(s). 
Accordingly, 


I/O transfers between an I/O peripheral and the 
lOP's 
local memory 
can occur simultaneously 


with CPU operations on the system bus. 


As shown in figure 4-24, to interface the 8089 to 
the system bus, data 
transceivers 
and address 


latches are used to separate the lOP's 
local bus 


from the system bus, an 8288 Bus Controller 
is 


used to generate the bus control signals for both 
the local and system buses as well as to govern the 
operation of the transceivers/latches, 
and an 8289 


Bus Arbiter is used to control access to the system 
bus (each processor in the system would have an 
associated 
8289 Bus Arbiter). 
To interface 
the 


8089 to its local bus, 
another 
set of address 
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latches is required (unless MCS-85™ multiplexed 
address 
components 
are exclusively interfaced) 
and, depending on the bus loading demands, one 
(8-bit bus) or two (l6-bit 
bus) data transceivers 


would be used. 


In the remote mode, the lOP's local bus is treated 
as I/O space (up to 64k bytes), and the system bus 
is treated 
as memory 
space (l megabyte). 
The 


8288 Bus Controller's 
I/O command outputs con- 


trol the local (I/O) bus, and its memory command 
outputs 
control 
the system (memory) bus. The 
8289 Bus Arbiter, 
which is operated 
in its lOB 


(I/O 
~ripheral 
bus) 
mode, 
also 
decodes 
the 


lOP's S2 through SOstatus outputs. In this mode, 
the 8289 will not request the multimaster 
system 


bus when the lOP indicates an operation 
on its 
local bus. If the lOP's 
bus arbiter currently has 


access to the system bus, the CPU's 
arbiter (or 


any other arbiter in the system) can acquire use of 
the system bus at this time (a bus arbiter main- 
tains bus access until another arbiter requests the 
bus). 


The 8089 utilizes the same bus structure 
as an 


8086 or 8088 CPU that is configured in the max- 
imum mode and performs a bus cycle only on de- 
mand (e.g., to fetch an instruction 
during task 


block execution or to perform 
a data transfer). 


The bus cycle itself is identical to an 8086 or 8088 
CPU's bus cycle in that all cycles consist of four 
T-states 
and 
use 
the 
same 
time-multiplexing 


technique of the addressdata 
lines. As shown in 


the following timing diagrams, 
the address (and 


ALE signal) is output during state T I for either a 
read or write cycle. Depending 
on the type of 


cycle indicated, the address/data 
lines are floated 


during state T2 for a read cycle (figure 4-25) or 
data is output on these lines during a write cycle 
(figure 4-26). During state T3, write data is main- 
tained or read data is sampled, and the busy cycle 
is concluded in state T4' 


Since the 8089 is capable of transferring data to or 
from both 8-bit and 16-bit buses, when an 8-bit 
physical bus is specified (bus width is specified 
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present on the AD IS through AD8 address/data 
lines is maintained 
for the entire bus cycle as 
shown in figure 4-25 and, 
unless added 
drive 
capability is required, the associated address latch 
can be eliminated. An 8-bit data bus is compatible 
with the 8088 CPU 
and 
with 
the MCS-85™ 


multiplexed 
address 
peripherals 
(8155, 
8185, 
etc.). 


with respect to the use of the low- and hlgh-oraer 
halves of the data bus. Table 4-14 defines the data 
bus use for the various combinations of bus width 
and address boundary. 


The S2 through SOstatus lines define the bus cycle 
to be performed. 
These lines are used by an 8288 


Bus Controller 
to generate all memory and 1/0 


command 
and control signals, and are decoded 


according to table 4-15. 


Physical Bus Width' 


logical 
Address 
--=- 
BusWidth' 
Boundary 
16 
8 
Byte Transfer 
Word Transfer 


Even 
AD7-ADO= DATA 
AD7-ADO= DATA 
N/A 
(SHE not used) 
(SHE high) 
8 
I 


Odd 
AD7-ADO= DATA 
AD15-AD8 = DATA 
N/A 
(SHE not used) 
(SHE low) 


- 


AD7-ADO= DATA 
AD15-ADO= DATA 
Even 
Illegal 
(SHE high) 
(SHE low) 
16 


Odd 
Illegal 
AD15-AD8= DATA 
N/A' 
(SHE low) 


Notes: 


1. 
logical 
bus width is specified 
by the WID instruction 
prior to the DMA transfer. 


2. 
Physical bus width is specified when the 8089is initialized. 


3. 
A word transfer 
to or from an odd boundary 
is performed 
as two byte transfers. 
The first byte trans- 
ferred 
is the low-order 
byte on the high-order 
data bus (AD15-AD8), and the -second byte is the high- 
order byte on the low-order 
data bus (AD7-ADO). The 8089 automatically 
assembles 
the two bytes in 
their proper order. 


Status Output 
Bus Cycle Indicated 
Bus Controller 


52 
51 
SO 
Command Output 


0 
0 
0 
Instruction fetch from I/O space 
INTA 
0 
0 
1 
Data read from I/O space 
10RC 
0 
1 
0 
Data write to I/O space 
10WC,AIOWC 
0 
1 
1 
Not used 
None 
1 
0 
0 
Instruction 
fetch from system memory 
MRDC 
1 
0 
1 
Data read from system memory 
MRDC 
1 
1 
0 
Data write to system memory 
q' 
MWTC,AMWC 
1 
1 
1 
Passive 
None 


Note that the 8089 indicates an instruction 
fetch 
from 110 space as a status of zero (S2, SI and SO 
equal 0). Since the 8288 Bus Controller 
decodes 
an input 
status 
value of zero as an interrupt 
acknowledge bus cycle, the bus controller's 
INTA 
output must be OR'ed with its IORC output to 
permit fetching of task block instructions 
from 
local 8089 memory 
(remote 
configuration) 
or 
system 
I/O 
space 
(local 
and 
remote 
configurations). 


The S2 through SO status lines become active in 
state T4 if a subsequent 
bus cycle is to be per- 
formed. These lines are set to the passive state (all 
"ones") 
in the state immediately prior to state T4 
of the current bus cycle (state T3 or Tw) and are 
floated when the 8089 does not have access to the 
bus. 


The S6 through 
S3 status lines are multiplexed 
with the high-order 
address bits (AI9-AI6) 
and, 
accordingly, become valid in state T2 of the bus 
cycle. The S4 and S3 status lines reflect the type of 
bus cycle being performed 
on the corresponding 
channel as indicated in table 4-16. 


Status Output 
Type of Cycle 
S4 
S3 


0 
0 
DMA on Channel 1 
0 
1 
DMA on Channel2 
1 
0 
Non-DMA on Channel 1 
1 
1 
Non-DMA on Channel 2 


The S6 and S5 status lines are always" 
I" on the 
8089. Since these lines are not both "1" 
on the 
other processors in the 8086 family (S6 is always 
"0" 
on the 8086 and 8088 CPUs), 
these status 
lines 
can 
be 
used 
as 
a 
"signature" 
in 
a 
multiprocessor 
environment 
to identify the type 
of processor performing the bus cycle. 


The 8089 includes the same provision as do the 
8086 and 8088 CPUs for the insertion 
of wait 
states (Tw) in a bus cycle when the associated 
memory or 110 device cannot respond within the 
alloted time interval or when, in the remote mode, 
the 8089 must wait for access to the system bus. 
An 8284 Clock Generator/Driver 
is used to con- 
trol the insertion 
of wait states 
which, 
when 
required, are inserted between states T3 and T4. 
The actual insertion of wait states is accomplished 
by deactivating 
one of the 8284's RDY inputs 


(RDYI or RDY2). Either of these inputs, when 
enabled 
by its corresponding 
AENI 
or AEN2 
input, can be deactivated directly by the memory 
or I/O device when it must extend the 8089's bus 
cycle (when the addressed device is not ready to 
present or accept data). The 8284's READY out- 
put, which is synchronized to the CLK signal, is 
directly connected to the 8089's READY input. 
As shown in figure 4-27, when the addressed 
device requires 
one or more wait states to be 
inserted into a bus cycle, it deactivates the 8284's 
RDY input prior to the end of state T2. The 
READY output 
from the 8284 is subsequently 
deactivated at the end of state T2 which causes the 
8089 to insert wait states following state T3. To 
exit the wait state, the device activates the 8284's 
RDY input which causes the READY input to the 
8089 to go active on the next clock cycle and 
allows the 8089 to enter state T4. 
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Periods 
of 
inactivity 
can 
occur 
between 
bus 
cycles. These inactive periods are referred to as 
idle states (TI) and, as with the 8086 and 8088 
CPUs, can result from the execution of a "long" 
instruction or the loss of the bus to another pro- 
cessor during task block instruction 
execution. 
Additionally, 
the 8089 can experience idle states 
when it is in the DMA mode and it is waiting for a 
DMA request from the addressed 
110 device or 
when the bus load limit (BLL) function is enabled 
for a channel performing 
task block instruction 
execution and the other channel is idle. 


Initialization 
of the lOP is generally the respon- 
sibility of the host processor which, as stated in 
Chapter 
3, prepares 
the communications 
data 
structure in shared memory. Initialization 
of the 
lOP itself begins with the activation of its RESET 
input. This input (originating 
typically from an 


8284 Clock Generator/Driver) 
must be held active 


for at least five clock cycles to allow the 8089's 
internal reset sequence to be completed. Note that 
like the 8086 and 8088 CPUs, the RESET input 
must be held active for at least 50 microseconds 
when power is first applied. Following the reset 
interval, 
the host processor 
signals the lOP to 


begin its initialization 
sequence by activating the 


8089's CA (Channel Attention) 
input. The 8089 


will not recognize a pulse at its CA input until one 
clock cycle after the RESET input returns to an 
inactive level. Note that the minimum width for a 
CA pulse is one clock cycle and that this pulse 
may go active prior to RESET returning 
to an 


inactive level provided 
that the negative-going, 


trailing-edge of the CA pulse does not occur prior 
to one clock cycle after 
RESET goes inactive. 
Figure 4-28 illustrates the timing for this portion 
of the initialization sequence. 
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Coincident 
with the trailing 
edge of the first 


CA pulse following reset, the 8089 samples its 
SEL (Select) input 
from the host processor 
to 


determine 
master/slave 
status 
for 
its 
request/grant 
circuity. If the SEL input is low, 
the 8089 is designated a "master," 
and if the SEL 


input is high, the 8089 is designated a "slave." 
As 


a master, 
the 8089 assumes that it has the bus 
initially, and it will subsequently grant the bus to 
a requesting slave when the bus becomes available 
(i.e., the 8089 will respond to a "request" 
pulse 
on its RQ/GT line with a "grant" 
pulse). A single 


8089 in the remote configuration 
(or one of two 


8089s 
in 
a 
remote 
configuration) 
would 
be 
designated a master. As a slave, the 8089 can only 
request the bus from a master processor (Le., the 
8089 initiates the request/grant 
sequence by out- 
putting a "request" 
pulse on its RQ/GT line). An 


8089 that shares a bus with an 8086 or 8088 (or 
one of two 8089s in a remote 
configuration) 


would be designated a slave. Note that since the 
8086 and 8088 CPUs can grant the bus only in 
response to a request, whenever an 8086 or 8088 


and an 8089 share a common bus, the 8089 must 
be designated the slave. Also, when the RQ/GT 
line is not used (i.e., a single 8089 in the remote 
configuration), 
the 8089 must 
be designated 
a 


master. 


In addition 
to determining 
master/slave 
status, 


the CA pulse also causes the 8089 to begin execu- 
tion of its internal ROM initialization 
sequence. 


Note that since the 8089 must have access to the 
system bus in order to perform this sequence, the 
8089 
immediately 
initiates 
a 
request/grant 
sequence (if designated a slav.e) and, if required, 
then requests the bus through 
the 8289 Arbiter. 


(If designated a master, the 8089 requests the bus 
through the 8289 Arbiter.) In the execution of the 
initialization 
sequence, the 8089 first fetches the 
SYSBUS byte from location FFFF6H. The W bit 
(bit 0) of this byte specifies the physical 
bus width 


of the system 
bus. Depending on the bus width 
specified, the 8089 then fetches the address of the 
system configuration 
lrlock (SCB) contained 
in 


locations FFFF8H through FFFFBH in either two 
bus cycles (16-bit bus, W bit equal I) or four bus 
cycles (8-bit bus, W bit equal 0). The SCB offset 
and segment address values fetched are combined 
into a 20-bit physical address that is stored in an 
internal register. Using this address, the 8089 next 
fetches the system operation 
command 
(SOC) 


byte. 
As 
explained 
in 
Chapter 
3, 
this 
byte 
specifies both the request/grant 
operational mode 


(R bit) and the physical 
width of the I/O bus (I 


bit). After reading the SOC byte, the 8089 fetches 
the channel control 
block (CB) offset and seg- 


ment address values. These values are combined 
into a 20-bit physical address and are stored in 
another internal register. To inform the host CPU 
that it has completed the initialization 
sequence, 


the 8089 clears the Channel 
1 Busy flag in the 


channel control block by writing an all "zeroes" 
byte to CB + I. 


After the lOP has been initialized, 
the system 


configuration block may be altered in order to in- 
itialize another 
lOP. Once an lOP has been in- 


itialized, 
its channel 
control 
block 
in system 


memory cannot be moved since the CB address, 
which is internally stored by the lOP during the 
initialization 
sequence, is automatically 
accessed 


on every subsequent CA pulse. 


As previously stated, 
the generation 
of the CA 
and SEL inputs to the lOP are the responsibility 
of the host CPU. Typically, these signals result 
from 
the 
CPU's 
execution 
of 
an 
I/O 
write 


instruction to one of two adjacent I/O ports (I/O 
port addresses that only differ by AO). Figure 4-29 
illustrates a simple decoding circuit that could be 
used to generate the CA and SEL signals. Note 
that by qualifying the CA output with 10WC, the 
SEL output, since it is latched for the entire I/O 
bus cycle, is guaranteed to be stable on the trailing 
edge of the CA pulse. 


~ 


Ao ------------ 
•• 
SEL 


During normal 
operation, 
the I/O 
supervisory 


program running in the host CPU will receive a 
request to perform 
a specific I/O operation 
on 


one of the 8089's channels. 
In response to this 


request, 
the supervisory 
program 
will typically 


perform the following sequence of operations: 


• 
Check 
the 
availability 
of 
the 
specified 


channel by examining the channel's busy flag 
in the Channel Control Block. If it is possible 
for another processor to access the channel, a 
semaphore 
operation 
(implemented 
by 
a 


locked XCHG instruction) 
is used to check 


channel availability. 


• 
Load the variable 
parameters 
required 
for 


the intended 
operation 
into the channel's 


parameter block. 


• 
Load the channel 
command 
word (CCW) 
into the channel control block. 


• 
Establish 
the necessary linkages by writing 


the starting address of the channel program 
(task block) in the first four bytes of the 


parameter 
block and writing the address of 


, 
the parameter 
block in the channel control 


block. 


• 
Issue 
a 
channel 
attention 
(CA) 
to 
the 


specified channel. 


In response to the CA, the 8089 interrupts 
any 


current activity at its first opportunity 
(see "Con- 


current Channel Operation" 
in section 3.2) and 


begins 
execution 
of 
an 
internal 
instruction 


sequence that fetches and decodes the channel 
command 
word (CCW) and then performs 
the 


operation 
indicated (i.e., start, halt or continue 


channel program execution). 


If the CCW specifies start channel program (start 
task 
block 
execution), 
the 
address 
of 
the 


parameter 
block 
is fetched 
from 
the channel 


control block, 
the address of the first channel 
program instruction 
(contained 
in the first four 


bytes of the parameter block) is fetched and then 
loaded into the TP (task pointer) 
register and, 


finally, 
task block 
execution 
is initiated 
from 


either system or I/O space. Task block execution 
continues, 
subject to the activity on the other 


channel 
as described 
in "Concurrent 
Channel 


Operation," 
until 
a 
XFER 
instruction 
is 


executed. Following execution of this instruction, 
the next sequential channel program instruction is 
executed 
before 
the channel 
enters 
the DMA 


transfer mode. 


If the CCW specifies halt channel, 
the current 


operation on the specified channel is halted. If the 
channel is performing task block execution (either 
chained 
or not chained), 
channel 
operation 
is 
stopped at an instruction 
boundary, 
and if the 
channel is performing 
a DMA transfer, 
channel 


operation 
is stopped 
at a DMA transfer 
cycle 


boundary. 
Note that a channel will not stop a 


locked DMA transfer until the operation is com- 
pleted. There are two unique halt channel com- 
mands. One command 
simply halts the channel 


and clears the busy flag in the channel control 
block. This command 
is used when the halted 


operation is to be discarded. The other command 
halts the channel, saves the task pointer and pro- 
gram status word (PSW) byte, and clears the busy 
flag. This command 
is used when the halted 


operation 
is to be resumed. 
Note that this halt 


command will not affect the integrity of resumed 
task block execution 
or a memory-to-memory 


DMA transfer, but could affect the integrity of a 
synchronized 
DMA 
transfer 
(a DMA 
request 


occuring 
while the channel 
is halted 
could be 


missed). 


(and 
the 
busy 
flag 
is set). 
Since 
this 
command 


restores 
the task 
pointer 
and 
PSW, 
it should 
be 


used only if the task pointer 
and PSW have been 


saved by a previous 
halt command. 


Table 
4-17 outlines 
the various 
CCW 
command 


execution 
times. 
Note 
that 
the times 
listed 
in the 


table 
for the halt 
commands 
do not 
include 
the 


time 
required 
to 
complete 
any 
current 
channel 


activity 
when 
the 
channel 
attention 
is received 


(completion 
of the current 
DMA 
transfer 
cycle or 


task block instruction). 


The number 
of bytes 
transferred 
during 
a single 
DMA 
cycle is determined 
by both 
the source 
and 


destination 
logical 
bus 
widths 
as wel1 as by the 


bus widths 
by assembling 
bytes 
or disassembling 
words 
in its internal 
assembly 
Jfegister 
file. 
As 


explained 
in Chapter 
3, 
the 
DMA 
source 
and 


destination 
bus widths 
are defined 
by the execu- 


tion 
of 
a 
WID 
instruction 
during 
task 
block 


(channel 
command) 
execution. 
Note 
that 
the bus 
widths 
specified 
remain 
in force until changed 
by 


a subsequent 
WID instruction. 
Table 
4-18 defines 


the 
various 
byte 
(B) 
and 
word 
(W) 
source/ 
destination 
transfer 
combinations 
based 


on address 
boundary 
and bus Width specified. 


The 8089 additional1y 
optimizes 
bus accesses 
dur- 


ing 
transfers 
between 
dissimilar 
bus 
widths 


whenever 
possible. 
When 
either 
the 
source 
or 


destination 
is 
a 
16-bit 
memory 
bus 
(auto- 


incrementing) 
that 
is initial1y 
aligned 
on an odd 


CCWCommand 
Minimum Time" 
Maximum Time"" 


CA NOP 
48 + 2n clocks 
48 + 2n clocks 


CA Halt (no save) 
48 + 2n clocks 
48 + 2n clocks 
CA Halt (with save) 
94 + 5n clocks 
100 + 6n clocks 
CA Start (memory) 
108 + 6n clocks 
124 + 10n clocks 


CA Start 
(1/0) 
96 + 5n clocks 
108 + 8n clocks 
CAContinue 
95 + 5n clocks 
103 + 6n clocks 


Notes: 
n 
is the number 
of wait states 
per bus cycle. 


• 
Minimum time occurs 
when both the channel 
control 
block and parameter 
block addresses 
are aligned 
on 
an even address 
boundary 
and a 16-bit bus is used. 


Maximum time occurs 
when 
both the channel 
control 
block and parameter 
block addresses 
are aligned 
on an odd address 
boundary 
on a 16-bit bus or when an 8-bit bus is used. 


Logical Bus Width 
Address 
Boundary 
(Source 
-+ Destination) 


(Source 
-+ Destination) 
8-+8 
8 -+ 16 
16 -+ 8 
16 -+ 16 


Even -+ Even 
B-+B 
BIB -+ W 
W -+ BIB 
W-+W 


Even -+ Odd 
B-+B 
B-+B 
W -+ BIB 
W -+ BIB 


Odd -+ Even 
B-+B 
BIB -+ W 
B-+B 
BIB -+ W 


Odd -+ Odd 
B-+B 
B-+B 
B-+B 
B-+B 


address boundary (causing the first transfer cycle 
to be byte-to-byte), 
following the first transfer 


cycle, the memory address will be aligned on an 
even address boundary, 
and word transfers will 
subsequently occur. For example, when perform- 
ing a memory-to-port 
transfer from a 16-bit bus 
to an 8-bit bus with the source beginning on an 
odd address boundary, 
the first transfer cycle will 
be byte-to-byte (B -+ B) as indicated in table 4-18, 
but 
subsequent 
transfers 
will 
be 
word-to- 
byte/byte (W -+ B/B). 


All DMA transfer 
cycles consist of at least two 
bus cycles; one bus cycle to fetch (read) the data 
form the source into the lOP, and one bus cycle 
to store (write) the data previously fetched from 
the lOP 
into the destination. 
Note that in all 


transfers, 
the data 
passes through 
the lOP 
to 


allow mask/compare 
and translate operations to 
be optionally 
performed 
during the transfer 
as 


well as to allow the data 
to be assembled 
or 
disassembled. 


The lOP performs DMA transfers in one of three 
modes: unsynchronized, 
source synchronized 
or 


destination 
synchronized 
(the transfer 
mode is 


specified in the channel control register). The un- 
synchronized mode is used when both the source 
and destination devices do not provide a data re- 
quest (DRQ) signal to the lOP as in the case of a 
memory-to-memory 
transfer. In the synchronized 


transfer modes, the source (source synchronized) 
or destination 
(destination 
synchronized) 
device 


initiates the transfer cycle by activating the lOP's 
DRQI 
(channel 
I) or DRQ2 (channel 2) input. 


The DRQ 
input 
is asynchronous 
and 
usually 


originates from an I/O device controller 
rather 
than from a memory circuit. This input is latched 
on the positive 
transition 
of the clock (CLK) 


signal and therefore must remain active for more 
than 
one 
clock 
period 
(more 
than 
200 


nanoseconds when using a 5 MHz clock) in order 
to guarantee that it is recognized. 


During state T I of the associated fetch bus cycle 
(source synchronized) or store bus cycle (destina- 
tion synchronized), 
the lOP outputs the address 


of the I/O device (the port address). This address 
must 
be 
decoded 
(by 
external 
circuitry) 
to 


generate the DMA acknowledge 
(DACK) signal 
to the I/O controller as the response to the con- 
troller's 
DMA request. 
An I/O 
controller 
will 
typically use DACK as a conditional input for the 
removal of DRQ. (After receipt of the DACK 
signal, 
most 
Intel 
peripheral 
controllers 
deac- 


tivate DRQ following receipt of the correspon- 
ding read or write signal.) Figures 4-30 and 4-31 
illustrate the DRQ/DACK 
timing for both source 


synchronized (i.e., port-to-memory) 
and destina- 
tion 
synchronized 
(i.e., 
memory-to-port) 


transfers. 


Table 4-19 defines the DMA transfer 
cycles in 
terms of the number of bus and clock cycles re- 
quired. Note that the number of clocks required 
to complete a transfer cycle does not take into ac- 
count the effects of possible concurrent 
opera- 


tions on the other channel or wait states within 
any of the bus cycles. 
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TRANSFER CYCLE 
--FETCH 
8USCYClE-_I __ 
STOREBUSCYClE-- 


11 
I 
12 
I 
T3 
I 
~ 
11 
I 
12 
I 
T3 
I 
~ 


(DECODED 110 AOD~~~~) --l VALID 110 ADDRESS PRESENT \ 
_ 


NOTES: 


1. INDICATES 
THE 
NUMBER 
OF 
IDLE 
CLOCK 
CYCLES 
INSERTED 
BEFORE 
THE 
NEXT 


TRANSFER CYCLE BEGINS. IF ORO 1$ RECEIVED PRIOR TO STATE 14 OF THE CURRENT 
FETCH 
CYCLE, 
THE 
NEXT 
FETCH 
CYCLE 
BEGINS 
IMMEDIATELY 
FOLLOWING 
THE 
CURRENT STORE CYCLE. 


2. IF THE 8089 IS IDLE WHEN 
ORO IS RECOGNIZED, 
FIVE IDLE CLOCK 
CYCLES OCCUR 
BEFORE THE ASSOCIATED TRANSFER CYCLE IS INITIATED. 


(DECODED 110 ADD~:i~ 
~I VALID 110 ADDRESS PRESENT \ 


NOTES: 
1. FIRST DMA FETCH CYCLE 
OCCURS IMMEDIATELY 
ARER 
THE lAST 
TASK BLOCK 
INSTRUCTION IS EXECUTED. 


2. FETCH BUS CYCLE 2 BEGINS IMMEDIATELY FOllOWING 
STORE BUS CYCLE 1. 


3. INDICATES THE NUMBER 
OF IDLE CLOCK CYCLES INSERTED 
BEFORE STORE BUS 
CYCLE 2 BEGINS. IF ORO IS RECEIVED PRIOR TO STATE T4 OF STORE BUS CYCLE 1, 
STORE BUS CYCLE 2 BEGINS IMMEDIATELY FOLLOWING 
FETCH BUS CYCLE 2. 


4. IF THE 1089 IS IDLE WHEN DRO IS RECOGNIZED, 
FIVE IDLE CLOCK CYCLES OCCUR 
BEFORE THE ASSOCIATED STORE BUS CYCLE IS INITIATED. 


Transfer Mode 
logical Bus Width 
Unsynchronized 
Source Synchronized 
Destination Synchronized 


Source Destination 
Bus Cycles 
TotaP 
Bus Cycles 
Total' 
Bus Cycles 
Total' 


Required 
Clocks 
Required 
Clocks 
Required 
Clocks 


8 
8 
2 (1 fetch, 1 store) 
8' 
2 (1 fetch, 1 store) 
8' 
2 (1 fetch, 1 store) 
8' 


8 
16' 
3 (2 fetch, 1 store) 
12 
3 (2 fetch, 1 store) 
16' 
3 (2 fetch, 1 store) 
12 


16' 
8 
3 (1 fetch, 2 store) 
12 
3(1 fetch, 2 store) 
12 
3 (1 fetch, 2 store) 
16' 


16' 
16' 
2 (1 fetch, 1 store) 
8 
2 (1 fetch, 1 store) 
8 
2 (1 fetch, 1 store) 
8 


Notes: 
1. 
The "Total Clocks Required" does not include wait states. One clock cycle per wait state must be 
added to each fetch and/or store bus cycle in which a wait state is inserted. When performing a 
memory-to-memory transfer, three additional clocks must be added to the total clocks required (the 
first fetch cycle of any memory-to-memory transfer requires seven clock cycles). 


2. 
When performing a translate operation, one additional 7-clock bus cycle must be added to the values 
specified in the table. 


3. 
Word transfers in the table assume an even address word boundary. Word transfers to or from odd 
address boundaries are performed as indicated in table 4-18and are subject to the bus cycle/clock 
requirements for byte-to-byte transfers. 


4. 
Transfer cycles that include two synchronized bus cycles (i.e., synchronous transfers between 
dissimilar logical bus widths) insert four idle clock cycles between the two synchronized bus cycles 
to allow additional time for the synchronzing device to remove its initial DMA request. 


DACK latency is defined as the time required for 
the 
8089 to 
acknowledge, 
by 
outputting 
the 


device's 
corresponding 
port 
address, 
a DMA 


request at its DRQ input. This response latency is 
dependent on a number of factors including the 
transfer 
cycle being performed, 
activity on the 


other channel, memory address boundaries, 
wait 


states present in either bus cycle and bus arbitra- 
tion times. 


Generally, 
when the other channel 
is idle, the 


maximum DACK latency is five clock cycles (I 
microsecond at 5 MHz), excluding wait states and 
bus arbitration 
times. An exception occurs when 


performing 
a word transfer 
to or from an odd 


memory address boundary. 
This operation, 
since 


two store 
(source 
synchronized) 
or two fetch 


(destination synchronized) bus cycles are required 
to access memory, has a maximum possible laten- 
cy of nine clock cycles. When the other channel is 
performing 
DMA 
transfers 
of 
equal 
priority 
("P" 
bits equal), interleaving occurs at bus cycle 


boundaries, 
and the maximum latency is either 
nine clock cycles when the other channel is per- 
forming a normal 4-clock fetch or store bus cycle 
or twelve clock cycles when the other channel is 
performing the first fetch cycle of a memory-to- 
memory transfer. If the other channel is perform- 
ing "chained" 
task block instruction execution of 


equal priority, maximum latency can be as high as 
12 clock cycles (channel 
command 
instruction 


execution is interrupted 
at machine cycle boun- 
daries 
which 
range 
from 
two to 
eight clock 


cycles). 


As stated in Chapter 
3, a channel can exit the 
DMA transfer 
mode (and return to task block 


execution) 
on any of the following 
terminate 


conditions: 


• 
Single cycle transfer 


• 
Byte count expired 


• 
Mask/compare 
match or mismatch 


• 
External event 


The terminate 
conditions 
are specified 
by in- 


dividual 
fields in the channel 
control 
register. 
More 
than 
one 
terminate 
condition 
can 
be 
specified for a transfer (e.g., a transfer can be ter- 
minated when a specific byte count is reached or 
on the occurrence 
of an external event). When 


more than one terminate 
condition 
is possible, 


displacements 
(which 
are 
added 
to 
the 
task 
pointer register value) are specified to cause task 
block execution to resume at a unique entry point 
for 
each condition. 
Three 
reentry 
points 
are 
available: TP, TP + 4 and TP + 8. The time inter- 
val between the occurrence of a terminate condi- 
tion and the resumption of task block execution is 
12 clock cycles for reentry point TP and 15 clock 
cycles for reentry points TP + 4 and TP + 8. 


When interfacing a peripheral to an 8-bit physical 
data bus, the 8089 uses only the lower half of the 
address/data 
lines (AD7-ADO) as the bidirec- 


tional data bus, and the upper half of the ad- 
dress/ data 
lines (AD 15-AD8) maintain 
address 


information for the entire bus cycle. Consequent- 
ly, with this bus configuration, 
only one octal 
latch (e.g., an Intel® 8282/83 Octal Latch) is re- 
quired 
since only 
the 
lower 
half 
of 
the 
ad- 


dress/data 
lines is time-multiplexed 
(unless the 
address bus requires the increased current drive 
capability and capacitive load immunity provided 
by the latch). 


When interfacing 
a peripheral 
to a 16-bit data 
bus, both the lower and upper halves of the ad- 
dress/data 
lines are time-multipelxed, 
and two oc- 


tal latches are required. Note that unlike the 8086 
and 8088 CPUs, the 8089 does not time-multiplex 
BHE (this signal is valid for the entire bus cycle). 
Both 8- and 16-bit peripherals can be interfaced to 
a 16-bit bus. An 8-bit peripheral can be connected 
to either the upper or lower half of the bus. An 8- 
bit peripheral on the lower half of the bus must 
use an even source/destination 
address, and an 8- 
bit peripheral on the upper half of the bus must 
use an odd source/destination 
address. To take 


advantage of word transfers, 
a 16-bit peripheral 
must use an even source/destination 
address. 


To 
prepare 
a peripheral 
device 
for 
a 
DMA 
transfer, command and parameter data is written 
to the device's 
command/status 
port. 
This is 


usually accomplished 
using pointer register GC. 


Recalling that the 8089 executes one additional 
task block instruction 
following execution of the 
XFER instruction 
(the XFER instruction 
causes 


the 8089 to enter the DMA mode), this additional 
instruction is used to access the command port of 
an I/O 
device that 
immediately 
begins 
DMA 


operation 
on receipt or me lasl 
COffilllClIlU 
\lllC 
8271 Floppy 
Disk Controller 
begins its DMA 


transfer 
on 
receipt 
of 
the' 
last 
command 


parameter). 
Since a translate DMA operation re- 


quires the use of all three pointer registers (OA 
and OB specify the source and destination 
ad- 


dresses; 
OC specifies the base address 
of the 


translation 
table), when it is necessary to use the 


last task block instruction 
to start 
the device, 


command 
port 
access 
can 
be 
accomplished 


relative to one of the pointer registers or relative 
to the PP register. If the device's data port ad- 
dress (OA or OB) is below the device's command 
port 
address, 
either 
an offset 
or an 
indexed 


reference can be used to access the command 
port. 


A peripheral's 
(or peripheral 
controller's) 
DMA 


communication 
protocol 
with 
the 
8089 is as 
follows: 


• 
The peripheral 
(when source or destination 


synchronized) initiates a DMA transfer cycle 
by activating the 8089's DRQ (DMA request) 
input. 


• 
The 
8089 
acknowledges 
the 
request 
by 
placing the peripheral's 
assigned data port 


address on the bus during state T I of the cor- 
responding 
fetch 
(source synchronized) 
or 


store (destination 
synchronized) 
bus cycle. 


The peripheral 
is responsible 
for decoding 


this 
address 
as 
the 
DMA 
acknowledge 


(DACK) to its request. 


• 
The 
data 
is 
transferred 
between 
the 


peripheral 
and 
the 
8089 during 
the 
T2 


through 
T4 state interval of the bus cycle. 


The peripheral must remove its DMA request 
during this interval. 


• 
The peripheral, when ready, requests another 
DMA transfer cycle by again activating the 
DRQ 
input, 
and 
the 
above 
sequence 
is 
repeated. 


• 
The peripheral 
can, as an option, 
end the 
DMA transfer by activating the 8089's EXT 
(external terminate) input. 


The 8089 can support mulitple peripheral devices 
on a single channel provided that only one device 
is in the active transfer mode at anyone 
time. To 


interface 
multiple 
devices, 
the 
DMA 
request 


(DRQ) lines are OR'ed together as are the exter- 
nal terminate (EXT) lines. Unique port addresses 
are, however, assigned to each device so that an 
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ed to only the active device. DACK decoding can 
be accomplished 
with an Intel 
® 8205 Binary 


Decoder or a ROM circuit. Note that the 8089 can 
only determine which device has requested service 
or terminated 
by the context of the task block 


program. 


Most peripheral devices interfaced to the 8089 will 
use 
the 
decoded 
DMA 
acknowledge 
signal 
(DACK) as the "chip 
select" 
input. 
Peripheral 


devices that do not follow this convention 
must 


use DACK as a conditional input of chip select. 


While most interrupts 
associated 
with the 8089 
will be DMA requests or external terminates, non- 
DMA 
related 
interrupts 
can 
additionally 
be 
supported. 


One technique that would be used when an 8089 is 
the local configuration 
(or when an 8086 or 8088 


and an 8089 are locally connected 
as a remote 


module) is to allow the CPU to accept the inter- 
rupt and then direct the 8089 to the interrupt ser- 
vice routine. 
Another 
technique 
is to allow the 


8089 to "poll" 
the device to determine when an 


interrupt 
has 
occurred 
(most 
peripheral 
con- 


trollers have an interrupt pending bit in a status 
word). 
The 8089's 
bit testing 
instructions 
are 
ideally suited for polling. 


When the 8089 is in a remote configuration, 
non- 


DMA related interrupts can be supported with the 
addition 
of 
an 
Intel® 
8259A 
Programmable 


Interrupt 
Controller. 
Systems that require 
this 


type of interrupt structure would dedicate one of 
the 8089's channels 
to interrupt 
servicing. 
In 
implementing this structure, 
the interrupt output 


from the 8259A is directly connected to the chan- 
nel's external 
terminate 
(EXT) input, 
and the 


channel's DMA request (DRQ) input is not used. 
A task block program is initially executed to per- 
form a source-synchronized 
DMA transfer (with 


an external terminate) on the "interrupt" 
channel 


to "arm" 
the interrupt 
mechanism. 
Since the 


DRQ input is not used, when the channel enters 
the DMA transfer mode, the channel idles while 
waiting for the first DMA request (which never 
occurs). The other channel, 
since the interrupt 


channel is idle, operates at maximum throughput. 
When an interrupt 
occurs, the "pseudo" 
DMA 


transfer 
is immediately 
terminated, 
and 
task 


block instruction execution is resumed. The task 
block program would write a "poll" 
command to 


the 8259A's command 
port and then read the 


IJ 
A ft"... •..•."-"V""."'-.'"~--_· 
nm,,'li1~ -llI~U 
w:uon 
oacK Into 
ItS associated 
interrupt (the device is identified by a 3-bit binary 
assembly language equivalent (table 4-26). 
number in the associated data byte). The device 
number read would be used by the task block pro- 
gram as a vector into a jump table for the device's 
interrupt service routine. Pertinent interrupt data 
could be written into the associated 
parameter 
block for subsequent 
examination 
by the host 
processor. 


The interrupt 
mechanism 
previously 
described, 
since it uses the 8089's external terminate 
func- 


tion, 
provides 
an 
extremely 
fast 
interrupt 
response time. 


Note that when using dynamic 
RAM memory 
with the 8089, an Intel® 8202 Dynamic RAM 
Controller 
can be used to simplify the interface 
and to perform 
the RAM refresh cycle. When 
maximum transfer rates are required, 
the RAM 
refresh cycle can be externally 
initiated 
by the 
8089. By connecting the decoded DACK (DMA 
acknowledge) 
signal 
to 
the 
8202's 
REFRQ 
(refresh request) input, the refresh cycle will occur 
coincident 
with the 
110 device bus cycle and 
therefore 
will not 
impose 
wait 
states 
in the 
memory bus cycle. 


Most 8089 programming will be performed at the 
assembly language level using ASM-89, the 8089 
assembler. During program debugging, however, 
it may be necessary to work directly with machine 
instructions when monitoring the bus, reading un- 
formatted memory dumps, etc. This section con- 
tains both a table to encode any ASM-89 instruc- 
tion into its corresponding 
machine instruction 


Figure 4-32 shows the format of a typical 8089 
machine instruction. 
Except for the LPDI 
and 


memory-to-memory 
forms 
of 
the 
MOV 
and 
MOVB instructions 
that are six bytes long, all 
8089 machine instructions consist of from two to 
five bytes. The first two bytes are always present 
and are generally formatted 
as shown in figure 
4-32 (table 4-24 contains 
the exact encoding of 
every instuction). 


Bits 5 through 7 of the first byte of an instruction 
comprise the R/B/P 
field. This field identifies a 
register, bit select or pointer register operand as 
outlined in table 4-20. 


Code 
Register 
Bit 
Pointer 


000 
GA 
0 
GA 


001 
GB 
1 
GB 


1 
010 
GC 
2 
GC 


011 
BC 
3 
N/A 


100 
TP 
4 
TP 


101 
IX 
5 
N/A 


110 
CC 
6 
N/A 
111 
MC 
7 
N/A 


The WB field (bits 3 and 4 of the first byte) in- 
dicates how many displacement/data 
bytes are 
present in the instruction as outiined in table 4-21. 
The displacement 
bytes are 
used 
in program 
transfers; one byte is present for short transfers, 
while long transfers 
contain 
a two-byte (word) 
displacement. 
As mentioned 
in Chapter 
3, the 


MEMORY 
AODRESSING 
MODE 


NUMBER 
OF DISPLACEMENT/DATA 
BYTES 


REGISTER, 
BIT, 
POINTER 
SELECT 


displacement is stored in two's complement nota- 
tion with the high-order 
bit indicating the sign. 
Data bytes contain the value of an immediate con- 
stant 
operand. 
A byte 
immediate 
instruction 
(e.g., MOVBI) will have one data byte, and a 
word immediate 
instruction 
(e.g., 
ADD!) 
will 
have two bytes (a word) of immediate 'data. An 
instruction 
may contain 
either dispiacement 
or 
data bytes, but not both (the TSL instruction is an 
exception and contains one byte of displacement 
and one byte of data). 'If an offset byte is present, 
the displacement/data 
byte(s) always follow the 
offset byte. 


Code 
Interpretation 


00 
No displacement/data 
bytes 
01 
One displacement/data 
byte 
10 
Two displacement/data 
bytes 
11 
TSL instruction 
only 


The AA field specifies the addressing mode that 
the processor is to use in order to construct the ef- 
fective address of a memory operand. 
Four ad- 
dressing modes are available as outlined in table , 
4-22. (Address modes are described in detail in 
section 3.8.) 


Code 
Interpretation 
.. 


00 
Base register only 
01 
Base register plus offset 
10 
Base register plus IX 
11 
Base register plus IX, 
auto-increment 


Bit 0 of the first instruction byte indicates whether 
the instruction 
operates 
on a byte (W=O) or a 
word (W=l). 


Bits. 7 through 
2 of ,the second instruction 
byte 
specify the instruction 
opcode. 
The opcode, 
in 
conjunction 
with the W field of the first byte, 


identifies the instruction. 
For 'example, 
the op- 
code "l1lOl1" 
denotes 
the decrement 
instruc- 
tion; if W=O, the assembly language instruction is 
DECB, while if W=l, 
the instruction 
is DEC. 


Table 4-26 lists, in hexadecimal order, the opcode 
of every assembly language instruction. 


The MM field (bits 0 and 
\) indicates 
which 
pointer (base) register is to be used to construct 
the effective address of a memory operand. Table 
4-23 defines the MM field encoding. 
(Memory 
operand addressing is described in section 3.8.) 


Code 
Base Register 


00 
, 
GA 


.01 
GB 
.. 
10 
GC 
11 
pp 


When the AA field value is "0\ '~.(base register 
+ offset addressing), the third byte of the instruc- 
tion contains the offset value. This unsigned value 
is added 
to the 
content 
of the 
base 
register 
specified by the MM field to form the effective 
address of the memory operand.' 
. 


When the AA field va,lue is "lO," 
the IX register 
value is added to the content of the base register 
specified by the MM field to provide a 64k range 
of effective addresses. (Note that the upper four 
bits of the IX register are not sign-extend~d.) 


When the AA field value is "11," 
the IX register 
value is added to the base register value to for.m 
the effective address as described for an AA field 
value of "10." 
In this addressing mode, however, 


the IX register value is incremented by one after 
every byte accessed. 


AAROOAA1 
100000MM 
offset if AA-01 


AAAOOAAl 
100001MM 
offset if AA ••01 
1 


OOOOOAA1 
100,100MM 
offset 
if AA-01 
OOOOOAA, 
1"00"MM 
I 
offset If AA-01 
I 


M~VB 
• 
Move 
byte 
variable 


Memory 
to register 


Register 
to memory 


AAAOO 
•••••• O 
100000MM 
offset 
if AA-01 


AAAOO 
•••••• O 
100001MM 
offset 
if AA-=Ol 


00000 
•••••• 0 
100100MM 
offset 
if AA-01 
o 0 0 0 0 A 
A 
0 
11 
1 0 0 11M 
M 
1 
offset 
If AA-01 
I 


MOVSI 
••• Move 
byte 
immediate 


ImmedIate 
to register 


Immediate 
to memory 


MOVP 
:= Move 
pointer 


Memory 
to pointer 
register 


I p 
P P 0 0 A A 
1 
11 
0 0 0 10M 
M I 
offset 
If AA-01 


Memory 
to register 


Register 
to memory 


Immediate 
to register 


Immediate 
to memory 


INC = Increment 
word 
by 1 


Register 


Memory 


100000 
A 
A 01111010 
M M I 
offset 
If AA-01 


DEC 
;;: Decrement 
word 
by 1 


Register 


I 0 0 0 0 0 A 
A 0 I 1 1 1 0 11M 
M I 
offset 
It AA-01 


AND 
:II:: AND word 
variable 


Memory 
to register 


Register 
to memory 


ANDB 
;:::AND 
byte 
variable 


Memory 
to register 


Register 
to memory 


Immediate 
to register 


Immediate 
to memory 


ANDBI 
.• AND 
byte 
immediate 


Immediate 
to register 


Immediate 
to memory 


Memory 
to register 


Register 
to memory 


AAAOOOOO 
00101100 


OOOOOAAl 
110111 
M M 
offset if AA ••Ol 1 


AAAOOAAl 
1 01 
0 "M 
M 
offset if AA ••Ol 
1 


IBBBOOAAO!,11101MM 


I B B BOO 
A A 0 I' 
1 1 1 10M 
M I 
offset 
jf AA-Ol 


1,0001 
A A 1 1,00111 
M M I 
offset 
if AA-Ol 


1,000100°1°0100000 


I, 
0 
0 
1 0 
0 
0 
1 I 0 
0 
1 0 
0 
0 
0 
0 
1__ 
d_i_'P_"'_O __ 
~ 
__ 
di_'P_"_hi 
__ 


label 
to register 


label 
to memory 


01000100 


1 1 1 001 
M M 


disp-8 


offset 
if AA-01 


lJZ = Long jump 
if word 
isO 


label 
to register 


• JNZ = Jump 
if word 
not 0 


label 
to register 


label 
to memory 


label 
to register 


label 
to memory 


I 0 0 0 0 1 A A 0 11 
0 1 1 0 0 M M I 
offset 
if AA"01 


"The 
A$M-89 
Assembler 
will 
automatically 
generate 
the 
long 
form 
of a program 
transfer 
instruction 
when 
the 


target 
is known 
to be beyond 
the byte-displacement 
range. 


11 
S 0" ° 0 0 0 0 1 0 0 0 0 0 0 0 0 I 


101100000100000000 


101000000100000000 


100100000101001000 


100000000100000000 
I 


Table 4-26 lists all of the 8089 machine instruc- 
tions in hexadecimal/binary 
order by their second 
byte. This table may be used to "decode" 
an 


assembled machine instruction 
into its ASM-89 
symbolic form. The preceding table (table 4-25) 
defines the notation used in table 4-26. 


5 


-0 
PPP 
RRR 
AA 
BBB 
offset-Io 
offset-hi 
segment-Io 
segment-hi 
data-8 
data-Io 
data-hi 
disp-8 
disp-Io 
disp-hi 
(offset) 


Logical width of source bus; 0=8, 1=16 
Logical width of destination 
bus; 0=8, 1~16 
Pointer register encoded in RI B/P field 
Register encoded in RI BI P field 
AA (addressing 
mode) field 
Bit select encoded in RI BI P field 
Low-order byte of offset word in doubleword 
pointer 
High-order byte of offset word in doubleword 
pointer 
Low-order byte of segment word in doubleword 
pointer 
High-order byte of segment word in doubleword 
pointer 
8-bit immediate constant 
Low-order byte of 16-bit immediate constant 
High-order byte of 16-bit immediate constant 
8-bit signed displacement 
Low-order byte of 16-bit signed displacement 
High-order byte of 16-bit signed displacement 
Optional 8-bit offset used in offset addressing 


Byte 1 
Byte 2 
Bytes 3, 4, 5, 6 
Hex 
Binary 


00000000 
00 
00000000 
01000000 
00 
00000000 
15000000 
00 
00000000 
01100000 
00 
00000000 
01 
00000001 


+ 
+ 
07 
00000111 
PPP10001 
08 
00001000 offset-Io, offset-h i,seg ment-I0, seg ment-h i 
09 
00001001 
+ 
+ 
1F 
00011111 
RRR01000 
20 
00100000 data-8 
RRR10001 
20 
00100000 data-Io,data-hi 
10001000 
20 
00100000 disp-8 
10010001 
20 
00100000 disp-Io,disp-hi 
21 
00100001 


+ 
+ 
23 
00100011 
RRR01000 
24 
00100100 data-8 
RRR10001 
24 
00100100 'data-Io,data-hi 
25 
00100101 


+ 
+ 
27 
00100111 
RRR01000 
28 
00101000 data-8 


NOP 
51NTR 
WID 
source-width,dest-width 
XFER 


} 
not used 


LPDI 
ptr-reg, immed32 


} 
not used 


ADDBI 
register,immed8 
ADDI 
register,immed16 
JMP 
short-label 
LJMP 
long-label 


} 
not used 


ORBI 
register,imrned8 
ORI 
register,immed16 


} . not used 


ANDBI 
register,immed8 


.•._v......_-.u. 
UUU7 
.•.•.•. 
Q"'UUI~ 
.l.Ui:tLIU\'LlUU 
U~\';UUlll!S U_UIUe;: \.\.....Olll- Q. 


Byte 1 
Byte 2 
Bytes 3, 4, 5, 6 
ASM89 
Instruction 
Format 
Hex 
Binary 


RRR10001 
28 
00101000 
data-Io,data-hi 
ANDI 
register,immed16 


29 
00101001 
} 
+ 
+ 
not used 


2B 
00101011 
RRROOOOO 
2C 
00101100 
NOT 
register 
20 
00101101 
} 
+ 
+ 
not used 
2F 
00101111 
RRR01000 
30 
00110000 
data-8 
MoVBI 
register,immed8 
RRR10001 
30 
00110000 
data-Io,data-hi 
MoVI 
register,immed16 
31 
00110001 
} 
+ 
+ 
not used 
37 
00110111 
RRROOOOO 
38 
00111000 
INC 
register 
39 
00111001 
} 
+ 
+ 
not used 
3B 
00111011 
RRROOOOO 
3C 
00111100 
DEC 
register 
3D 
00111101 
} 
+ 
+ 
not used 
3F 
00111111 
RRR01000 
40 
01000000 
disp-8 
JNZ 
register, 
short-label 
RRR10000 
40 
01000000 
disp-Io,disp-hi 
LJNZ 
register, 
long-label 
41 
01000001 
} 
+ 
t 
not used 
43 
01000011 
RRR01000 
44 
01000100 
disp-8 
JZ 
register 
,short-label 
RRR10000 
44 
01000100 
disp-Io,disp-hi 
LJZ 
register,short-Iabel 
45 
01000101 
} 
+ 
+ 
not used 
47 
01000111 
00100000 
48 
01001000 
HLT 
49 
01001001 
} 
+ 
+ 
not used 
4B 
01001011 
00001AAO 
4C 
010011MM 
} 
} 
+ 
+ 
+ 
(offset),data-8 
MoVBI 
mem8,immed8 
00001AAO 
4F 
010011MM 
00010AA1 
4C 
010011MM 
} 
} 
t 
+ 
+ 
(offset), 
data-Io, data-h i 
MoVI 
mem16,immed16 
00010AA1 
4F 
010011MM 
50 
01010000 
} 
+ 
+ 
not used 
7F 
·01111111 
RRROOAAO 
80 
100000MM 
} 
} 
+ 
+ 
(offset) 
MoVS 
register,mem8 
RRROOAAO 
83 
100000MM 


Byte 1 
Byte 2 
Bytes 3, 4, 5, 6 
A~M89 
Instruction 
Format 
Hex 
Binary 


RRROOAA1 
80 
100000MM 
} 
} 
+ 
+ 
+ 
(offset) 
MOV 
register,mem16 


RRROOAA1 
83 
100000MM 
RRROOAAO 
84 
100001MM 
} 
} 
+ 
t 
+ 
(offset) 
MOVB 
mem8,register 


RRROOAAO 
87 
100001MM 
RRROOAA1 
84 
100001MM 
} 
} 
+ 
+ 
+ 
(offset) 
MOV 
mem16,register 


RRROOAA1 
87 
100001MM 
PPPOOAA1 
88 
100010MM 
} 
} 


, 
. 


+ 
+ 
+ 
(offset) 
LPD 
ptr-reg,mem32 


PPPOOAA1 
8B 
100010MM 
PPPOOAA1 
8C 
100011MM 
} 
} 


" 


+ 
+ 
+ 
(offset) 
MOVP 
ptr-reg, mem24 
PPPOOAA1 
8F 
100011MM 
- 


OOOOOAAO 
90 
100100MM 
} 
} 
+ 
t 
t 
(offset),OOOOOAAO,110011MM,(offset) 
MOVB 
merri8,mem8 


OOOOOAAO 
93 
100100MM 
00000AA1 
90 
100100MM 
} 
} 
t 
t 
t 
(offset),OOOOOAA1,110011MM,(offset) 
MOV 
mem16,mem16 
00000AA1 
93 
100100MM 
' ' 


00011AAO 
94 
100101MM 
} 
} 
t 
t 
t 
(offset),data-8,d 
isp-8 
TSL 
mem8,immed8,short-label 
00011AAO 
97 
100101MM 
PPPOOAA1 
98 
100110MM 
} 
} 
t 
t 
t 
(offset) 
MOVP 
mem24,ptr-reg 
PPPOOAA1 
9B 
100110MM 
10001AA1 
9C 
100111MM 
} 
} 
t 
t 
+ 
(offset),disp-8 
CALL 
mem24,short-label 
10001AA1 
9F 
100111MM 
10010AA1 
9C 
100111MM 
} 
} 
+ 
+ 
t 
(offset),disp-Io,d 
isp-h i 
LCALL 
mem24,long-label 
10010AA1 
9F 
100111MM 
RRROOAAO 
AO 
101000MM 
} 
} 
t 
+ 
t 
(offset) 
ADDB 
register,mem8 
RRROOAAO 
A3 
101000MM 
RRROOAA1 
AO 
101000MM 
} 
} 
+ 
+ 
t 
(offset) 
ADD 
register, 
mem16 
RRROOAA1 
A3 
101000MM 
. 


RRROOAAO 
A4 
101001MM 
} 
} 
+ 
+ 
+ 
(offset) 
ORB 
registEH,mem8 
RRROOAAO 
A7 
101001MM 
RRROOAA1 
A4 
101001MM 
} 
} 
+ 
+ 
+ 
(offset) 
OR 
register,mem16 
RRROOAA1 
A7 
101001MM 
RRROOAAO 
A8 
101010MM 
} 
} 


1 


+ 
+ 
+ 
(offset) 
ANDB 
mem8,register 
RRROOAAO 
AB 
101010MM 


Byte 1 
Byte 2 
Bytes 3, 4, 5, 6 
ASM89 Instruction 
Format 
Hex 
Binary 


RRROOAA1 
A8 
101010MM 
} 
} 
+ 
+ 
+ 
(offset) 
AND 
mem16,register 
RRROOAA1 
AB 
101010MM 
RRROOAAO 
AC 
101011MM 
} 
} 
+ 
+ 
+ 
(offset) 
NOTB 
register,mem8 
RRROOAAO 
AF 
101011MM 
RRROOAA1 
AC 
101011MM 
} 
} 
+ 
+ 
+ 
(offset) 
NOT 
register,mem16 
RRROOAA1 
AF 
101011MM 
00001AAO 
BO 
101100MM 
} 
} 
+ 
+ 
+ 
(offset),disp-8 
JMCE 
mem8,short-label 
00001AAO 
B3 
101100MM 
00010AAO 
BO 
101100MM 
} 
} 
+ 
+ 
+ 
(offset) ,disp-Io,d 
isp-hi 
LJMCE 
mem8, long-label 
00010AAO 
B3 
101100MM 
00001AAO 
B4 
101101MM 
} 
} 
+ 
+ 
+ 
(offset),disp-8 
JMCNE 
mem8,short-label 
00001AAO 
B7 
101101MM 
00010AAO 
B4 
101101MM 
} 
} 
+ 
+ 
+ 
(offset) ,disp-Io,d 
isp-h i 
LJMCNE 
mem8, long-label 


00010AAO 
B7 
101101MM 
BBB01AAO 
B8 
101110MM 
} 
} 
+ 
+ 
+ 
(offset),disp-8 
JNBT 
mem8,bit-select,short-Iabel 
BBB01AAO 
BB 
101110MM 
BBB10AAO 
B8 
101110MM 
} 
} 
+ 
+ 
+ 
(offset),d 
isp-Io,d isp-h i 
LJNBT 
mem8,bit-select,long-label 
BBB10AAO 
BB 
101110MM 
BBB01AAO 
BC 
101111MM 
} 
} 
+ 
+ 
+ 
(offset),disp-8 
JBT 
mem8,bit-select,short-label 
BBB01AAO 
BF 
101111MM 
BBB10AAO 
BC 
101111MM 
} 
} 
t 
t 
t 
(offset),disp-Io,d 
isp-h i 
LJBT 
mem8,bit-select,long-label 
BBB10AAO 
BF 
101111MM 
00001AAO 
CO 
110000MM 
} 
} 
t 
t 
t 
(offset) ,data-8 
ADDBI 
mem8,immed8 
00001AAO 
C3 
110000MM 
00010AA1 
CO 
110000MM 
} 
} 
t 
t 
t 
(offset), data-I 0, data-h i 
ADDI 
mem16,immed16 
00010AA1 
C3 
110000MM 
00001AAO 
C4 
110001MM 
} 
} 
t 
+ 
+ 
(offset),data-8 
ORBI 
mem8,immed8 
00001AAO 
C7 
110001MM 
00010AA1 
C4 
110001MM 
} 
} 
t 
+ 
+ 
(offset), data-Io, data-h i 
ORI 
mem16,immed16 
00010AA1 
C7 
110001MM 
00001AAO 
C8 
110010MM 
} 
} 
+ 
+ 
+ 
(offset),data-8 
ANDBI 
mem8,immed8 
00001AAO 
CB 
110010MM 


, 


Byte 1 
Byte 2 
Bytes 3, 4, 5, 6 
ASM89 
Instruction 
Format 


Hex 
Binary 


00010AA1 
C8 
110010MM 
} 
(offset),data-Io,data-hi 
.} 
t 
t 
t 
ANDI 
mem16,immed16 


00010AA1 
CB 
110010MM 
, 


CC 
11001100 
} 
t 
t 
not used 


CF 
11001111 


RRROOAAO 
DO 
110100MM 
} 
}' 
ADDB 
t 
t 
+ 
(offset) 
mem8,register 


RRROOAAO 
D3 
110100MM 


RRROOAA1 
DO 
110100MM 
} 
,} 
+ 
+ 
+ 
(offset) 
ADD 
mem16,register 


RRROOAA1 
D3 
110100MM 


RRROOAAO 
D4 
110101MM 
} 
} 
+ 
+ 
+ 
(offset) 
ORB 
mem8,register 


RRROOAAO 
D7 
110101MM 


RRROOAA1 
D4 
110101MM 
} 
} 
+ 
+ 
t 
(offset) 
OR 
mem16,register 


RRROOAA1 
D7 
110101MM 


RRROOAAO 
D8 
110110MM 
} 
} 
t 
t 
t 
(offset) 
ANDB 
mem8,register 


RRROOAAO 
DB 
110110MM 


RRROOAA1 
D8 
110110MM 
} 
} 
AND 
me~16,register 
t 
t 
+ 
(offset) 


RRROOAA1 
DB 
110110MM 


RRROOAAO 
DC 
110111MM 
} 
} 
+ 
+ 
+ 
(offset) 
.. 
NOTB 
mem8,register 


RRROOAAO 
DF 
11b111MM 


RRROOAA1 
DC 
110111MM 
} 
} 
+ 
+ 
t 
(offset) 
NOT 
mem16,register 
RRROOAA1 
DF 
110111MM 


00001AAO 
EO 
111000MM 
} 


.- 


} 
+ 
+ 
+ 
(offset),disp-8 
JNZB 
mem8,short-label 
00001AAO 
E3 
111000MM 


00001AA1 
EO 
111000MM 
} 
} 
t 
+ 
+ 
(offset),disp-8 
JNZ 
mem16,short-label 
00001AA1 
E3 
111000MM 
00010AAO 
EO 
111000MM 
} 
} 
t 
t 
t 
(offset),d 
isp-Io,disp-hi 
LJNZB 
mem8,long-label 
00010AAO 
E3 
111000MM 


00010AA1 
EO 
111000MM 
} 
} 
t 
t 
t 
(offset) 
,disp-Io,disp-hi 
LJNZ 
mem16,longlabel 
00010AA1 
E3 
111000MM 


00001AAO 
E4 
111001MM 
} 
} 
t 
• 
• 
(offset),disp-8 
JZB 
mem8,short-label 
00001AAO 
E7 
111001MM 


00001AA1 
E4 
111001MM 
} 
} 
• 
• 
• 
(offset),disp-8 
JZ 
mem16,short-label 
00001AA1 
E7 
111001MM 


Byte 1 
Byte 2 
Bytes 3, 4, 5, 6 
ASM89 
Instruction 
Format 
Hex 
Binary 


00010AAO 
E4 
111001MM 
} 
} 
t 
t 
t 
(offset), d isp-Io, d isp-h i 
LJZB 
mem8,long-label 
00010AAO 
E7 
111001MM 
00010AA1 
E4 
111001MM 
} 
} 
t 
t 
t 
(offset), disp-Io, d isp-h i 
LJZ 
mem16,long-label 
00010AA1 
E7 
111001MM 
OOOOOAAO 
E8 
111010MM 
} 
} 
t 
t 
t 
(offset) 
INCB 
mem8 
OOOOOAAO 
EB 
111010MM 
00000AA1 
E8 
111010MM 
} 
} 
t 
t 
t 
(offset) 
INC 
mem16 
00000AA1 
EB 
111010MM 
OOOOOAAO 
EC 
111011MM 
} 
} 
t 
t 
t 
(offset) 
DECB 
mem8 
OOOOOAAO 
EF 
111011MM 
00000AA1 
EC 
111011MM 
} 
(offset) 
} 
t 
t 
t 
DEC 
mem16 
00000AA1 
EF 
111011MM 
FO 
11110000 
} 
t 
t 
not used 
F3 
11110000 
BBBOOAAO 
F4 
111101MM 
} 
(offset) 
} 
t 
t 
t 
SETB 
mem8,O-7 
BBBOOAAO 
F7 
111101MM 
BBBOOAAO 
F8 
111110MM 
} 
(offset) 
} 
CLR 
t 
t 
t 
mem8,0-7 
BBBOOAAO 
FB 
111110MM 
FC 
11111100 
} 
not used 
t 
t 
FF 
11111111 
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application notes, in the order listed, have been included within this appendix: 


AP-67 
AP-61 
AP-50 
AP-51 
AP-59 
AP-28A 
AP-43 


8086 System Design 
Multitasking for the 8086 
Debugging Strategies and Considerations for 8089 Systems 
Designing 8086, 8088, 8089 Multiprocessing Systems with the 8289 Bus Arbiter 
Using the 8259A Programmable 
Interrupt Controller 
Intel® Multibus™ Interfacing 
Using the iSBC-957™ Execution Vehicle for Executing 8086 Program Code 


APPLICATION 
NOJE 


September 1979 


8086 System Design 


2. 8086 OVERVIEW AND BASIC SYSTEM 
CONCEPTS 


A. Bus Cycle Definition 
B. Address and Data Bus Concepts 
C. System Data Bus Concepts 
D. Multiprocessor 
Environment 


3. 8086 SYSTEM DETAILS 


A. Operating 
Modes 
B. Clock Generation 
C. Reset 
D. Ready Implementation and Timing 
E. Interrupt 
Structure 
F. Interpreting the 8086 Bus Timing Diagrams 
G. Bus Control Transfer 


The 8086 family, Intel's new senes 01 microprocessors 
and system 
components, 
offers 
the designer 
an ad- 
vanced system architecture 
which can be structured 
to 
satisfy 
a broad 
range of applications. 
The variety 
of 
speed, configuration 
and component 
selections 
avail- 
able within the family enables optimization 
of a specific 
design to both cost and performance 
objectives. 
More 
Important 
however, the 8086 family concept allows the 
designer to develop a family of systems providing multi- 
ple levels of enhancement 
within a single design and a 
growth path for future designs. 


This application 
note is directed toward the Implemen- 
tation 
of the system 
hardware and will 
provide an in- 
troduction 
to a representative 
sample of the systems 
configurable 
with the 8086 CPU member of the family. 
Application 
techniques 
and timing analysis will be given 
to aid the designer in understanding 
the system require- 
ments, 
advantages 
and 
limitations. 
Additional 
Intel 
publications 
the reader may wish to reference are the 
8086 User's 
Manual 
(9800722A), 8086 Assembly 
Lan- 


SPECIFICATION 
(9800683), AP-45 Using the 8202 Dy- 


namic 
RAM 
Controller 
(9800809A), AP-51 
Designing 
8086, 8088, 8089 MUltiprocessor 
Systems with the 8289 
Bus Arbiter and AP-59 Using the 8259A Programmable 
Interrupt 
Controller. 
References 
to other Intel publica- 


tions wili be made throughout 
this note. 


2. 8086 OVERVIEW AND BASIC SYSTEM CONCEPTS 


2A. 8086 Bus Cycle Definition 


The 8086 is a true 16-bit microprocessor 
with 16-bit In- 


ternal and external data paths, one megabyte of memory 
address space (2**20) and a separate 64K byte (2**16) 
I/O address space. The CPU communicates 
with its ex- 


ternal environment 
via a twenty-bit 
time multiplexed 
ad- 
dress, 
status 
and data bus and a command 
bus. To 
transfer data or fetch instructions, 
the CPU executes a 
bus cycle (Fig. 2A1).The minimum bus cycle consists 
of 


four CPU clock cycles called T states. During the first T 
state (T1), the CPU asserts an address on the twenty-bit 
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multiplexed address/data/status bus. For the second T 
state (T2), the CPU removes the address from the bus 
and either three-states its outputs on the lower sixteen 
bus lines in preparation for a read cycle or asserts write 
data. Data bus transceivers are enabled in either T1 or 
T2 depending on the 8086 system configuration and the 
direction of the transfer (into or out of the CPU). Read, 
write or interrupt acknowledge commands are always 
enabled in T2. The maximum mode 8086 configuration 
(to be discussed later) also provides a write command 
enabled in T3 to guarantee data setup time prior to com- 
mand activation. 


During T2, the upper four multiplexed bus lines switch 
from 
address 
(A19·A16) 
to 
bus 
cycle 
status 


(56,55,54,53). The status 
information 
(Table 2A1) is 


available primarily for diagnostic monitoring. However, 
a decode of 53 and 54 could be used to select one of 
four banks of memory, one assigned to each segment 
register. This technique allows partitioning the memory 
by segment to expand the memory addressing beyond 
one megabyte. It also provides a degree of protection by 
preventing erroneous write operations to one segment 
from overlapping into another segment and destroying 
information in that segment. 


The CPU continues to provide status information on the 
upper four bus lines during T3 and will either continue 
to assert write data or sample read data on the lower six- 
teen bus lines. If the selected memory or I/O device is 
not capable of transferring data at the maximum CPU 
transfer 
rate, the device must signal the CPU "not 


ready" and force the CPU to insert additional 
clock 


cycles (Wait states TW) after T3. The 'not ready' indica- 
tion must be presented to the CPU by the start of T3. 
Bus activity during TW is the same as T3. When the 
selected device has had sufficient time to complete the 
transfer, it asserts "Ready" and allows the CPU to con- 
tinue from the TW states. The CPU will latch the data on 
the bus during the last wait state or during T3 if no wait 
states are requested. The bus cycle is terminated in T4 
(command lines are disabled and the selected external 
device deselects from the bus). The bus cycle appears 
to devices in the system as an asynchronous event con- 
sisting of an address to select the device followed by a 
read strobe or data and a write strobe. The selected 
device accepts bus data during a write cycle and drives 
the desired data onto the bus during a read cycle. On ter- 
mination of the command, the device latches write data 
or disables its bus drivers. The only control the device 
has on the bus cycle is the insertion of wait cycles. 


The 8086 CPU only executes a bus cycle when instruc- 
tions 
or operands 
must 
be transferred 
to 
or from 


memory or I/O devices. When not executing a bus cycle, 
the bus interface executes idle cycles (TI). During the 
idle cycles, the CPU continues to drive status informa- 
tion from the previous bus cycle on the upper address 
lines. If the previous bus cycle was a write, the CPU con· 
tinues to drive the write data onto the multiplexed bus 
until the start of the next bus cycle. If the CPU executes 
idle cycles following a read cycle, the CPU will not drive 
the lower 16 bus lines until 
the next bus cycle 
is 


required. 


5ince the CPU prefetches up to six bytes of the instruc- 
tion stream for storage and execution from an internal 
instruction 
queue, the relationship of instruction 
fetch 


and associated operand transfers may be skewed in 
time and separated by additional instruction 
fetch bus 


cycles. In general, if an instruction 
is fetched into the 


8086's internal instruction 
queue, several additional in- 


structions 
may be fetched 
before the instruction 
is 


removed from the queue and executed. If the instruction 
being executed from the queue is a jump or other con· 
trol transfer instruction, 
any instructions 
remaining in 


the queue are not executed and are discarded with no ef- 
fect on the CPU's operation. The bus activity observed 
during execution of a specific instruction 
is dependent 


on the 
preceding 
instructions 
but 
is always deter· 


ministic within the specific sequence. 


53 
54 
o 
0 
Alternate (relative to the E5 segment) 


1 
0 
5tack (relative to the 55 segment) 


o 
1 
Code/None (relative to the C5 seg- 


ment or a default of zero) 


Data (relative to the D5 segment) 


55 = IF (interrupt enable flag) 
56 = 0 (indicates the 8086 is on the bus) 


2B. 8086 Address and Data Bus Concepts 


5ince the majority of system memories and peripherals 
require a stable address for the duration of the bus 
cycle, the address on the multiplexed address/data bus 
during T1 should be latched and the latched address 
used to select the desired peripheral or memory loca- 
tion. 5ince the 8086 has a 16-bit data bus, the multi- 
plexed bus components of the 8085 family are not ap· 
plicable to the 8086 (a device on address/data bus lines 
8-15 will not be able to receive the byte selection ad· 
dress on lines 0-7).To demultiplex the bus (Fig. 281a), 
the 8086 system 
provides an Address Latch Enable 


signal (ALE) to capture the address in either the 8282 or 
8283 8-bit bi-stable latches (Diag. 2B1). The latches are 
either inverting (8283) or non-inverting (8282) and have 
outputs driven by three-state buffers that supply 32 mA 
drive capability and can switch a 300 pF capacitive load 
in 22 ns (inverting) or 30 ns (non-inverting). They prop- 
agate the address through to the outputs while ALE is 
high and latch the address on the falling edge of ALE. 
This 
only 
delays 
address 
access 
and chip 
select 


decoding by the propagation delay of the latch. The out· 
puts are enabled through the low active OE input. The 
demultiplexing 
of the 
multiplexed 
address/data 
bus 
(Iatchings of the address from the multiplexed bus), can 
be done locally at appropriate points in the system or at 
the CPU with a separate address bus distributing the ad· 
dress throughout the system (Fig. 2B1b). For optimum 
system performance and compatibility 
with multiproc· 


essor and MULTIBU5™ configurations, 
the latter tech· 


nique is strongly recommended over the first. The re- 
mainder of this note will assume the bus is demul- 
tiplexed at the CPU. 
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The programmer views the 8086 memory address space 
as a sequence of one million bytes in which any byte 
may contain an eight bit data element and any two con- 
secutive bytes may contain a 16-bit data element. There 
is no constraint on byte or word addresses (boundaries). 
The address space is physically implemented on a six- 
teen bit data bus by dividing the address space into two 
banks of up to 512K bytes (Fig. 2B2). One bank is con- 
nected to the lower half of the sixteen-bit data bus (D7-0) 
and contains even addressed bytes (AO= 0). The other 
bank is connected to the upper half of the data bus 
(D15-8) and contains odd addressed bytes (AO= 1). A 
specific byte within each bank is selected by address 
lines A19-A1. To perform byte transfers to even ad- 
dresses (Fig. 2B3a), the information is transferred over 
the lower half of the data bus (D7-0).AO(active low) is 
used to enable the bank connected to the lower half of 
the data bus to participate 
in the transfer. Another 


signal provided by the 8086, Bus High Enable (BHE), is 
used to disable the bank on the upper half of the data 
bus from participating in the transfer. This is necessary 
to prevent a write operation to the lower bank from 
destroying 
data in the upper bank. Since BHE is a 


multiplexed signal with timing identical to the A19-A16 
address lines, it also should be latched with ALE to pro- 
vide a stable signal during the bus cycle. During T2 
through T4, the BHE output is multiplexed with status 
line S7 which is equal to BHE.To perform byte transfers 
to odd addresses (Fig. 2B3b), the information is trans- 
ferred over the upper half of the data bus (D15-D8)while 
BHE (active low) enables the 
upper bank and AO 


disables the lower bank. Directing the data transfer to 
the appropriate half of the data bus and activation of 
BHE and AOis performed by the 8086,transparent to the 
programmer. As an example, consider loading a byte of 
data into the CL register (lower half of the CX register) 
from an odd addressed memory location (referenced 
over the upper half of the 16-bit data bus). The data is 
transferred into the 8086 over the upper 8 bits of the 
data bus, automatically redirected to the lower half of 
the 8086 internal 16-bit data path and stored into the CL 
register. This capability also allows byte I/O transfers 
with the AL register to be directed to I/O devices con- 
nected to either the upper or lower half of the 16-bit data 
bus. 


To access even addressed sixteen bit words (two con- 
secutive bytes with the least significant byte at an even 


byte address), A19-A1 select the appropriate 
byte within 
each bank and AO and BHE (active low) enable both 
banks simultaneously 
(Fig. 2B3c). To access an odd ad· 
dressed 
16-bit word (Fig. 2B3d), the least significant 
byte (addressed by A19-A1) is first transferred 
over the 
upper half of the bus (odd addressed byte, upper bank, 
BHE low active and AO= 1). The most significant 
byte is 
accessed 
by incrementing 
the address (A19-AO) which 
allows 
A19-A1 to address the next physical word loca· 
tion (remember, AO was equal to one which indicated a 
word referenced 
from an odd byte boundary). A second 
bus cycle is then executed to perform the transfer of the 
most significant 
byte with the lower bank (AOis now ac- 
tive low and BHE is high). The sequence is automatically 
executed 
by the 8086 whenever a word transfer 
is ex- 
ecuted to an odd address. Directing the upper and lower 
bytes of the 8086's internal sixteen-bit 
registers 
to the 
appropriate 
halves of the data bus is also performed 
automatically 
by the 8086 and is transparent 
to the pro- 
grammer. 
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During a byte read, the CPU floats the entire sixteen-bit 
data bus even though data is only expected on the upper 
or lower half of the data bus. As will be demonstrated 
later, this action simplifies 
the chip select decoding 
re- 


quirements 
for read only devices (ROM, EPROM). During 
a byte write 
operation, 
the 8086 will 
drive the entire 
sixteen-bit 
data bus. The information 
on the half of the 
data bus not transferring 
data is indeterminate. 
These 
concepts 
also apply to the I/O address space. Specific 
examples of I/O and memory interfacing 
are considered 
in the corresponding 
sections. 


2C. System Data Bus Concepts 


When referring 
to the system data bus, two implemen- 
tation 
alternatives 
must 
be considered; 
(a) the multi· 
plexed address/data 
bus (Fig. 2C1a) and a data bus buf- 
fered 
from 
the multiplexed 
bus by transceivers 
(Fig. 


2C1b). 


If memory or I/O devices are connected 
directly 
to the 
multiplexed 
bus, 
the 
designer 
must 
guarantee 
the 
devices do not corrupt the address on the bus during T1. 


· 
- 
-- 
The 8086 timing guarantees that read is not viilid .until 
after the address is latched by ALE (Oiag. 2C1).All Intel 
peripherals, EPROMproducts and RAM's for microproc- 
essors provide output enable or read inputs to allow 
connection to the multiplexed bus. 


Several techniques are available for interfacing devices 
without output enables to the multiplexed bus but each 
introduces ~ther restrictions 
or limitations. 
Consider 


Figure 2C3 which has chip select gated with read and 
write. Two problems exist with this technique. First, the 
chip select access time is reduced to the read access 
time, 
and may require a faster device if maximum 


system performance (no wait states) is to be achieved 
(Oiag. 2C2). Second, the designer must verify that chip 
select to write setup and hold times for the device are 
not violated (Olag.2C3).Alterniite techniques can be ex- 
tracted from the bus interfacing techniques given later 
in this section but are subject to the associated restric- 
tions. 
II'] general, the best solution 
is obtained with 


devices having output enables. 


A subsequent limitation on the multiplexed bus is the 
8086's drive capability of 2.0 mA and capacitive loading 
of 100 pF to guarantee the specified A.C. character- 
istics. 
Assuming 
capacitive 
loads of 20 pF per I/O 


device, 12 pF per address latch and 5-12 pF per memory 
device, a system mix of three peripherals and two to 
four memory devices (per bus line) are close to the 
loading limit. 
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To satisfy the capacitive loading and drive requirements 
of larger systems, 
the data bus must be buffered. 
The 


9286 non-inverting 
and 8287 inverting octal transceivers 


are offered as part of the 8086 family to satisfy this re- 
quirement. 
They have three-state 
output 
buffers 
that 


drive 32 mA on the bus interface and 10 mA on the CPU 
interface 
and can switch capacitive 
loads of 300 pF at 


the bus interface and 100 pF on the CPU interface 
in 22 


ns (8287) or 30 ns (8286).To enable and control the direc- 
tion of the transceivers, 
the 8086 system provides Data 


ENable (DEN) and Data Transmit/Receive 
(DT/Rj signals 


(Fig. 2C1b). These signals 
provide the appropriate 
tim- 


ing to guarantee 
isolation 
of the multiplexed 
bus from 


the system during T1 and elimination 
of bus contention 


with the CPU during read and write (Diag. 2C4). Although 
the memory and peripheral devices are isolated from the 
CPU (Fig. 2C4), bus contention 
may still 
exist 
in the 


system if the devices do not have an output enable con- 
trol other than chip select. As an example, bus conten- 
tion will exist during transition 
from one chip select to 


another (the newly selected 
device begins driving 
the 


bus before the previous device has disabled its drivers). 
Another, 
more severe case exists during a write cycle. 


From chip select to write active, a device whose outputs 
are controlled 
only 
by chip 
select, 
will 
drive the bus 


simultaneously 
with write data being driven through the 


transceivers 
by the CPU (Diag. 2C5). The same tech- 


nique given for circumventing 
these problems 
on the 


multiplexed 
bus can be applied here with the same limi- 


tations. 


One last extension 
to the bus implementation 
is a sec- 


ond level of buffering 
to reduce the total load seen by 


devices on the system 
bus (Fig. 2C5). This is typically 


done for multiboard 
systems 
and isolation 
of memory 


arrays. The concerns with this configuration 
are the ad- 


ditional 
delay for access and more important, 
control of 


the second transceiver in relationship 
to the system bus 


and the device 
being 
interfaced 
to the system 
bus. 


Several techniques 
for controlling 
the transceiver 
are 


given in Figure 
2C6. This 
first 
technique 
(Fig. 2C6a) 


simply 
distributes 
DEN 
and 
DT/R 
throughout 
the 


system. DT/R is inverted to provide proper direction 
con- 


trol for the second level transceivers. 
The second exam- 


ple (Fig. 2C6b) provides control 
for devices with output 


enables. 
RD is used to normally 
direct 
data from the 


system 
bus to the peripheral. 
The buffer 
is selected 


whenever a device on the local bus is ctlip selected. Bus 
contention 
is possible on the device's local bus during a 


read as the read simultaneously 
enables the device out- 


put and changes the transceiver 
direction. 
The conten- 


tion may also occur as the read is terminated. 


For devices without output enables, the same technique 
can be applied (Fig. 2C6c) if the chip select to the device 
is conditioned 
by read or write. 
Controlling 
the chip 


select with read/write 
prevents the device from driving 


against 
the transceiver 
prior 
to the command 
being 


received. The limitations 
with this technique 
are access 


limited to read/write time and limited CS to write setup 
and hold times. 
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An alternate technique applicable to devices with and 
without 
output enables is shown in Figure 2C6d. RD 


again controls the direction of the transceiver but It is 
not enabled until a command and chip select are active. 
The possibility 
for bus contention 
still 
exists 
but is 


reduced to variations 
in output 
enable vs. direction 


change time for the transceiver. Full access time from 
chip select is now available, but data will not be valid 
prior to write and will only be held valid after write by the 
delay to disable the transceiver. 
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Figure 2C6d. 
Bullering 
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One last technique is given for devices with separate in- 
puts and outputs (Fig. 2C6e).Separate bus receivers and 
drivers are provided rather than a single transceiver. The 
receiver is always enabled while the bus driver is con· 
trolled by RD and chip select. The only possibility 
for 


bus 
contention 
In this 
system 
occurs 
as multiple 


devices on each line of the local read bus are enabled 
and disabled during chip selection changes. 


Throughout this note, the multiplexed 
bus will be con· 


sidered the local CPU bus and the demultiplexed 
ad· 


dress and buffered data bus will be the system bus. For 
additional 
information 
on 
bus 
contention 
and 
the 


system problems associated with it, refer to Appendix 1. 
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20. 
Multiprocessor 
Environment 


The 8086 architecture supports multiprocessor systems 
based on the concept of a shared system bus (Fig. 2D1). 
All CPU's in the system communicate with each other 
and share resources via the system bus. The bus may be 
either the Intel Multibus™ system bus or an extension 
of the system bus defined in the previous section. The 
major addition 
required to the demultiplexed 
system 


bus is arbitration logic to control access to the system 
bus. As each CPU asynchronously requests access to 
the shared bus, the arbitration logic resolves priorities 
and grants bus access to the highest priority CPU. Hav- 
ing gained access to the bus, the CPU completes its. 
transfer and will either relinquish the bus or wait to be 
forced 
to 
relinquish 
the bus. For a discussion 
on 


Multibus™ arbitration techniques, refer to AP-28A, Intel 
Multibus ™ Interfacing. 


To support 
a multimaster 
interface 
to the Multibus 


system bus for the 8086 family, the 8289 bus arbiter is 
included as part of the family. The 8289 Is compatible 
with the 8086's local bus and in conjunction 
with the 


8288 bus controller, 
implements the Multibus protocol 


for bus arbitration. The 8289 provides a variety of arbitra· 
tion and prioritization 
techniques to allow optimization 


of bus availability, throughput and utilization of shared 
resources. Additional 
features (implemented 
through 


AP-fllnt support 01 a local l,;t'U DUS lor private resources. 


For specific configurations 
and additional 
Information 


on the 8289, refer to application note AP-51. 


3. 8086 
SYSTEM 
DETAILS 


3A. Operating 
Modes 


Possibly the most unique feature of the 8086 Is the abili- 
ty to select the base machine configuration most suited 
to the application. 
The MN/MX Input to the 8086 Is a 


strapping option which allows the designer to select 
between two functional 
definitions 
of a subset of the 


8086 outputs. 


MINIM.UM MODE 


The minimum 
mode 8086 (Fig. 3A1) is optimized 
for 


small 
to 
medium 
(one or two 
boards), single 
CPU 


systems. Its system architecture 
Is directed at satisfy- 


ing the requirements of the lower to middle segment of 
high performance 
16·bit applications. 
The CPU main- 


tains the full megabyte memory space, 64K byte I/O 
space and 16-bit data path. The CPU directly provides all 
bus 
control 
(DT/R, 
DEN, 
ALE, 
M/io), 
commands 


(RD,WR,INTA) and a simple 
CPU preemption 
mech- 


anism (HOLD, HLDA) compatible 
with existing 
DMA 


controllers. 


The maximum mode (Fig. 3A2) extends the system ar- 
chitecture 
to support 
multiprocessor 
configurations, 


and local 
instruction 
set extension 
processors 
(co- 


processors). Through addition of the 8288 bipolar bus 
controller, the 8086 outputs assigned to bus control and 
commands in the minimum mode are redefined to allow 
these extensions and enhance general system perform- 
ance. Specifically, (1)two prioritized levels of processor 
preemption 
(RQ/GTO, 
RQ/GTI) 
allow 
multiple 
proc- 


essors to reside on the 8086's local bus and share its in- 
terface to the system bus, (2)Oueue status (050,051) is 
available 
to allow 
external 
devices 
like ICE™-86 or 


special instruction set extension co-processors to track 
the CPU instruction 
execution, 
(3) access control 
to 


shared resources in multiprocessor 
systems 
is sup- 


ported 
by a hardware bus lock mechanism 
and (4) 


system 
command and configuration 
options 
are ex- 


panded via ancillary devices like the 8288 bus controller 
and 8289 bus arbiter. 


The queue status indicates what information 
is being 


removed from the internal queue and when the queue Is 
being reset due to a transfer of control (Table 3A1). By 
monitoring 
the SO,51,52 
status lines for instructions 


entering the 8086 (1,0,0 indicates code access while AO 
and BHE indicate word or byte) and 050, 051 for in- 
structions 
leaving the 8086's Internal queue, it is possi- 


ble to track the instruction 
execution. Since instruc- 


tions are executed from the 8086's internal queue, the 
queue status is presented each CPU clock cycle and is 
not related to the bus cycle activity. This mechanism (1) 
allows 
a 
co-processor 
to 
detect 
execution 
of 
an 


ESCAPE instruction 
which directs the co-processor to 


perform a specific task and (2) allows ICE-B6to trap ex- 
ecution of a specific memory location. An example of a 
circuit used by ICE is given in Figure 3A3. The first up 
down counter tracks the depth of the queue while the 
second captures the queue depth on a match. The sec- 
ond counter decrements on further fetches from the 
queue until the queue is flushed or the count goes to 
zero indicating 
execution 
of the match address. The 


first 
counter 
decrements 
on fetch 
from 
the queue 


(OSO= 1) and increments 
on code fetches 
into 
the 


queue. Note that a normal code fetch will transfer two 
bytes into the queue so two clock increments are given 
to the counter (T201 and T301) unless a single byte is 
loaded over the upper half of the bus (AO-P Is high). 
Since the execution unit (EU)is not synchronized to the 
bus interface unit (BIU), a fetch from the queue can oc- 
cur simultaneously 
with a transfer into the queue. The 


exclusive-or 
gate driving 
the ENP input of the first 


counter allows these simultaneous operations to cancel 
each other and not modify the queue depth. 
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The queue status Is valid during the CLK cycle after 
which the queue operation is performed. 


To address the problem of controlling access to shared 
resources, the maximum mode 8086 provides a hard- 
ware 
LOCK output. 
The LOCK output 
is activated 


through 
the instruction 
stream by execution 
of the 


LOCK prefix instruction. 
The LOCK output goes active 


in the first CPU clock cycle following execution of the 
prefix and remains active until the clock following the 
completion of the instruction following the LOCK prefix. 
To 
provide 
bus 
access 
control 
in 
multiprocessor 


systems, the LOCK signal should be incorporated into 
the system bus arbitration logic resident to the CPU. 


During normal multiprocessor 
system operation, 
pri- 
ority of the shared system bus is determined by the ar- 
bitration 
circuitry 
on a cycle by cycle basis. As each 


CPU requires a transfer over the system bus, it requests 
access to the bus via its resident bus arbitration logic. 
When the CPU gains priority (determined by the system 
bus arbitration 
scheme and any associated 
logic), it 


takes control of the bus, performs its bus cycle and 
either maintains 
bus control, voluntarily 
releases the 


bus or is forced off the bus by the loss of priority. The 
lock mechanism prevents the CPU from losing bus con- 
trol (either voluntarily or by force) and guarantees a CPU 
the ability to execute multiple bus cycles (during execu- 


tion of the locked instruction) without intervention and 
possible 
corruption 
of the data by another 
CPU. A 


classic 
use of the mechanism is the 'TEST and SET 


semaphore' 
during 
which 
a CPU must read from a 


shared memory location and return data to the location 
without 
allowing another CPU to reference the same 


location between the TEST operation (read)and the SET 
operation (write). In the 8086 this is accomplished with a 
locked exchange instruction. 


LOCK XCHG 
reg, MEMORY; 
reg is any register 
;MEMORY 
is the address 
of the 


;semaphore 


The activity of the LOCK output is shown in Diagram 
3A1. Another interesting use of the LOCK for multiproc- 
essor systems Is a locked block move which allows high 
speed message transfer from one CPU's message buf- 
fer to another. 


During the locked instruction, 
a request for processor 


preemption (RQ/GT) is recorded but not acknowledged 
until completion 
of the locked instruction. 
The LOCK 


has no direct affect 
on interrupts. 
As an exam£1e, a 


locked HALT instruction will cause HOLD (or RQ/GT)re- 
quests to be ignored but will allow the CPU to exit the 
HALT state on an interrupt. In genera:, prefix bytes are 
considered extensions of the instructions they precede. 
Therefore, interrupts that occur during execution of a 
prefix are not acknowledged (assuming interrupts 
are 


enabled) until completion 
of the instruction 
following 


the prefixes (except for instructions which allow servic- 
ing interrupts during their execution, Le., HALT, WAIT 
and repeated string primitives). Note that multiple prefix 
bytes may precede an instruction. As another example, 
consider a 'string primitive' preceded by the repetition 
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other repeated string operation. 
As long as the opera· 
tion is not interrupted, 
LOCK remains active. Further in· 
formation 
on the 
operation 
of an interrupted 
string 
operation with multiple 
prefixes is presented in the sec- 
tion dealing with the 8086 interrupt 
structure. 


Three additional 
status lines (SO, 51, 52) are defined to 
provide communications 
with the 8288 and 8289. The 
status 
lines tell the 8288 when to Initiate a bus cycle, 
what type of command to issue and when to terminate 
the bus cycle. The 8288 samples the status lines at the 
beginning of each CPU clock (CLK). To initiate a bus cy· 
cle, the CPU drives the status 
lines from the passive 
state (SO, 51, 52= 1) to one of seven possible command 
codes (Table 3A2). This occurs on the rising edge of the 
clock during T4 of the previous bus cycle or a TI (idle cy- 
cle, no current bus activity). The 8288 detects the status 
change by sampling the status lines on the high to low 
transition 
of each clock cycle. The 8288 starts a bus cy· 
cle by generating 
ALE and appropriate 
buffer direction 
control 
In the clock cycle immediately 
following 
detec· 
tion of the status change (T1). The bus transceivers 
and 
the selected 
command 
are enabled 
in the next clock 
cycle (T2) (or T3 for normal write commands). When the 
status 
returns 
to the passive state, the 8288 will 
ter· 
mlnate the command as shown in Diagram 3A2. 5ince 
the CPU will not return the status to the passive state 
until 
the 'ready' 
indication 
is received, 
the 8288 will 
maintain 
active 
command 
and 
bus 
control 
for 
any 
number of wait cycles. The status 
lines may also be 
used by other 
processors 
on the 8086's local bus to 
monitor 
bus activity 
and control 
the 8288 if they gain 
control 
of the local bus. 
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The 8288 provides the bus control (DEN, DT/A, ALE) and 
commands 
(INTA, MRDC, 10RC, MWTC, AMWC, 10WC, 


AIOWC) removed from the CPU. The command structure 
has separate read and write commands 
for memory and 
I/O to provide compatibility 
with the Multlbus 
command 
structure. 


The advanced write commands 
are enabled one clock 
period earlier than the normal write to accommodate 
the 
wider write pulse widths 
often required by peripherals 
and static RAMs. The normal write provides data setup 
prior to write to accommodate 
dynamic RAM memories 
and 110devices which strobe data on the leading edge of 
write. The advanced write commands 
do not guarantee 
that data Is valid prior to the leading edge of the com- 
mand. The DEN signal in the maximum mode Is Inverted 
from the minimum 
mode to extend transceiver 
control 
by allowing 
logical 
conjunction 
of 
DEN with 
other 
signals. While not appearing to be a significant 
benefit 
in the basic maximum mode configuration, 
Introduction 
of interrupt 
control 
and various system configurations 
will 
demonstrate 
the 
usefulness 
of 
qualifying 
DEN. 


Diagram 3A3 compares the timing of the minimum 
and 
maximum 
mode bus transfer commands. 
Although 
the 
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processor 
enVironments, 
large 
single 
Vl"U 
oeslgns 


(either Multibus 
systems or greater than two PC boards) 


should also use the maximum mode. Since the 8288 is a 
bipolar dedicated 
controller 
device, its output drive for 


the commands (32 mAl and tolerances 
on AC character- 


istics 
(timing 
parameters 
and worse case delays) pro- 


vide better large system performance than the minimum 
mode 8086. 


vl"u, 
me 00<:00 provloes aOal1lanal snapping 
opllon5 ana 


controls 
to support 
multiprocessor 
configurations 
and 


peripheral 
devices on the CPU local bus. These capa- 


bilities 
allow 
assigning 
resources 
(memory 
or 110) as 


shared (available on the Multlbus 
system bUs) or private 


(accessible 
only by this CPU) to reduce contention 
for 


access to the Multibus 
system bus and improve multi- 


CPU system performance. 
Specific configuration 
possi- 


bilities 
are discussed 
in AP-51. 
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38. Clock Generation 


The 8086 requires a clock signal with fast rise and fall 
times 
(10 ns max) between 
low and high voltages 
of 


- 0.5 to + 0.6 low and 3.9 to VCC + 1.0 high. The max- 
imum clock frequency 
of the 8086 is 5 MHz and 8 MHz 


for the 8086-2. Since the design of the 8086 incorporates 
dynamic 
cells, 
a minimum 
frequency 
of 2 MHz is re- 


quired to retain the state of the machine. 
Due to the 


minimum 
frequency 
requirement, 
single 
stepping 
or 
cycling 
of the CPU may not be accomplished 
by dis- 


abling the clock. The timing and voltage requirements 
of 


the CPU clock are shown in Figure 3S1. In general, for 
frequencies 
below the maximum, 
the CPU clock need 


not satisfy 
the frequency 
dependent 
pulse width 
limi- 


tations 
stated 
in the 
8086 data 
sheet. 
The 
values 


specified 
only reflect the minimum 
values which must 


be satisfied 
and are stated 
in terms of the maximum 


clock frequency. As the clock frequency approaches the 
maximum 
frequency 
of the CPU, the clock 
must con- 
form to a 33% duty cycle to satisfy the CPU minimum 
clock low and high time specifications. 


An optimum 
33% duty cycle 
clock 
with 
the required 


voltage levels and transition 
times can be obtained with 


the 8284 clock generator (Fig. 3S2). Either an external 
frequency source or a series resonant crystal may drive 
the 8284. The selected 
source must oscillate 
at 3X the 


desired CPU frequency. 
To select the crystal 
Inputs of 
the 8284 as the frequency 
source for clock generation, 
the Fie Input to the 8284 must be strapped to ground. 
The strapping 
option allows selecting 
either the crystal 


or the external frequency 
Input as the source for clock 


generation. 
Although 
the 8284 provides an Input for a 


tank circuit 
to accommodate 
overtone 
mode crystals, 
fundamental 
mode crystals are recommended 
for more 


accurate and stable frequency generation. When selec- 
ting a crystal for use with the 8284, the series resistance 
should be as low as possible. 
Since other circuit 
com- 
ponents will tend to shift the operating 
frequency from 


resonance, 
the operating 
Impedance 
will 
typically 
be 


higher than the specified 
series resistance. 
If the at- 
tenuation 
of the oscillator's 
feedback 
circuit 
reduces 


the loop gain to less than one, the oscillator 
will fall. 


Since the oscillator 
delays In the 8284 appear as Induc- 
tive elements 
to the crystal, causing 
It to run at a fre- 


quency 
below 
that 
of 
the 
pure series 
resonance, 
a 


capacitor should be placed In series with the crystal and 
the X2 Input of the 8284. This capacitor serves to cancel 
this Inductive 
element. The value of the capacitor 
(Cl) 


must not cause the impedance of the feedback circuit to 
reduce the loop gain below one. The impedance of the 
capacitor 
15 a function 
of the operating 
frequency 
and 


can be determined 
from the following 
equation: 


XCl = 1/211·PCl 


XTAL 
CJY 
CL 


It is recommended 
that the crystal 
series 
resistance 


plus XCl be kept less than ~K ohms. This capacitor also 
serves to debias the crystal 
and prevent a DC voltage 


bias from straining 
and perhaps damaging 
the crystal- 


line structure. 
As the crystal 
frequency 
increases, 
the 


amount of capacitance 
should be decreased. For exam- 


ple, a 12 MHz crystal may require Cl n, 24 pF while 22 
MHz may require Cl n, 8 pF. If very close correlation 
with 
the 
pure series 
resonance 
is not 
necessary, 
a 


nominal Cl value of 12-15 pF may be used with a 15 MHz 
crystal (5 MHz 8086 operation). Soard layout and compo- 
nent variances will affect 
the actual amount of Induc- 


tance and therefore the series capacitance 
required to 


cancel 
it out (this is especially 
true for wire-wrapped 


layouts). 


Two of the many vendors which supply crystals for Intel 
microprocessors 
are listed In Table 3S1 along with a list 


of crystal 
part numbers for various frequencies 
which 


may be of interest. For additional 
Information 
on speci- 
fying crystals 
for Intel components 
refer to application 
note AP-35. 


f 
Parallell 
Crystek(l) 
CTS Knlght,(2 


Series 
Corp. 
Inc. 


15.0MHz 
S 
CY15A 
MP150 


18.432 
S 
CY19S· 
MP184· 


24.0 MHz 
S 
CY24A 
MP240 


°lntal 
also suppllas 
a crystal 
numbered 
8801 lor this 
application. 


Notaa: 
1. Addrass: 
1000 Crystal 
Onva, Fort Mayars, 
Florida 
33901 


2. Addrass: 
400 Ralmann 
Ava., Sandwich, 
illinois 


II a high accuracy frequency source, externally 
variable 
frequency 
source or a common 
source for driving mul- 


tiple 
8284's is desired, 
the External 
Frequency 
Input 


(EFI) of the 8284 can be selected by strapping the FICin· 
put to 5 volts through n,1K ohms (Fig. 3S3). The external 
frequency 
source 
should 
be TIl 
compatible, 
have a 


50% duty cycle and oscillate 
at three times the desired 


CPU operating 
frequency. The maximum 
EFI frequency 


the 8284 can accept 
is slightly 
above 24 MHz with 


minimum 
clock low and high times of 13 ns. Although 


no minimum EFI frequency is specified, it should not 
violate the CPU minimum clock rate. If a common fre- 
quency 
source 
is 
used 
to 
drive 
multiple 
8284's 
distributed throughout the system, each 8284 should be 
driven by its own line from the source. ,To minimize 
noise in the system, each line should be a twisted pair 
driven by a buffer like the 74lS04 with the ground of the 
twisted pair connecting the grounds of the source and 
receiver. To minimize clock skew, the lines to all 8284's 
should be of equal length. A simple technique for gen- 
erating a master frequency source for additional 8284's 
is shown in Figure 364. One 8284 with a crystal is used 
to generate the desired frequency. The oscillator output 
of the 8284 (OSC) equals the crystal frequency and is 
used to drive the external frequency to all other 8284's 
in the system. 
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The oscillator 
output 
is inverted from the oscillator 
signal used to drive the CPU clock generator circuit. 
Therefore, the oscillator output of one 8284 should not 
drive the EFI input of a second 8284 if both are driving 
clock 
inputs 
of separate CPU's that are to be syn- 


chronized. The variation on EFI to' ClK 
delay over a 
range of 8284's may approach 35 to 45 ns.lf, however, all 
8284's are of the same package type, have the same 
relative supply voltage and operate in the same tem- 
perature environment, the variation will be reduced to 
between 15 and 25 ns. 


There are three frequency outputs from the 8284, the 
oscillator 
(OSC) mentioned 
above, the system 
clock 
(ClK) 
which drives the CPU, and a peripheral clock 


(PClK) that runs at one half the CPU clock frequency. 
The oscillator output is only driven by the crystal and is 
not affected by the Fie strapping option. If a crystal is 
not connected to the 8284 when the external frequency 
input is used, the oscillator output is indeterminate. The 
CPU clock 
is derived 
from 
the 
selected 
frequency 


source 
by an internal 
divide 
by three counter. 
The 


counter generates the 33% duty cycle clock which is op- 
timum 
for 
the 
CPU at 
maximum 
frequency. 
The 
peripheral ciock has a 50% duty cycle and is derived 
from the CPU clock. Diagram 360 shows the relation- 
ship of ClK 
to OSC and PClK to ClK. The maximum 
skew is 20 ns between OSC and ClK, and 22 ns between 
ClK and PClK. 


Since the state of the 8284 divide by three counterls 
in- 


determinate at system initialization (power on), an exter- 
nal sync to the counter (CSYNC) is provided to allow 
synchronization 
of the CPU clock to an external event. 


When CSYNC is brought high, the ClK and PClK out- 
puts are forced high. When'CSYNC returns low, the next 
positive clock from the frequen'cy source starts clock 
generation. CSYNC must be active for a minimum of two 
periods of the frequency source. If CSYNC is asynchro- 
nous to the frequency source, the circuit in Figure 365 
should be used for synchronization. 
The two latches 


minimize the probability 
of a meta-stable state in the 
latch driving CSYNC. The latches are clocked with the 
inverse of the frequency source to guarantee the 8284 
setup and hold time of CSYNC to the frequency source 
(Diag. 361). If a single 8284 is to be synchronized to an 
external event and an external frequency source is not 
used, the oscillator output of the 8284 may be used to 


synchronize 
CSYNC (Fig. 386). Since the oscillator 
out- 


put is inverted from the Internal oscillator 
signal, the In· 


verter in the previous example is not required. If multiple 
8284's are to be synchronized, 
an external 
frequency 


source must drive all 8284's and a single CSYNC syn· 
chronlzation 
circuit 
must drive the CSYNC Input of all 


8284's (Fig. 3B7). Since activation 
of CSYNC may cause 


violation 
of CPU minimum clock low time, It should only 


be enabled during reset or CPU clock high. CSYNC must 
also be disabled a minimum 
of four CPU clocks before 


the end of reset to guarantee proper CPU reset. 
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Due to the fast transitions 
and high drive (5 mAl of the 


8284 ClK output, It may be necessary to put a 10 to 100 
ohm resistor 
In series with the clock 
line to eliminate 


ringing (resistor value depending on the amount of drive 
required). If multiple 
sources 
of ClK 
are needed with 


minimum 
skew, ClK 
can be buffered 
by a high drive 


device (74S241) with outputs tied to 5 volts through 100 
ohms to guarantee VOH = 3.9 min (8086 minimum clock 
input high voltage) (Fig. 388). A single 8284 should not 
be used to generate the ClK for multiple 
CPU's that do 


not share a common 
local (multiplexed) 
bus since the 


8284 synchronizes 
ready to the CPU and can only ac· 


commodate 
ready for a single CPU. If multiple 
CPU's 


share a local bus, they should be driven with the same 
clock to optimize 
transfer 
of bus control. 
Under these 
circumstances, 
only one CPU will be using the bus for a 


particular 
bus cycle which 
allows 
sharing 
a common 


READY signal (Fig. 389). 
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3C. 
Reset 


The 8086 requires a high active reset with minimum 
pulse width of four CPU clocks except after power on 
which requires a 50 I's reset pulse. Since the CPU Inter- 
nally synchronizes reset with the clock, the'reset Is in- 
ternally active for up to one clock period after the exter- 
nal reset. Non-Maskable Interrupts 
(NMI) or hold re- 
quests on RQ/GT which occur during the Internal reset, 
are not acknowledged. A minimum mode hold request 
or maximum mode RQ pulses active immediately after 
the internal reset will be honored before the first 
in- 
struction fetCh. 


From reset, the 8086 will condition the bus as shown in 
Table 3C1. The multiplexed 
bus will three-state upon 
detection 
of reset by the CPU. Other signals which 
three-state will be driven to the inactive state for one 
clock low interval prior to entering three-state (Fig. 3C1). 
In the minimum mode, ALE and HLDA are driven inac- 
tive and are not three-stated. In the maximum mode 
RQ/GT lines are held Inactive and the queue status in: 
dlcates no activity. The queue status will not indicate a 
reset of the queue so any user defined external circuits 
monitoring 
the queue should 
also 
be reset by the' 


system reset. 22K ohm pull-up resistors should be con- 
nected to the CPU command and bus control lines to 


guarantee' the inactive state of these,lines 
in systems 
where leakage currents or bus capacitance may cause 
the voltage levels to settle below the minimum 
high 
voltage of devices in the system. In maximum mode 
systems, 
the 8288 contains 
internal 
pull-ups on the 
SO-S2 inputs to maintain the inactive state for these 
lines when the CPU floats the bus. The high state of the 
status lines during reset causes the 8288 to treat the 
reset sequence as a passive state. The condition of the 
8288 outputs for the passive state are shown in Table 
3C2. If the reset occurs during a bus cycle, the return of 
the status lines to the passive state will terminate the 
bus cycle and return the command lines to the inactive 
state. Note that the 8288 does not three-state the com- 
mand outputs based on the passive state of the status 
lines. If the designer needs to three-state the CPU off 
the bus during reset in a single CPU system, the reset 
signal should also be connected to the 8288's AEN input 
and the 'output enable of the address latches (Fig. 3C2). 
This forces the command and address bus interface to 
three-state while the inactive state of DEN from the 8288 
three-states the transceivers on the data bus. 
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For multiple processor systems using arbitration of a 
multimaster bus, the system reset should be connected 
to the INIT input of the 8289 bus arbiter in addition to 
the 8284 reset input (Fig. 3C3).The low active INIT input 
forces all 8289 outputs to their inactive state. The inac- 
tive state of the 8289 AEN output will force the 8288 to 
three-state 
the 
command 
outputs 
and the 
address 


latches to three-state the address bus interface. DEN in- 
active from the 8288 will three-state the data bus inter- 
face. For the multimaster CPU configuration, the reset 
should be common to all CPU's (8289's and 8284's) and 
satisfy 
the 
maximum 
of 
either 
the 
CPU reset 
reo 


quirements 
or 3 TBLBL (3 8289 bus clock times) + 3 
TCLCL (3 8086 clock cycle times) to satisfy 8289 reset 
requirements. 
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through 2.2K ohm resistors. 


The reset signal to the 8086 can be generated by the 
8284. The 8284 has a schmitt 
trigger input (RES) for 


generating reset from a low active external reset. The 
hysteresis specified in the 8284 data sheet implies that 
at least .25 volts will separate the 0 and 1 switching 
point of the 8284 reset input. Inputs without hysteresis 
will switch from low to high and high to low at approxi- 
mately the same voltage 
threshold. 
The inputs 
are 


guaranteed to switch at specified low and high voltages 
(VIL and VIH) but the actual switching point is anywhere 
in-between. Since VIL min is specified at .8 volts, the 
hysteresis guarantees that the reset will be active until 
the input reaches at least 1.05 volts. A reset will not be 
recognized until the input drops at least .25 volts below 
the reset inputs VIH of 2.6 volts. 


To guarantee reset from power up, the reset input must 
remain below 1.05 volts for 50 microseconds after Vcc 
has reached the minimum supply voltage of 4.5 volts. 
The hysteresis allows the reset input to be driven by a 
simple 
RC circuit 
as 
shown 
in 
Figure 
3C4. The 


calculated RCvalue does not include time for the power 
supply to reach 4.5 volts or the charge accumulated dur- 
ing this interval. Without the hysteresis, the reset out- 
put might oscillate as the input voltage passes through 
the switching voltage of the input. The calculated RC 
value provides the minimum required reset period of 50 
microseconds 
for 8284's that switch at the 1.05 volt 


level and a reset period of approximately 
162 micro- 


seconds for 8284's that switch at the 2.6 volt level. If 
tighter tolerance between the minimum and maximum 
reset times 
is necessary, the reset circuit 
shown in 


Figure 3C5 might be used rather than the simple RC cir· 
cuit. This circuit provides a constant current source and 
a linear charge rate on the capacitor rather than the in- 
verse exponential 
charge rate of the RC circuit. 
The 


maximum reset period for this implementation 
is 124 


microseconds. 
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The 8284 synchronizes 
the reset input with the CPU 


clock to generate the RESET signal to the CPU (Fig. 
3C6). The output is also available as a general reset to 
the entire system. The reset has no effect on any clock 
circuits in the 8284. 
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system to accommodate memory and I/O devices that 
cannot transfer information 
at the maximum CPU bus 


bandwidth. 
Ready 
is 
also 
used 
in 
multiprocessor 


systems to force the CPU to wait for access to the 
system bus or Multibus 
system bus. To insert a wait 


state in the bus cycle, the READY signal to the CPU 
must be inactive (low) by the end of T2. To avoid inser- 
tion of a wait state, READY must be active (high) within 
a specified setup time prior to the positive transition 
during T3. Depending on the size and characteristics 
of 


the system, ready implementation 
may take one of two 


approaches. 


The classical 
ready implementation 
is to 
have the 


system 'normally not ready.' When the selected device 
receives the command (RDIWRlINTA)and has had suffi- 
cient 
time 
to 
complete 
the 
command, 
it 
activates 


READY to the CPU, allowing the CPU to terminate the 
bus cycle. This implementation is characteristic of large 
mUltiprocessor, 
Multibus 
systems or systems 
where 


propagation delays, bus access delays and device char- 
acteristics 
inherently slow down the system. For max- 
imum system performance, devices that can run with no 
wait states must return 'READY' within the previously 
described 
limit. 
Failure to respond in time will only 


result in the insertion of one or more wait cycles. 


An alternate technique is to have the system 'normally 
ready.' All devices are assumed to operate at the max- 
imum CPU bus bandwidth. Devices that do not meet the 
requirement must disable READY by the end of T2 to 
guarantee the insertion of wait cycles. This implementa- 
tion is typically 
applied to small single CPU systems 


and reduces the logic required to control 
the ready 


signal. Since the failure of a device requiring wait states 
to disable READY by the end of T2 will result in prema- 
ture termination 
of the bus cycle, the system timing 


must be carefully analyzed when using this approach. 


The 8086 has two different 
timing 
requirements 
on 


READY depending on the system implementation. For a 
'normally 
ready' system 
to 
insert 
a wait 
state, 
the 


READY must be disabled within 8 ns (TRYLCL)after the 
end of T2 (start of T3) (Diag. 301). To guarantee proper 
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operation of the 8086, the READY input must not change 
from ready to not ready during the clock low time of T3. 
For a 'normally 
not ready' system to avoid wait states, 
READY must be active within 
119 ns (TRYHCH) of the 


positive clock transition 
during T3 (Diag. 3D2). For both 
cases, 
READY 
must 
satisfy 
a hold 
time 
of 
30 ns 
(TCHRYX) from the T3 or TW positive clock transition. 


To generate a stable READY signal which satisfies 
the 
previous 
setup and hold times, the 8284 provides two 
separate system ready inputs (RDY1, RDY2) and a single 
synchronized 
ready output 
(READY) for the CPU. The 
RDY inputs are qualified 
with separate access enables 
(AEN1,AEN2, low active) to allow selecting 
one of the 
two 
ready signals 
(Fig. 3D1). The gated 
signals 
are 


logically 
OR'ed and sampled at the beginning 
of each 
ClK 
cycle to generate READY to the CPU (Diag. 3D3). 


The sampled READY signal Is valid within 8 ns (TRYlCl) 
after ClK 
to satisfy 
the CPU timing 
requirements 
on 


'not ready' and ready. Since READY cannot change until 
the next ClK, the hold time requirements 
are also satis- 


fied. The system ready inputs to the 8284 (RDY1,RDY2) 
must be valid 35 ns (TRIVCl) before T3 and AEN must be 
valid 60 ns before T3. For a system using only one RDY 
input, the associated 
AEN is tied to ground while the 
other AEN is connected 
to 5 volts through 
"-1K ohms 
(Fig. 3D2a). If the system generates a low active ready 
signal, it can be connected 
to the 8284 AEN input if the 
additional 
setup time required by the 8284 AEN Input is 
satisfied. 
In this case, the associated 
RDY Input would 
be tied high (Fig. 3D2b). 
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The majority of memory and peripheral devices which 
fail to operate at the maximum CPU frequency typically 
do not require more than one wait state. The circuit 
given in Figure 303 is an example of a simple wait state 
generator. The system ready line is driven low whenever 
a device requiring one wait state is selected. The flip 
flop is cleared by ALE, enabling ROY to the 8284. If no 
wait states are required, the flip flop does not change. If 
the system ready is driven low, the flip flop toggles on 
the low to high clock transition of T2 to force one wait 
state. The next low to high clock transition toggles the 
flip flop again to indicate ready and allow completion of 
the bus cycle. Further changes in the state of the flip 
flop will not affect the bus cycle. The circuit 
allows 
approximately 100 ns for chip select decode and condi- 
tioning of the system ready (Olag. 304). 


If the system is 'normally not ready,' the programmer 
should not assign executable code to the last six bytes 
of physical memory. Since the 8086 prefetches instruc- 
tions, 
the CPU may attempt 
to access non-existent 
memory when executing 
code at the end of physical 


memory. If the access to non-existent memory fails to 
enable READY, the system will 
be caught 
In an In- 


definite walt. 


3E. Interrupt Structure 


The 8086 interrupt structure is based on a table of inter- 
rupt vectors stored in memory locations 
OH through 
003FFH. Each vector consists of two bytes for the in- 
struction 
pointer and two bytes for the code segment. 


These two values combine to form the address of the In· 
terrupt service routine. This allows the table to contain 
up to 256 interrupt vectors which specify the starting ad- 
dress of the service routines anywhere in the one mega- 
byte address space of the 8086. If fewer than 256 differ- 
ent interrupts are defined in the system, the user need 
only allocate enough memory for the interrupt vector 
table to provide the vectors for the defined interrupts. 
During initial system debug, however, it may be desir- 
able to assign all undefined interrupt types to a trap 
routi ne to detect erroneous interrupts. 


Each vector is associated with an interrupt type number 
which points to the vector's location in the interrupt vec- 
tor table. The interrupt type number multiplied 
by four 
gives the displacement of the first byte of the associ- 
ated interrupt vector from the beginning of the table. As 
an example, Interrupt type number 5 points to the sixth 
entry in the interrupt vector table. The contents of this 
entry in the table points to the interrupt service routine 
for type 5 (Fig. 3E1). This structure allows the user to 
specify the memory address of each service routine by 
placing the address (instruction 
pointer and code seg- 


ment values) in the table location provided for that type 
interrupt. 
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Figure 
3E1. 
Direction 
to Interrupt 
Service 
Routrne 
through 
the 
Interrupt 
Vector 
Table 


All interrupts in the 8086 must be assigned an interrupt 
type which uniquely identifies each interrupt. There are 
three classes of interrupt types in the 8086; predefined 
interrupt types which are issued by specific functions 
within the 8086 and user defined hardware and software 
interrupts. 
Note that any interrupt 
type Including the 


predefined interrupts can be issued by the user's hard· 
ware and/or software. 


PREDEFINED INTERRUPTS 


The predefined interrupt 
types in the 8086 are listed 


below with a brief description 
of h,oweach is invoked. 
When Invoked, the CPU will 
transfer 
control 
to the 


memory location 
specified 
by the vector associated 


with the specific type. The user must provide the inter· 
rupt service routine and initialize 
the Interrupt vector 


table with the appropriate s'ervice routine address. The 
user may additionally 
Invoke these interrupts through 


hardware or software. If the preassigned function is not 
used in the system, the user may assign some other 
function 
to the associated 
type. However, for com· 
patibility with future Intel hardware and software prod- 
ucts for the 8086 family, interrupt types 0·31 should not 
be assigned as user defined interrupts. 


This interrupt type is invoked whenever a division opera· 
tion Is attempted during which the quotient exceeds the 
maximum value (ex. division by zero). The interrupt is 
non-maskable and is entered as part of the execution of 
the divide instruction. 
If interrupts are not reenabled by 


the divide error interrupt 
service routine, the service 


routine execution time should be included In the worst 
case divide Instruction execution time (primarily when 
considering the longest instruction 
execution time and 


Its effect on latency to servicing hardware interrupts). 


TYPE 1 - 
SINGLE STEP 


This interrupt type occurs one instruction 
after the TF 
(Trap Flag) is set in the flag register. It is used to allow 
software single stepping through a sequence of code. 
Single stepping is initiated by copying the flags onto the 
stack, setting the TF bit on the stack and popping the 
flags. The interrupt 
routine should be the single step 
routine. The interrupt sequence saves the flags and pro- 
gram counter, then resets the TF flag to allow the single 
step 
routine 
to execute 
normally. 
To return 
to the 


routine under test, an interrupt return restores the IP, 
CS and flags with TF set. This allows the execution of 
the next instruction 
in the program under test before 
trapping back to the single step routine. Single Step is 
not masked by the IF (Interrupt 
Flag) bit in the flag 


register. 


TYPE 2 - 
NMI (Non-Maskable Interrupt) 


This is the highest priority 
hardware interrupt and is 


non-maskable. The input is edge triggered but is syn- 
chronized with the CPU clock and must be active for two 
clock 
cycles to guarantee recognition. 
The interrupt 


signal may be removed prior to entry to the service 
routine. Since the input must make a low to high transi· 
tion to generate an interrupt, spurious transitions on the 
input should be suppressed. 
If the input is normally 


high, the NMI low time to guarantee triggering 
is two 


CPU clock times. This input is typically 
reserved for 


catastrophic 
failures like power failure or timeout of a 
system watchdog timer. 


TYPE 3 - 
ONE BYTE INTERRUPT 


This is invoked by a special form of the software inter· 
rupt instruction 
which requires a single byte of code 


space. Its primary use Is as a breakpoint interrupt for 
software debug. With full representation within a single 
byte, the instruction can map into the smallest instruc· 
tion for absolute resolution In setting breakpoints. The 
interrupt is not maskable. 


TYPE 4 - 
INTERRUPTON OVERFLOW 


This interrupt occurs if the overflow flag (OF) Is set in 
the flag register and the INTO instruction 
is executed. 


The instruction allows trapping to an overflow error ser· 
vice routine. The interrupt is non·maskable. 


Interrupt types 0 and 2 can occur without specific action 
by the programmer (except for performing a divide for 
Type 0) while types 1,3, and 4 require a conscious act by 
the programmer to generate these interrupt types. All 
but type 2 are invoked through software activity and are 
directly associated with a specific instruction. 


USER DEFINED SOFTWARE INTERRUPTS 


The user can generate an interrupt through the software 
with a two byte interrupt instruction 
INT nn. The first 


byte is the INT opcode while the second byte (nn) con· 
tains the type number of the interrupt to be performed. 
The INT instruction 
is not maskable by the interrupt 


enable flag. This instruction can be used to transfer con· 
trol to routines that are dynamically 
relocatable 
and 
whose location in memory is not known by the calling 
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instruction. Since the NMI is an asynchronous event to 
the prefix Immediately preceding t~e stnng pnmatlv~ IS 


the CPU, the point of recognition and initiation of the 
restored. :0 allow correct resumption of the operation, 


transfer of control is similar to the maskable hardware 
the following programming tect:lnlque may be used: 


interrupts. 
LOCKED$BLOCK$MOVE: 
LOCK REP MOVS OEST, CS:SOURCE 


AND CX, 
CX 


JNZ LOCKEO$BLOCK$MOVE 
USER DEFINED HARDWARE INTERRUPTS 


The maskable interrupts initiated by the system hard· 
ware are activated through the INTR pin of the 8086 and 
are masked by the IF bit of the status register (interrupt 
flag). During the last clock cycle of each instruction, the 
state of the INTR pin is sampled. The 8086deviates from 
this rule when the instruction is a MOV or POPto a seg- 
ment register. 
For this 
case, the Interrupts 
are not 
sampled until completion of the following 
Instruction. 
This allows a 32·bit pointer to be loaded to the stack 
pointer registers SS and SP without the danger of an in· 
terrupt occurring between the two loads. Another excep· 
tion is the WAIT instruction which walts for a low active 
Input on the TEST pin. This instruction 
also continuo 
ously samples the interrupt request during its execution 
and allows servicing interrupts during the wait. When an 
interrupt 
Is detected, the WAIT instruction 
is again 
fetched prior to servicing the interrupt to guarantee the 
interrupt routine will return to the WAIT instruction. 


The code bytes generated by the 8086 assembler for the 
MOVS instruction 
are (in descending 
order): LOCK 


prefix, REP prefiX, Segment Override prefix and MOVS. 
Upon return from the interrupt, the segment override 
prefix is restored to guarantee one additional transfer is 
performed between the correct memory locations. The 
instructions 
following 
the 
move operation 
test 
the 


repetition 
count value to determine if the move was 


completed and return if not. 


If the INTR pin is high when sampled and the IF bit is set 
to enable interrupts, 
the 8086 executes an interrupt 


acknowledge sequence. To guarantee the interrupt will 
be acknowledged, the INTR input must be held active 
until the interrupt acknowledge is issued by the CPU. If 
the BIU is running a bus cycle when the interrupt condi· 
tion is detected (as would occur if the BIU is fetching an 
instruction when the current Instruction completes), the 
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Interrupt aCKnowledgeis issued. If a hold request is also 
pending (this might occur if an interrupt and hold re- 
quest are made during execution of a locked instruc- 
tion), the interrupt is serviced after the hold request is 
serviced. 


The Interrupt acknowledge sequence Is only generated 
In response to an Interrupt on the 8086 INTR input. The 
associated bus activity is shown in Figure 3E2. The cy- 
cle consists of two INTA bus cycles separated by two 
Idle clock cycles. During the bus cycles the INTA com- 
mand Is issued rather than read. No address is provided 
by the 8086 during either bus cycle (BHE and status are 
valid), however, ALE is stili generated and wili load the 
address latches with indeterminate 
information. 
This 


condition 
require~ that devices in the system do not 


drive their outputs without being qualified by the Read 
Command. As will be shown later, the ALE is useful in 
maximum mode systems with multiple' 8259A priority In- 
terrupt controllers. 
During the INTA bus cycles, DT/R 


and DEN are conditioned to aliow the 8086 to receive a 
one byte 
Interrupt 
type 
number 
from 
the 
Interrupt 


system. The first INTA bus cycle signals an Interrupt 
acknowledge cycle Is In progress and aliows the system 
to prepare to present the Interrupt type number on the 
next INTA bus cycle. The CPU does not capture Informa- 
tion on the bus during the first cycle. The type number 
must be transferred to the 8086 on the lower half of the 
16-blt data bus during the second cycle. This Implies 
that devices which present Interrupt type numbers to 
the 8086 must be located on the lower half of the 16-blt 
data l:lus.The timing of the INTA bus cycles (with excep- 
tlpn of address timing) Is similar to read cycle timing. 
The 8086 Interrupt 
acknowledge 
sequence 
deviates 


from the form used on 8080 and 8085 In that no Instruc- 
tion Is Issued as part of the sequence. The 8080 and 
8085 required either a restart or call 
Instruction 
be 


Issued to affect the transfer of control. 


In the minimum mode system, the MilO signal wlil be 
low Indicating I/O during the INTA bus cycles. The 8086 
Internal LOCK signal will be active from T2 of the first 
bus cycle until T2 of the second to prevent the BIU from 
honoring a hold request between the two INTA cycles. 


In the maximum mode, the status lines SO-52 will re- 
quest the 8288 to activate the INTA output for each cy- 
cle. The LOCK output of the 8086 will be active from T2 
of the first cycle until T2 of the second to prevent the 
8086 from honoring a hold request on either RQ/GT in- 
put and to prevent bus arbitration logic from relinquish- 
Ing the bus between INTA's in multl·master systems. 
The consequences of READY are Identical to those for 
READ and WRITE cycles. 


Once the 8086 has the Interrupt type number (from the 
bus for hardware Interrupts, from the Instruction stream 
for 
software 
Interrupts 
or from the predefined 
con- 
dition), the type number Is multiplied by four to form the 
displacement 
to the corresponding 
Interrupt vector In 


the InterruDt vector table. The four bytes of the Interrupt 
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ing the transfer of control, the CPU pushes the flags and 
current code segment register and Instruction 
pointer 


onto the stack. The new code segment and instruction 
pointer values are loaded and the single step and inter- 
rupt flags are reset. Resetting the interrupt flag disables 
response to further hardware Interrupts in the service 
routine unless the flags are specifically 
re·enabled by 


the service routine. The CS and IP values are read from 
the interrupt vector table with data read cycles. No seg- 
ment registers are used when referencing the vector 
table during the interrupt 
context 
switch. The vector 


displacement is added to zero to form the 20-bit address 
and S4, 53 = 10 indicating 
no segment register selec- 


tion. 


The actual bus activity associated with the hardware in- 
terrupt acknowledge sequence is as follows: Two inter- 
rupt acknowledge bus cycles, read new IP from the in- 
terrupt vector table, read new CS from the interrupt vec- 
tor table, Push flags, Push old CS, Opcode fetch of the 
first 
instruction 
of the interrupt 
service routine, and 


Push old IP. After saving the old IP, the BIU will resume 
normal operation of prefetching 
instructions 
Into the 


queue and servicing EUrequests for operands. S5 (inter- 
rupt enable flag status) will go Inactive in the second 
clock cycle following reading the new es. 


The number of clock cycles from the end of the instruc- 
tion during which the interrupt occurred to the start of 
interrupt routine execution is 61 clock cycles. For soft- 
ware generated interrupts, the sequence of bus cycles 
is the same except no interrupt acknowledge bus cycles 
are executed. This reduces the delay to service routine 
execution to 51 clocks for INT nn and single step, 52 
clocks for INT3 and 53 clocks for INTO. The same inter- 
rupt setup requirements with respect to the BIU that 
were stated for the hardware Interrupts also apply to the 
software interrupts. If wait states are inserted by either 
the memories or the device supplying the interrupt type 
number, the given clock times will Increase accordingly. 


When considering 
the 
precedence 
of 
interrupts 
for 


multiple simultaneous 
interrupts, 
the following 
guide- 


lines apply: 1. INTR is the only maskable interrupt and if 
detected simultaneously with other Interrupts, resetting 
of IF by the other interrupts will mask INTR.This causes 
INTR to be the lowest priority interrupt serviced after all 
other 
interrupts 
unless 
the 
other 
interrupt 
service 


routines reenable interrupts. 2. Of the nonmaskable in- 
terrupts (NMI, Single Step and software generated), in 
general, Single Step has highest priority (will be ser- 
viced first) followed by NMI, followed by the software In- 
terrupts. 
This Implies 
that a simultaneous 
NMI and 


Single Step trap wlli cause the NMI service routine to 
follow single step; a simultaneous 
software trap and 


Single Step trap will cause the software interrupt ser- 
vice routine to follow single step and a simultaneous 
NMI and software 
trap will 
cause the 
NMI service 


routine to be executed followed by the software Inter- 
rupt service routine. An exception to this priority struc- 
ture occurs if all three Interrupts are pending. For this 
case, transfer of control to the software interrupt ser- 


vice routine followed by the NMI trap will cause both the 
NMI and software 
Interrupt 
service 
routines 
to be ex- 
ecuted 
without 
single 
stepping. 
Single 
stepping 


resumes upon execution of the Instruction 
following 
the 


Instruction 
causing 
the software 
interrupt 
(the next in- 


struction 
In ·the routine being single stepped). 


If the user does not wish to single step before INTR ser· 
vice routines, the single step routine need only disable 
Interrupts 
during execution 
of the program being single 


stepped and reenable interrupts 
on entry to the single 


step 
routine. 
Disabling 
the interrupts 
during 
the pro- 
gram under test prevents entry into the interrupt service 
routine 
while single 
step (TF = 1) is active. To prevent 


single stepping 
before NMI service routines, the single 


step routine must check the return addresl\ on the stack 
for the NMI service routine address and return control to 
that routine without 
single step enabled. As examples, 
consider 
Figures 
3E3a and 3E3b. In 3E3a Single Step 


and NMI occur simultaneously 
while in 3E3b, NMI, INTR 


and a divide error all occur during a divide instruction 
being single stepped. 


SYSTEM CONFIGURATIONS 


To accommodate 
the INTA protocol 
of the maskable 


hardware Interrupts, the 8259A is provided as part of the 
8086 
fl[lmily. 
This 
component 
Is 
programmable 
to 


operate in both 6080/6085 systems 
and 6086 systems. 


The devices 
are cascadable 
In master/slave 
arrange- 


ments to allow up to 64 Interrupts In the system. Figures 
3E4 and 3E5 are examples of 8259A's In minimum 
and 


maximum mode 8086 systems. The minimum mode con· 
figuration 
(a) shows an 8259A connected 
to the CPU's 


multiplexed 
bus. Configuration (b) illustrates an 8259A 


connected to a demultiplexed 
bus system. These inter- 
connects 
are 
also 
applicable 
to 
maximum 
mode 


systems. The configuration 
given for a maximum mode 


system shows a master 8259A on the CPU's multiplexed 
bus with additional 
slave 8259A's out on the buffered 


system bus. This configuration 
demonstrates 
several 


unique features of the maximum mode system inter- 
face. If the master 8259A receives interrupts from a mix 
of slave 8259A's and regular interrupting 
devices, the 


slaves must provide the type number for devices con- 
nected to them while the master provides the type 
number for devices directly attached to Its interrupt in- 
puts. The master 8259A is programmable to determine if 
an interrupt is from a direct input or a slave 8259A and 
will use this information to enable or disable the data 
bus transceivers 
(via the 'nand' function 
of DEN and 


EN). If the master must provide the type number, it will 
disable the data bus transceivers. If the slave provides 
the type number, the master will enable the data bus 
transceivers. The EN output is normally high to allow 


the 8086/8288 
to control the bus transceivers. To select 


the proper slave when servicing a slave Interrupt, the 
master must provide a cascade address to the slave. If 
the 8288 is not strapped in the I/O bus mode (the 8288 
lOB input connected to ground), the MCEJPDENoutput 
becomes a MCE or Master Cascade Enable output. This 
signal is only active during INTA cycles as shown in 
Figure 3E6 and enables the master 8259A's cascade ad- 
dress onto the 8086's local bus during ALE. This allows 
the address latches to capture the cascade address with 
ALE and allows 
use of the system address bus for 


selecting the proper slave 8259A. The MCE is gated with 
LOCK to minimize local bus contention 
between the 


8086 three·stating its bus outputs and the cascade ad- 
dress being enabled onto the bus. The first INTA bus cy- 
cle allows the master to resolve internal priorities and 
output 
a cascade address to be transmitted 
to the 


slaves on the subsequent INTA bus cycle. For additional 
information 
on the 8259A, reference application 
note 


AP-59. 


b. 


Figure 
3E4. 
Mln Mode 
8086 with 
Mester 
8259A on the Local 
Bus and Slave 8259As 
on the System 
Bus 
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3F. Interpreting the 8086 Bus Timing Diagrams 


At first glance, the 8086 bus timing diagrams (Diag. 3F1 
min mode and Diag. 3F2 max mode) appear rather com- 
plex. However, with a few words of explanation on how 
to interpret them, they become a powerful tool in deter- 
mining system requirements. The timing diagrams for 
both the minimum and maximum modes may be divided 
into six sections: (1)address and ALE timing; (2)read cy- 
cle timing; (3) write cycle timing; (4) interrupt acknowl- 
edge timing; (5) ready timing; and (6) HOLD/HLDA or 
RQ/GT timing. 
Since the A.C. characteristics 
of the 
signals are specified relative to the CPU clock, the rela- 
tionship 
between the majority 
of signals can be de- 
duced by simply determining the clock cycles between 
the clock edges the signals are relative to and adding or 
subtracting 
the 
appropriate 
minimum 
or 
maximum 
parameter values. One aspect of system timing not com- 
pensated for in this approach is the worst case relation- 
ship between minimum and maximum parameter values 
(also known as tracking relationships). As an example, 
consider a signal which has specified 
minimum and 
maximum turn on and turn off delays. Depending on 
device characteristics, 
it may not be possible for the 
component to simultaneously demonstrate a maximum 
turn-on and minimum turn-off delay even though worst 
case analysis might imply the possibility. This argument 
is characteristic 
of MOS devices and is therefore ap- 
plicable to the 8086 A.C. characteristics. 
The message 
is: worst case analysis mixing minimum and maximum 
delay parameters will typically exceed the worst case 
obtainable and therefore should not be subjected to fur- 
ther subjective degradation to obtain worst-worst case 
values. This section will provide guidelines for specific 
areas of 8086 timing sensitive to tracking relationships. 


A. MINIMUM MODE BUS TIMING 


1. ADDRESS and ALE 


The address/ALE timing 
relationship 
is important 
to 
determine the ability to capture a valid address from the 
multiplexed 
bus. Since the 8282 and 8283 latches cap- 
ture the address on the trailing edge of ALE, the critical 
timing involves the state of the address lines when ALE 
terminates. If the address valid delay is assumed to be 
maximum TCLAV and ALE terminates 
at its earliest 
point, TCHLLmin (assuming zero minimum delay), the 
address would be valid only TCLCHmin-TCLAVmax= 8 
ns prior to ALE termination. This result is unrealistic in 
the assumption 
of 
maximum 
TCLAV and minimum 
TCHLL. To provide an accurate 
measure of the true 
worst 
case, 
a 
separate 
parameter 
specifies 
the 
minimum time for address valid prior to the end of ALE 
(TAVAL). TAVAL= TCLCH-60 ns overrides 
the clock 
related timings and guarantees 58 ns of address setup 
to ALE termination 
for a 5 MHz 8086. The address is 
guaranteed to remain valid beyond the end of ALE by the 
TLLAX parameter. This specification 
overrides the rela- 
tionship between TCHLL and TCLAX which might seem 
to imply the address may not be valid by the end of the 
latest possible ALE. TLLAX holds for the entire address 
bus. The TCLAXmin spec on the address indicates the 
earliest the bus will go invalid if not restrained by a slow 
ALE. TLLAX and TCLAX apply to the entire multiplexed 
bus for both read and write cycles. AD15-Q is three- 


stated 
for read cycles 
and immediately 
switched 
to 
write data during write cycles. AD19-16 Immediately 
switch from address to status for both read and write 
cycles. The minimum ALE pulse width Is guaranteed by 
TLHLLmin which takes precedence over the value ob- 
tained by relating TCLLHmax and TCHLLmln. 


To determine the worst case delay to valid address on a 
demultiplexed 
address bus, two paths must be con- 
sidered: (1) delay of valid address and (2) delay to ALE. 
Since the 8282 and 8283 are flow through latches, a valid 
address is not transmitted to the address bus until ALE 
is active. A comparison of address valid delay TCLAV- 
max with ALE active delay TCLLHmax Indicates TCLAV- 
max is the worst 
case. Subtracting 
the latch 
prop- 


agation delay gives the worst case address bus valid 
delay from the start of the bus cycle. 


2. Read Cycle Timing 


Read timing consists of conditioning the bus, activating 
the read command and establishing the data transceiver 
enable and direction controls. DT/R is established early 
In the bus cycle and requires no further consideration. 
During read, the DEN signal must allow the transceivers 
to propagate data to the CPU with the appropriate data 
setup time and continue to do so until the required data 
hold time. The DEN turn on delay allows 
TCLCL+ 
TCHCLmin - TCVCTVmax - TDVCL = 127 nStransceiver 
enable time prior to valid data required by the CPU. 
Since the CPU data hold time TCLDXmin and minimum 
DEN turnoff deiay TCVCTXmin are both 10 ns relative to 
the same clock edge, the hold time is guaranteed. Addi- 
tionally, DEN must disable the transceivers prior to the 
CPU redriving the bus with the address for the next bus 
cycle. The maximum DEN turn off delay (TCVCTXmax) 
compared with the minimum delay for addresses out of 
the 
8086 (TCLCL+ TCLAVmin) 
indicates 
the 
trans- 
ceivers are disabled at least 105 ns before the CPU 
drives the address onto the multiplexed bus_ 


If memory or I/O devices are connected directly to the 
multiplexed address and data bus, the TAZRL parameter 
guarantees the CPU will float the bus before activating 
read and allowing the selected device to drive the bus. 
At the end of the bus cycle, the TRHAV parameter spec- 
ifies the bus float delay the device being deselected 
must satisfy to avoid contention with the CPU driving 
the address for the next bus cycle. The next bus cycle 
may start as soon as the cycle following 
T4 or any 


number of clock cycles later. 


The minimum delay from read active to valid data at the 
CPU is 2TCLCL - TCLRLmax - TDVCL = 205 ns. The 
minimum pulse width is 2TCLCL-75 
ns=325 
ns. This 
specification 
(TRLRH) overrides the result which could 
be 
derived 
from 
clock 
relative 
delays 
(2TCLCL- 
TCLRLmax + TCLRHmin). 


3. Write Cycle Timing 


The write 
cycle involves providing write data to the 
system, generating the write command and controlling 
data bus transceivers. The transceiver direction control 
signal DT/R is conditioned to transmit at the end of each 
read cycle and does not change during a write cycle. 
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corrupting 
the address 
on the 
multiplexed 
bus. The 


write data and write command are both enabled from the 
leading 
edge of T2. Comparing 
minimum 
WR active 


delay TCVCTVmln with the maximum 
write data delay 


TCLDV Indicates 
that write data may be not valid until 
100 ns after write Is active. The devices in the system 
should 
capture 
data on the trailing 
edge of the write 


command 
rather than the leading 
edge to guarantee 


valid data. The data from the 8086 is valid a minimum of 
2TCLCL - TCLDVmax + TCVCTXmin = 300 ns before the 
trailing edge of write. The minimum write pulse width Is 
TWLWH = 2TCLCL - 60 ns = 340 ns. The CPU maintains 
valid write data TWHDX ns after write. The TWHDZ spec- 
ification 
overrides 
the 
result 
derived 
by 
relating 


TCLCHmin 
and TCHDZmin 
which 
implies 
write 
<jata 


may only be valid 18 ns afterWR. The 8086 floats the bus 
after write only if being forced off the bus by a HOLD or 


next bus cycle. As with the read cycle, the next bus cy- 
cle may start In the clock cycle following 
T4 or any clock 


cycle later. 


DEN 
Is 
disabled 
a 
minimum 
of 
TCLCHmin + 


TCVCTXmln - TCVCTXmax = 18 
ns 
after 
write 
to 


guarantee data hold time to the selected device. Since 
we are again evaluating a minimum TCVCTX with a max- 
Imum TCVCTX, the real minimum delay from the end of 
write to transceiver 
disable is approximately 
60 ns. 


4. Interrupt Acknowledge 
Timing 


The Interrupt acknowledge 
sequence consists 
of two In- 


terrupt 
acknowledge 
bus 
cycles 
as 
previously 
de- 


scribed. The detailed timing of each cycle Is Identical to 
the read cycle timing 
with two exceptions: 
command 


timing and address/data 
bus timing. 
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e. THE ISSUANCE 
OF THE 8211 COMMAND 
AND CONTROL 
SIGNALS 
(IifR'De, 
~, 
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four mUltlplexea aaaress/status lines 00 nal 1111""-''''''''. - 
The address value on A19·A16 is indeterminate but the 
status information 
will be valid (S3= 0, S4 = 0, S5 = IF, 
S6= 0, S7= BHE = 0). The 
multiplexed 
address/data 
lines will remain in three·state until the cycle afterT4 of 
the INTA cycle. This sequence occurs for each of the 
INTA bus cycles. The interrupt type number read by the 
8086 on the second INTA bus cycle must satisfy the 
bame setup and hold times required for data during a 
read cycle. 


The DEN and DT/Rsignals are enabled for each INTA cy· 
cle and do not remain active between the two cycles. 
Their timing for each cycle is identical to the read cycle. 


The INTA command has the same timing as the write 
command. It is active within 110 ns of the start of T2 pro- 
viding 260 ns of access time from command to data 
valid at the 8086. The command is active a minimum of 
TCVCTXmin = 10 ns into T4to satisfy the data hold time 
of the 8086.This provides minimum INTA pulse width of 
300 ns, however taking signal delay tracking into con· 
sideration gives a minimum pulse width of 340 ns. Since 
the maximum inactive delay of INTA is TCVCTXmax= 
110 ns and the CPU will not drive the bus until 15 ns 
(TCLAVmln) into the next clock cycle, 105 ns are avail· 
able for interrupt devices on the local bus to float their 
outputs. If the data bus is buffered, DEN provides the 
same amount of time for local bus transceivers to three· 
state their outputs. 


5. Ready Timing 


The detailed 
timing 
requirements 
of the 8086 ready 
signal and the system ready signal into the 8284 are 
described 
in Section 3D. The system ready signal is 
typically generated from either the address decode of 
the selected device or the address decode and the com- 
mand (RD, WR, INTA). For a system which is normally 
not ready, the time to generate ready from a valid ad- 
dress 
and 
not 
insert 
a 
wait 
state, 
is 
2TCLCL- 
TCLAVmax - TR1VCLmax = 255 ns. This time is avail- 
able for buffer delays and address decoding to deter- 
mine if the selected device does not require a wait state 
and drive the ROY line high. If wait cycles are required, 
the user hardware must provide the appropriate ready 
delay. Since the address will not change until the next 
ALE, the ROY will remain valid throughout the cycle. If 
the system is normally ready, selected devices requiring 
wait states also have 255 ns to disable the ROYline. The 
user circuitry 
must delay re·enabling ROY by the ap· 


propriate number of wait states. 


If the RD command is used to enable the ROY signal, 
TCLCL - TCLRLmax - TRIVCLmax = 15 ns are available 
for external logic. If the WR command is used, TCLCL- 
TCVCTVmax - TRIVCLmax = 55 ns are available. Com· 
parlson of ROY control 
by address or command in· 
dicates that address decoding provides the best timing. 
If the system is normally not ready, address decode 
alone could be used to provide ROY for devices not reo 
quiring wait states while devices requiring wait states 
may use a combination 
of address decode and com· 
mand to activate a wait state generator. If the system is 


combination 
of address decode and command to ac- 


tivate a delay to re-enable ROY. 


If the system requires no wait states for memory and a 
fixed number of wait states for AD and WR to all I/O 
devices, the M/iO signal can be used as an early indica- 
tion of the need for wait cycles. This allows a common 
circuit 
to control 
ready timing 
for the entire system 
without feedback of address decodes. 


6. Other Considerations 


Detailed HOLD/HLDA timing is covered in the next sec- 
tion and is not examined here. One last signal con- 
sideration 
needs to be mentioned 
for the minimum 
mode system. The TEST input is sampled by the 8086 
only during execution of the WAIT instruction. The TEST 
signal should be active for a minimum of 6 clock cycles 
during the WAIT instruction 
to guarantee detection. 


B. MAXIMUM MODE BUS TIMING 


The maximum mode 8086 bus operations are logically 
equivalent to the minimum 
mode operation. 
Detailed 
timing analysis now involves signals generated by the 
CPU and the 8288 bus controller. The 8288 also provides 
additional control and command signals which expand 
the flexibility 
of the system. 


In the maximum mode, the address information 
con- 


tinues to come from the CPU while the ALE strobe is 
generated by the 8288.To determine the worst case rela· 
tionships 
between ALE and the address, we first must 


determine 
8288 ALE activation 
relative to the So-S2 
status 
from 
the 
CPU. The maximum 
mode timing 
diagram specifies two possible delay paths to generate 
ALE. The first is TCHSV + TSVLH measured from the ris- 
ing edge of the clock cycle preceding T1. The second 
path is TCLLH measured from the start of T1. Since the 
8288 initiates a bus cycle from the status lines leaving 
the passive state (SO-52 = 1),if the 8086 is late in issuing 
the status (TCHSVmax) while the clock high time is a 
minimum (TCHCLmin), the status will not have changed 
by the start of T1 and ALE is issued TSVLH ns after the 
status changes. If the status changes prior to the begin- 
ning of n, the 8288 will not issue the ALE until TCLLH 
ns after the start of T1. The resulting worst case delay to 
enable ALE (relative to the start of T1) is TCHSVmax + 
TSVLHmax - TCHCLmin = 58 
ns. 
Note, 
when 
calcu· 


lating signal relationships, 
be sure to use the proper 
maximum mode values rather than equivalent minimum 
mode values. 


The trailing edge of ALE is triggered in the 8288 by the 
positive 
clock edge in T1 regardless of the delay to 
enable ALE. The resulting minimum ALE pulse width is 
TCLCHmax-58ns=75ns 
assuming 
TCHLL=O. 


TCLCHmax must be used since TCHCLmin was as- 
sumed to derive the 58 ns ALE enable delay. The ad- 
dress 
is 
guaranteed 
to 
be 
valid 
TCLCHmin + 
TCHLLmin - TCLAVmax = 8 ns prior to the trailing edge 


of ALE to capture the address in the 8282 or 8283 
latches. Again we have assumed a very conservative 
TCHLL = O. Note, since the address and ALE are driven 
by separate devices, no tracking of A.C. characteristics 
can be assumed. 


The address hold time to the latches is guaranteed by 
the address remaining valid until the end of T1 while 
ALE is disabled a maximum of 15 ns from the positive 
clock transition 
in T1 (TCHCLmin - TCHLLmax = 52 ns 
address hold time). The multiplexed 
bus transitions 
from address to status and write data or three-state (for 
read) are identical to the minimum mode timing. Also, 
since the address valid delay (TCLAV) remains the 
critical path in establishing a valid address, the address 
access times to valid data and ready are the same as the 
minimum mode system. 


2. Read Cycle Timing 


The 
maximum 
mode 
system 
offers 
read 
signals 


generated by both the 8086 and the 8288. The 8086 RD 
output signal timing is identical to the minimum mode 
system. Since the A.C. characteristics of the read com- 
mands generated by the 8288 are significantly 
better 
than the 8086 output, access to devices on the demul· 
tiplexed buffered system bus should use the 8288 com· 
mands. The 8086 RD signal is available for devices 
which 
reside directly 
on the 
multiplexed 
bus. The 
following 
evaluations 
for 
read, write 
and 
interrupt 


acknowledge only consider the 8288 command timing. 


The 8288 provides separate memory and I/O read signals 
which conform to the same A.C. characteristics. 
The 


commands are issued TCLML ns after the start of T2 
and terminate 
TCLMH ns after the start of T4. The 
minimum command length is 2TCLCL- TCLMLmax + 
TCLMLmin = 375 ns. The access time to valid data at the 
CPU 
is 
2TCLCL- TCLMLmax - TDVCLmax = 335 
ns. 


Since the 8288 was designed for systems with buffered 
data busses, the commands are enabled before the CPU 
has three-stated the multiplexed bus and should not be 
used with devices which reside directly on the multi- 
plexed bus (to do so could result in bus contention duro 
ing 8086 bus float and device turn·on). 


The direction control for data bus transceivers is estab· 
lished in T1 while the transceivers are not enabled by 
DEN until the positive clock transition of T2. This pro- 
vides TCLCH + TCVNVmin = 123 ns for 8086 bus float 
delay 
and 
TCHCLmin 
+ TCLCL - TCVNVmax- 


TDVCLmax = 187 ns of transceiver active to data valid at 
the CPU. Since both DEN and command are valid a mini· 
mum of 10 ns into T4, the CPU data hold time TCLDX is 
guaranteed. A maximum DEN disable of 45 ns (TCVNX 
maxI guarantees the transceivers are disabled by the 
start of the next 8086 bus cycle (215 ns minimum from 
the same clock edge).On the positive clock transition of 
T4, DT/R is returned to transmit 
in preparation for a 


possible write operation on the next bus cycle. Since 
the system memory and I/O devices reside on a buffered 
system bus, they must three-state their outputs before 
the device for the next bus cycle is selected (approxi- 
mately 2TCLCL) or the transceivers drive write data onto 
the bus (approximately 2TCLCL). 


3. Write Cycle Timing 


In the maximum mode, the 8288 provides normal and ad· 
vanced write commands for memory and I/O. The ad· 
vanced write commands are active a full clock cycle 
ahead of the normal write commands and have timing 
identical 
to the read commands. The advanced write 


pulse width is 2TCLCL- TCLMLmax+ TCLMHmin= 375 
ns while the normal write 
pulse width 
is TCLCL- 


TCLMLmax + TCLMHmin = 175 ns. Write 
data 
setup 


time to the selected device is a function of either the 
data valid delay from the 8086(TCLDV)or the transceiver 
enable delay TCVNV. The worst case delay to valid write 
data is TCLDV= 110 ns minus transceiver propagation 
delays. This implies the data may not be valid until 100 
ns after the advanced write command but will be valid 
approximately 
TCLCL- TCLDVmax+ TCLMLmin= 100 


ns prior to the leading edge of the normal write com· 
mand. 
Data 
will 
be 
valid 
2TCLCL- TCLDVmax+ 


TCLMHmin = 300 ns before the trailing edge of either 
write command. The data and command overlap for the 
advanced command is 300 ns while the overlap with the 
normal write command is 175 ns. The transceivers are 
disabled 
a 
minimum 
of 
TCLCHmin - TCLMHmax+ 


TCVNXmin = 85 ns after the write command while the 
CPU provides valid data a minimum of TCLCHmin- 
TCLMHmax + TCHDZmin = 85 ns. This guarantees write 
data hold of 85 ns after the write command. The trans· 
ceivers 
are 
disabled 
TCLCL - TCVNXmax 
+ 


TCHDTLmin = 155 ns (assuming TCHDTL = 0) prior to 
transceiver 
direction 
change for a subsequent 
read 
cycle. 


4. Interrupt Acknowledge Timing 


The maximum mode INTA sequence is logically 
iden- 


tical to the minimum mode sequence. The transceiver 
control (DEN and DT/R) and INTA command timing of 
each interrupt 
acknowledge cycle is identical 
to the 
read cycle. As in the minimum mode system, the multi· 
plexed address/data bus will float from the leading edge 
of T1 for each INTA bus cycle and not be driven by the 
CPU until after T4 of each INTA cycle. The setup and 
hold times on the vector number for the second cycle 
are the same as data setup and hold for the read. If the 
device prOViding the interrupt 
vector number is con- 


nected 
to 
the 
local 
bus, 
TCLCL - TCLAZmax + 


TCLMLmin = 130 ns are available from 8086 bus float to 
INTA command active. The selected device on the local 
bus must disable the system data bus transceivers 
since DEN is still generated by the 8288. 


If the 8288 is not in the lOB (I/O Bus) mode, the 8288 
MCE/PDEN output becomes the MCE output. This out· 
put is active during each INTA cycle and overlaps the 
ALE signal during T1. The MCE is available for gating 
cascade addresses from a master 8259A onto three of 
the upper AD15·AD8 lines and allowing ALE to latch the 
cascade address into the address latches. The address 
lines may then be used to provide CAS address selec- 
tion to slave 8259A's located on the system bus (refer· 
ence Figure 3E5).MCE is active within 15 ns of status or 
the start of T1 for each INTA cycle. MCE should not 
enable the CAS lines onto the multiplexed bus during 
the first cycle since the CPU does not guarantee to float 


the bus until 80 ns into the first INTA cycle. The first 
MCE can be inhibited 
by gating MCE with LOCK. The 


8086 LOCK output 
is activated during T2 of the first 


cycle and disabled during T2 of the second cycle. The 
overlap of LOCK with MCE allows the first MCE to be 
masked and the second MCE to gate the cascade ad- 
dress onto the local bus. Since the 8259A will not pro- 
vide a cascade address until the second cycle, no infor- 
mation is lost. As with ALE, MCE is guaranteed valid 
within 58 ns of the start of T1 to allow 75 ns CAS ad- 
dress setup to the trailing edge of ALE. MCE remains 
active 
TCHCLmin - TCHLLmax + TCLMCLmin = 52 ns 


after ALE to provide data hold time to the latches. 


If the 8288 is strapped in the lOB mode, the MCE output 
becomes PDEN and all I/O references are assumed to be 
devices on the local bus rather than the demultiplexed 
system 
bus. Since 
INTA cycles 
are considered 
I/O 


cycles, all interrupts 
are assumed to come from the 


local system and cascade addresses are not gated onto 
the system address bus. Additionally, the DEN signal is 
not enabled since no I/O transfers occur on the system 
bus. If the local I/O bus is also buffered by transceivers, 
the POEN signal is used to enable those transceivers. 
PDEN A.C. characteristics 
are identical 
to DEN with 


PDEN enabled for I/O references and DEN enabled for 
instruction 
or memory data references. 


5. Ready Timing 


Ready timing based on address valid timing is the same 
for maximum and minimum mode systems. The delay 
from 8288 command valid to ROY valid at the 8284 is 
TCLCL - TCLMLmax - TRIVCLmin = 130 ns. This time is 
available for external circuits to determine the need to 
insert wait states and disable ROY or enable ROY to 
avoid wait states. INTA, all read commands and ad- 
vanced write commands provide this timing. The normal 
write command is not valid until after the ROY signal 
must be valid. Since both normal and advanced write 
commands 
are generated 
by the 8288 for all write 


cycles, the advanced write may be used to generate a 
ROY indication 
even though the selected device uses 


the normal write command. 


Since sepa~te commands are provided for memory and 
110, no MilO signal is specifically 
available as in the 


minimum mode to allow an early 'wait state required' in- 
dication for 110 devices. The S2 status line, however is 
logically equivalent to the MilO signal and can be used 
for this purpose. 


6. Other Considerations 


The RO/GTtiming is covered in the next section and will 
not be duplicated here. The only additional signals to be 
considered in the maximum mode are the queue status 
lines 
OSO, OS1. These signals 
are changed on the 


leading edge of each clock cycle (high to low transition) 
including 
idle and walt cycles (the queue status is in- 


dependent of the bus activity). External logic may sam- 
ple the lines on the low to high transition of each clock 
cycle. When sampled, the signals indicate the queue ac- 
tivity in the previous clock cycle and therefore lag the 
CPU's activity 
by one cycle. The TEST input require- 


ments are identical to those stated for the minimum 
mode. 


To inform the 8288 of HALT status when a HALT instruc- 
tion is executed, the 8086 will initiate a status transition 
from passive to HALT status. The status change will 
cause the 8288 to emit an ALE pulse with an indeter- 
minate address. Since no bus cycle is initiated (no com- 
mand is issued), the results of this address will not af- 
fect CPU operation (Le., no response such as READY is 
expected from the system). This allows external hard- 
ware to latch and decode all transitions 
in system 


status. 


3G. Bus Control Transfer 
(HOLD/HLDA 
and RQ/GT) 


The 8086 supports protocols for transferring control of 
the local bus between itself and other devices capable 
of acting as bus masters. The minimum mode config- 
uration offers a signal level handshake similar to the 
8080 and 8085 systems. The maximum mode provides 
an enhanced pulse sequence protocol designed to op- 
timize 
utilization 
of CPU pins 
while 
extending 
the 


system configurations 
to two prioritized levels of alter- 


nate bus masters. These protocols 
are simply 
tech- 


niques for arbitration of control of the CPU's local bus 
and should not be confused with the need for arbitration 
of a system bus. 


The minimum mode 8086 system uses a hold request in- 
put (HOLD) to the CPU and a hold acknowledge (HLDA) 
output 
from the CPU. To gain control 
of the bus, a 


device must assert HOLD to the CPU and wait for the 
HLDA before driving the bus. When the 8086 can relin- 
quish the bus, it floats the RD,WR, INTA and M/iO com- 
mand lines, the DEN and DT/Rbus control lines and the 
multiplexed address/data/status lines. The ALE signal is 
not three-stated. The CPU acknowledges 
the request 


with HLDA to allow the requestor to take control of the 
bus. The requestor must maintain the HOLD request ac- 
tive until it no longer requires the bus. The HOLD re- 
quest to the 8086 directly affects the bus interface unit 
and only indirectly affects the execution unit. The CPU 
will continue to execute from its internal queue until 
either 
more Instructions 
are needed or an operand 


transfer is required. This allows a high degree of overlap 
between CPU and auxiliary bus master operation. When 
the requestor drops the HOLD signal, the 8086 will re- 
spond by dropping HLDA. The CPU will not re-drive the 
bus, command and control signals from three-state until 
it needs to perform a bus transfer. Since the 8086 may 
still be executing from its internal queue when HOLD 
drops, there may exist a period of time during which no 
device is driving the bus. To prevent the command lines 
from drifting 
below the minimum VIH level during the 


transition 
of bus control, 
22K ohm pull up resistors 


should be connected to the bus command lines. The 
timing diagram in Figure 3G1 shows the handshake se- 
quence and 8086 timing to sample HOLD, float the bus, 
and enable/disable HLDA relative to the CPU clock. 


To guarantee valid system operation, the designer must 
assure that the requesting device does not assert con- 


trol of the bus prior to the 8086 relinquishing 
control and 
that the device relinquishes 
control 
of the bus prior to 
the 8086 driving 
the bus. The HOLD request 
Into the 


8086 must be stable THVCH ns prior to the CPU's low to 
high 
clock 
transition. 
Since 
this 
Input 
is 
not 
syn· 


chronlzed 
by the CPU, signals driving the HOLD input 


should 
be 
synchronized 
with 
the 
CPU 
clock 
to 
guarantee 
the setup time is not violated. 
Either clock 


edge may be used. The maximum delay between HLDA 
and 
the 
8086 
floating 
the 
bus 
Is 
TCLAZmax- 
TCLHAVmln = 70 ns. If the system cannot tolerate 
the 


70 ns overlap, 
HLDA active 
from the 8086 should 
be 


delayed to the device. The minimum delay for the CPU to 
drive the control 
bus from HOLD Inactive Is THVCHmin 


+ 3TCLCL = 635 
ns 
and 
THVCHmin 
+ 3TCLCL + 


TCHCL'= 701 ns to drive the multiplexed 
bus. If the 


device does not satisfy these requirements, 
HOLD Inac- 


tive to the 8086 should be delayed. The delay from HLDA 
Inactive to driving the busses Is TCLCL+ TCLCHmln- 
TCLHAVmax = 158 ns for the control bus and 2TCLCL- 
TCLHAVmax = 240 ns for the data bus. 


1.1 Latency of HLDA to HOLD 


The decision 
to respond to a HOLD request is made in 


the bus Interface unit. The major factors that Influence 
the decision are the current bus activity, the state of the 
LOCK signal internal to the CPU (activated by the soft- 
ware LOCK prefix) and interrupts. 


If the LOCK is not active, an Interrupt acknowledge 
cy- 


cle Is not in progress and the BIU (Bus Interface Unit) Is 
executing a T4 or Tl when the HOLD request is received, 
the minimum 
latency to HLDA is: 


35 ns 
65 ns 
200 ns 
10 ns 


310 ns 


THVCH min (Hold setup) 
TCHCL mln 
TCLCL (bus float delay) 
TCLHAV min (HLDA delay) 


@ 5 MHz 


34 ns 
200 ns 
82 ns 
200 ns 
160 ns 


677 ns 


(just missed setup time) 
delay to next sample 
TCHCL max 
TCLCL (bus float delay) 
TCLHAV max (HLDA delay) 


@ 5 MHz 


If the BIU just initiated 
a bus cycle when the HOLD Re- 


quest was received, the worst case response time is: 


34 ns 
82 ns 
7*200 
N*2oo 
160 ns 


1.676 lIS 


THVCH (just missed) 
TCHCL max 
bus cycle execution 
N wait states/bus 
cycle 


TCLHAV max (HLDA delay) 


@ 5 MHz, no walt states 


Note, the 200 ns delay for just missing is included in the 
delay for bus cycle execution. 
If the operand transfer is 


a word transfer to an odd byte boundary, two bus cycles 
are executed 
to perform the transfer. 
The BIU will not 


acknowledge 
a HOLD request 
between 
the two 
bus 


cycles. 
This type of transfer 
would 
extend 
the above 


maximum latency by four additional 
clocks plus N addi- 


tional wait states. With no wait states in the bus CYCle, 
the maximum would be 2.476 microseconds. 


Although the minimum mode 8086 does not have a hard- 
ware LOCK output, 
the software 
LOCK prefix may still 


be included in the instruction 
stream. The CPU internal- 


ly reacts to the LOCK prefiX as would 
the maximum 


mode 8086. Therefore, the LOCK does not allow a HOLD 
request to be honored until completion 
of the instruc· 


tion 
following 
the 
prefix. 
This 
allows 
an instruction 


which 
performs 
more than one memory reference (ex. 


ADD [BX), CX; which adds CX to [BXDto execute without 
another bus master gaining control of the bus between 
memory references_ Since the LOCK signal is active for 
one clock 
longer 
than the instruction 
execution, 
the 


maximum 
latency to HLDA is: 


34 ns 
200 ns 
82 ns 
(M + 1)*200 ns 
200 ns 
160 ns 


(M*200 ns)+ 876 ns 


THVCH (just miss) 
delay to next sample 
TCHCL max 
LOCK instruction 
execution 
set up HLDA (Internal) 
TCLHAV max (HLDA delay) 


@ 5 MHz 


If the HOLD request is made at the beginning of an Inter· 
rupt acknowledge 
sequence, 
the maximum 
latency 
to 
HLDA is: 


34 ns 
82 ns 
2600 ns 
160 ns 


2.876,..s 


THVCH (just missed) 
TCHCL max 
13 clock cycles for INTA 
TCLHAV max 


@ 5 MHz 


A typical use of the HOLD/HLDA signals in the minimum 
mode 8086 system 
is bus control 
exchange with DMA 


devices 
like the Intel 8257-5 or 8237 DMA controllers. 


Figure 3G2 gives a general interconnect 
for this type of 


configuration 
using 
the 
8237·2. The DMA controller 


resides on the upper half of the 8086's local bus and 
shares the A8-A15 demultiplexing 
address latch of the 


8086. All registers 
in the 8237·2 must be assigned odd 


addresses to allow initialization 
and Interrogation 
by the 


CPU over the upper 
half of the data bus. The 8086 


RDIWR commands 
must be demultlplexed 
to provide 


separate I/O and memory commands which are compati· 
ble with the 8237-2 commands. 
The AEN control 
from 


the 8237·2 must disable 
the 8086 commands 
from the 


command 
bus, disable 
the address 
latches 
from 
the 


lower 
(AO·A7) and 
upper (A19-A16) address 
bus and 


select the 8237·2 address strobe (ADSTB) to the A8-A15 
address 
latch. 
If the data bus is buffered, 
a pull·up 


resistor on the DEN line will keep the buffers disabled. 
The DMA controller 
will 
only transfer 
bytes 
between 
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10 are useo, tney must be provided by an ad· 
ditional 
port with either a fixed value or initialized 
by 
software and enabled onto the address bus by AEN. 


Figure 3G3 gives an interconnection 
for placing the 
8257 on the system bus. By using a separate latch to 
hold the upper address from the 8257·5and connecting 
the outputs to the address bus as shown, 16-bit DMA 
transfers 
are 
provided. 
In 
this 
configuration, 
AEN 
simultaneously 
enables AO and BHE to allow 
word 
transfers. AEN still disables the CPU interface to the 
command and address bus~es. 


The maximum mode 8086 configuration 
supports a sig· 
nificantly different protocol for transferring bus control. 
When viewed with respect to the HOLD/HLDA sequence 
of the minimum mode, the protocol appears difficult 
to 
implement externally. However, it is necessary to under- 
stand the intent of the protocol and its purpose within 
the system architecture. 


AP·67 
,mu ••"",,, ••,,. uu:; maSlers wnlcn resioe totally on the 
local bus and share the complete CPU interface to the 
system bus. The complete 
interface includes the ad· 
dress latches, data transceivers, 8288 bus controller and 
8289 multi 
master 
bus arbiter. 
If the alternate 
bus 
masters in the system do not reside directly on the 8086 
local bus, system bus arbitration is required rather than 
local CPU bus arbitration. To satisfy the need for multi· 
master system bus arbitration at each CPU's system in- 
terface, the 8289 bus arbiter should be used rather than 
the CPU RQ/GT logic. 


To allow a device with a simple HOLD/HLDA protocol to 
gain control of a single CPU system bus, the circuit in 
Figure 3G4 could be used. The design is effectively 
a 
simple bus arbiter which isolates 
the CPU from the 
system 
bus when an alternate 
bus master issues a 
HOLD request. The output of the circuit, A£R (Address 
ENable), disables the 8288 and 8284 when the 8086 in· 
dicates idle status (80,51,52 = 1),LOCK is not active and 
a HOLD request is active. With AEN inactive, the 8288 
three-states the command outputs and disables DEN 
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allow the requesting device to drive the system bus. The 
AE.N signal 
to the 8284 disables 
the ready input and 
forces a bus cycle initiated 
by the 8086 to wait until the 
8086 regains control 
of the syste!T1bus. The CPU may 
actively drive Its local bus during th"s interval. 


The requesting 
device will not gain control 
of -the bus 


during an 8086 initiated 
bus cycle, a locked instruction 
or an interrupt 
acknowledge 
cycle. The LOCK- signal 


from 
the 
8086 
is 
active 
between 
INTA 
cycles 
to 
guarantee the CPU maintains control of the bus. Unlike 
the minimum 
mode 8086 HOLD response, this arbitra- 


tion circuit 
allows the requestor to gain control 
of the 


bus between. consecutive 
bus cycles which transfer 
a 


word operand on an odd address boundary and are not 
locked. 
Depending 
on the 
characteristics 
of the 
re- 


questing device, any of the 74LS74 outputs can be used 
to generate a HLDA to the device. 


Upon completion 
of its bus operations, the alternate bus 


master must relinquish 
control 
of the system bus and 


drop the HOLD request. After AEN gOes inactive, the ad- 
dress latches and data transceivers 
are enabled but, if a 


CPU initiated 
bus cycle is pending, 
the' 8288 will 
not 


drive the command 
bus until a minimum 
of 105 ns or 


maximum of 275 ns later. If the system is normally not 
ready, the 8284 AEN input may immediately 
be enabled 


with 
ready returning 
to the CPU when the selected 


device completes 
the transfer. If the system is normally 


ready, the 8284 AEN Input must be delayed long enough 
to provide access time equivalent to a normal bus cycle. 
The 74LS74 latches in the design provide a minimum of 
TCLCHmin 
for the alternate device to float the system 


bus after releasing HOLD. They also provide 2TCLCL ns 
address 
access 
and 2TCLCL- 
TAEVCHmax 
ns (8288 


command enable delay) command access prior to ena- 
bling 
8284 ready detection. 
If HLDA is generilted 
as 


shown 
in Figure 3G4, TCLCL ns are available 
for the 


8086 to release the bus prior to issuing 
HLDA while 
HLDA 
is dropped 
almost 
immediately 
upon 
loss 
of 


HOLD. 


-.:I'\;;; •••••• ~ 
•••••.1 •••• 
t1 •.•••.•••.•• - 
•.•••••••.•••••. 
_ •••.••••.•• 
_- 
••. - 
_ •.•••• --_._-- 


latch for buffering the address lines A15-A8 and uses its 
AEN output 
to enable the latch onto the address bus. 


The maximum latency from HOLD to HLDA for this cir- 
cuit is dependent 
on the state of the system when the 


HOLD is issued. For an idle system the maximum delay 
is the propagation 
delay through the nand gate and RlS 


flip-flop 
(TD1) plus 2TCLCL plus TCLCHmax plus prop- 


agation 
delay of the 74LS74 and 74LS02 (TD2). For a 


locked 
instruction 
it 
becomes: 
TD1 + TD2 + (M + 2) 


*TCLCL+ TCLCHmax where M is the number of clocks 
required for execution of the locked instruction. 
For the 


interrupt 
acknowledge 
cycle 
the 
latency 
is 


TD1 + TD2 +!1 *TCLCL + TCLCHmax. 


2.2 Shared Local Bus (RQ/GT Usage) 


The RQ/GT protocol was developed to allow up to two in- 
struction 
set extension 
processors 
(co-processors) 
or 


other 
special 
function 
processors 
(like 
the 8089 I/O 


processor 
in local mode) to reside directly 
on the 8086 


local bus. Each RQ/GT pin of the 8086 supports 
the full 


protocol for exchange of bus control (Fig. 3G5). The se- 
quence 
consists 
of a request 
from the alternate 
bus 


master to gain control 
of the system bus, a grant from 


the CPU to indicate the bus has been relinquished 
and a 


release pulse from the alternate master when done. The 
two 
RQ/GT pins (RQ/GTO and RQ/GT1) are prioritized 


with RQ/GTO having the highest priority. 
The prioritiza- 


tion only occurs if requests have been received on both 
pins before a response has been given to either. For ex- 
ample, if a request il) received on RQ/GT1 followed 
by a 


request on RQ/GTO prior to a grant on RQ/GT1, RQ/GTO 
will gain priority 
over RQ/GT1. However, if RQ/GT1 had 


already received a grant, a request on RQ/GTOmust wait 
until a release pulse is received on RQ/GT1. 


The request/grant 
sequence interaction 
with the bus in- 


terface 
unit 
is similar 
to HOLD/HLDA. 
The CPU con- 


tinues to execute until a bus transfer 
for additional 
in- 


structions 
or data is required. 
If the release pulse is 


So 
51 
S, 


LOCK 


HOLD 


received before the CPU needs the bus, It will not drive 
the bus until a transfer is required. 


Upon receipt of a request pulse, the 8086 floats the 
multiplexed 
address, data and status bus, the SO,51, 


and S2 status lines, the LOCK pin and RD. this 
action 


does not disable the 8288 command outputs from driv- 
ing the command bus and does not disable the address 
latches from driving the address bus. The 8288 contains 
internal pull-up resistors on the SO, 51, and 52 status 
lines to maintain the passive state while the 8086 out- 
puts are three-state. The passive state prevents the 8288 
from 
initiating 
any commands 
or activating 
DEN to 


enable the transceivers buffering the data bus. If the 
device issuing the RQ does not use the 8288, It must 
disable 
the 8288 command outputs 
by disabling 
the 


8288 AEN input. Also, address latches not used by the 
requesting device must be disabled. 
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2.3 RQ/GT Operation 


Detailed timing 
of the RQ/GT sequence 
Is given In 


Figure 3G6. To request a transfer of bus control via the 
RQ/crr lines, the device must drive the line low for no 
more than one CPU clock interval to generate a request 
pulse. The pulse must be synchronized with the CPU 
clock to guarantee the appropriate setup and hold times 
to the clock edge which samples the RQ/GT lines in the 
CPU. After issuing a request pulse, the device must 
begin sampling for a grant pulse with the next low to 
high clock edge. Since the 8086 can respond with a 
grant pulse in the clock cycle Immediately following the 
request, the RQ/GT line may not return to the positive 
level between the request and grant pulses. Therefore 
edge triggered logic is not valid for capturing a grant 
pulse. It also implies the circuitry which generates the 
request pulse must guarantee the request is removed in 
time to detect a grant from the CPU. After receiving the 
grant pulse, the requesting device may drive the local 
bus. Since the 8086 does not float the address and data 
bus, LOCK or RD until the high to low clock transition 
following the low to high clock transition the requestor 
uses to sample for the grant, the requestor should walt 
the float delay of the 8086 (TCLAZ) before driving the 
local bus. This precaution prevents bus contention dur- 
ing the access of bus control by the requestor. 


To return control of the bus to the 8086, the alternate 
bus 
master 
relinquishes 
bus control 
and 
Issues a 


release pulse on the same RQ/GT line. The 8086 may 
drive the SO-S2status lines, RD and LOCK, three clock 
cycles after detecting 
the release pulse and the ad- 


dress/data bus TCHCLmin ns (clock high time) after the 
status lines. The alternate bus master should be three- 
stated off the local bus and have other 8086 interface 
circuits (8288and address latches) re-enabled within the 
8086 delay to regain control of the bus. 


2.4 RQ/GT Latency 


The RQ to GT latency for a single RQ/GT line is similar 
to the HOLD to HLDA latency. The cases given for the 
minimum mode 8086 also apply to the maximum mode. 
For each case the delay from RQ detection by the CPU 
to GT detection by the requestor is: 
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This gives a clock cycle maximum delay for an idle bus 
interface. All other cases are the minimum 
mode result 
minus 476 ns. If the 8086 has previously 
Issued ~ant 


on one of the RQ/GT lines, a request on the other RQ/GT 
line will not receive a grant until the first device releases 
the interface with a release pulse on its RQ/GT line. The 
delay from release on one AO/GT 
line to a grant on the 


other Is typically 
one clock period as shown in Figure 


3G7. Occasionally 
the delay from a release on RQ/G'fi 


to a grant on RQ/~ 
will take two clock cycles and Is a 


function 
of a pending 
request 
for transfer 
of control 


from the execution 
unit. The latency 
from request 
to 


grant 
when 
the 
interface 
is under 
control 
of a bus 


master on the other RQ/GT line Is a function 
of the other 


bus master. The protocol 
embodies 
no mechanism 
for 


the CPU to force an alternate bus master oil the bus. A 
watchdog 
timer 
should 
be used to prevent an errant 


alternate 
bus master from 'hanging' 
the system. 
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quence into a RQIGT pulse sequence is given in Figure 
3G8. After receiving the grant pulse, the HLDA 
is ena- 
bled TCHCLmin ns before the CPU has three-stated 
the 
bus. If the requesting circuit drives the bus within 20 ns 
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drops at the beginning 
of the release pulse to provide 


2TCLCL + TCLCH for the requestor to relinquish 
control 


of the status 
lines and 3TCLCL to float the remaining 


signals. 
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I/O. The I/O instructions 
allow the I/O devices to reside 
in a separate I/O address space while memory mapped 
I/O allows 
the full 
power of the instruction 
set to be 
used for I/O operations. 
Up to 64K bytes of I/O mapped 
I/O may be defined in an 8086 system. To the program- 
mer, the separate I/O address space is only accessible 
with INPUT and OUTPUT commands which transfer data 
between 
I/O devices and the AX (for 16-bit data trans- 
fers) or AL (for 8-bit data transfers) register. The first 256 
bytes of the I/O space (0 to 255) are directly addressable 
by the I/O instructions 
while the entire 64K is accessible 
via register indirect addressing 
through the DX register. 


The later technique 
is particularly 
desirable for service 
procedures that handle more than one device by allow- 
ing the desired device address to be passed to the pro- 
cedure as a parameter. I/O devices may be connected to 
the local CPU bus or the buffered system bus. 


4A. Elght·Blt 
I/O 


Eight-bit I/O devices may be connected 
to either the up- 
per or lower half of the data bus. Assigning 
an equal 


number of devices to the upper and lower halves of the 
bus will distribute 
the bus loading. 
If a device is con- 
nected 
to the upper half of the data bus, all I/O ad- 


dresses assigned to the device must be odd (AO= 1). If 
the device is on the lower half of the bus, its addresses 
must be even (AO= 0). The address assignment 
directs 


the eight-bit 
transfer to the upper (odd byte address) or 


lower (even byte address) half of the sixteen-bit 
data 


bus. Since AO will always be a one or zero for a specific 
device, AO cannot be used as an address input to select 
registers 
within 
a specific 
device. 
If a device on the 


upper half of the bus and one on the lower half are 
assigned addresses that differ only in AO (adjacent odd 
and even addresses), AOand SHE must be conditions 
of 


chip select decode to prevent a write to one device from 
erroneously 
performing 
a write 
to the other. 
Several 


techniques 
for generating 
I/O device chip selects 
are 


given in Figure 4A1. 


The first technique 
(a) uses separate 8205's to generate 
chip selects 
for odd and even addressed 
byte periph- 
erals. If a word transfer 
is performed 
to an even ad- 


dressed device, the adjacent odd addressed 
I/O device 


is also selected. This allows accessing 
the devices in- 


dividually 
with 
byte transfers 
or simultaneously 
as a 


16-bit device with word transfers. 
Figure 4A1(b) restricts 


the 
chip 
selects 
to 
byte 
transfers, 
however 
a word 


transfer to an odd address will cause the 8086 to run two 
byte transfers that the decode technique will not detect. 
The 
third 
technique 
simply 
uses 
a single 
8205 to 
generate odd and even device selects for byte transfers 
and will only select the even addressed eight-bit device 
on a word transfer to an even address. 


If greater than 256 bytes of the I/O space or memory 
mapped I/O is used, additional 
decoding beyond what is 


shown in the examples may be necessary. This can be 
done with additional 
TIL, 8205's or bipolar PROMs (In- 
tel's 3605A). The bipolar PROMs are slightly 
slower than 
multiple 
levels of TIL (50 ns vs 30 to 40 ns for TIL) but 
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One last technique 
for interfacing 
with eight-bit 
periph- 


erals Is considered 
in Figure 4A3. The sixteen-bit 
data 


bus 
is multiplexed 
onto 
an eight-bit 
bus to accom- 


modate byte oriented DMA or block transfers to memory 
mapped eight-bit 
I/O. Devices connected 
to this 
inter- 


face may be assigned a sequence of odd and even ad- 
dresses rather than all odd or even. 


8 
CS1 
10 
CS2 


: 
"0 
7 
A, 
4 
A, 
3 
A, 


2 
As 


As 
1 
As 


8-BIT 
PERIPHERAL 
DATA BUS 


I~BIT I 


DATA 1 
BUS 


BHE 
PERIPHERAL 
CO 


4B. Sixteen· Bit 110 


For obvious reasons of efficient 
bus utilization 
and sim- 
plicity of device selection, sixteen-bit 
I/O devices should 
be assigned even addresses. To guarantee the device Is 
selected 
only for word operations, 
AO and BHE should 
be conditions 
of chip select code (Fig. 4B1). 


4C. General Design Considerations 


MIN/MAX, MEMORY 110 MAPPED AND LINEAR SELECT 


Since the minimum 
mode 8086 has common 
read and 
write commands 
for memory and 110, If the memory and 
I/O address spaces overlap, the chip selects 
must be 
qualified 
by M/IO to determine which address space the 
devices are assigned to. This restriction 
on chip select 
decoding 
can be removed 
If the I/O and memory 
ad- 
dresses 
In the system do not overlap and are properly 
decoded; all 110 Is memory mapped; or RD, WR and M/Io 
are 
decoded 
to 
provide 
separate 
memory 
and 
I/O 
read/write commands (Fig. 4C1). The 8288 bus controller 
in the maximum mode 8086 system generates separate 
I/O and memory commands 
In place of a M/Io signal. An 
110 device is assigned to the I/O space or memory space 
(memory 
mapped 
I/O) by connection 
of either 
I/O or 
memory command 
lines to the command 
Inputs of the 
device. To allow overlap of the memory and I/O address 
space, the device must not respond to chip select alone 
but must require a combination 
of chip select and a read 
or write command. 


NOTE: 
IF IT IS NOT 
NECESSARY 
TO THREE·STATE 
THE 
COMMAND 
LINES, 
A 
DECODER 
(8205 
OR 74S138) 
COULD 
BE USED. 
THE 
74LS257 
IS NOT 
RECOMMENDED 
SINCE 
THE 
OUTPUTS 
MAY 
EXPERIENCE 
VOLTAGE 
SPIKES 
WHEN 
ENTERING 
OR LEAVING 
THREE·STATE. 


Figure 
4C1. 
Decoding 
Memory 
and 110 RD and WR Commands 
lor 
Minimum 
Mode 8086 Systems 


Linear select techniques 
(Fig. 4C2) for I/O devices can 
only be used with devices that either reside In the 110 ad· 
dress space or require more than one active chip select 
(at least one low active and one high active). Devices 
with a single chip select input cannot use linear select If 
they are memory mapped. This Is due to the assignment 
of memory address space FFFFFOH·FFFFFFH 
to reset 
startup and memory space 00000H·003FFH to Interrupt 
vectors. 
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40. 
Determining 
I/O Device Compatibility 


This section presents a set of A.C. characteristics 
which 
represent the timing of the asynchronous 
bus Interface 
of the 8086. The equations are expressed In terms of the 
CPU 
clock 
(when 
applicable) 
and 
are 
derived 
for 
minimum and maximum modes of the 8086. They repre- 
sent the bus characteristics 
at the CPU. 


The results 
can be used to determine 
110 device 
reo 
qulrements 
for operation 
on a single CPU local bus or 
buffered system bus. These values are not applicable to 


a Multibus 
system bus interface. The requirements 
for a 
Multibus 
system bus are available in the Multibus 
inter- 
face specification. 


A list of bus parameters, 
their definition 
and how they 
relate to the A.C. characteristics 
of Intel peripherals are 
given 
in Table 
401. 
Cycle 
dependent 
values 
of 
the 
parameters are given in Table 402. For each equation, if 
more than one signal 
path is involved, 
the equation 
reflects 
the worst case path. 


ex. TAVRL(address valid before read active) = 
(1) Address from CPU to RO active 
( or) 
(2) ALE (to enable the address through the 
address latches) to RO active 


The worst case delay path is (1). 


For the maximum mode 8086 configurations, 
TAVWLA, 


TWLWHA 
and TWLCLA 
are relative 
to the advanced 
write 
signal 
while 
TAVWL, 
TWLWH 
and TWLCL are 
relative to the normal write signal. 


TAVAL 
- 
Address 
stable 
before AD leading 
edge 
(TAA) 


TAHAX 
- 
Address 
hold after AD trailing 
edge 
(TAA) 


TALAH 
- 
Aead pulse width 
(TAA) 


TALDV 
- 
Aead to data valid delay 
(TAD) 


TAHDZ - 
Aead trailing 
edge to data floating 
(TDF) 


TAVDV - 
Address 
to valid data delay 
(TAD) 


TALRL - 
Read cycle time 
(TRCYC) 


TAVWL 
- 
Address 
valid before write 
leading 
edge 
(TAW) 


TAVWLA 
- 
Address 
valid before advanced 
write 
(TAW) 


TWHAX 
- 
Address 
hold after write trailing 
edge 
(TWA) 


TWLWH 
- 
Write 
pulse width 
(TWW) 


TWLWHA 
- 
Advanced 
write 
pulse width 
(TWW) 


TDVWH 
- 
Data set up to write trailing 
edge 
(TDW) 


TWHDX - 
Data hold from write 
trailing 
edge 
(TWD) 


TWLCL - 
Write 
recovery 
time 
(TRV) 


TWLCLA 
- 
Advanced 
write 
recovery 
time 
(TRV) 


TSVRL - 
Chip select 
stabie 
before 
RD leading 
edge 
(TAR) 


TRHSX - 
Chip select 
hold after RD trailing 
edge 
(TRA) 


TSLDV - 
Chip select 
to data valid delay 
(TRD) 


TSVWL - 
Chip select 
stable 
before WR leading 
edge 
(TAW) 


TWHSX 
- 
Chip select 
hold after WR trailing 
edge 
(TWA) 


TSVWLA 
- 
Chip select 
stable 
before advanced 
write 
(TAW) 


Symbols 
in parentheses 
are equivalent 
parameters 
specified 
for 


Intel 
peripherals. 


In the given list of equations, TWHOXB is the data hold 
time from the trailing 
edge of write 
for the minimum 
mode 
with 
a buffered 
data 
bus. 
For this 
equation, 


TCVCTX cannot be a minimum for data hold and a max· 
imum for write inactive. The maximum difference 
is 50 
ns giving the result TCLCH-50. If the reader wishes to 
verify the equations or derive others, refer to Section 3F 
for assistance 
with 
interpreting 
the 8086 bus timing 
diagrams. 


Figure 
401 
shows 
four 
representative 
configurations 
and the 
compatible 
Intel 
peripherals 
(including 
wait 
states 
if required) for each configuration 
are given in 
Table 403. Configuration 
1 and 2 are minimum 
mode 
demultiplexed 
bus 8086 systems without (1)and with (2) 
data bus transceivers. 
Configurations 
3 and 4 are max- 


imum mode systems with one (3)and two (4) levels of ad- 
dress 
and 
data 
buffering. 
The 
last 
configuration 
is 
characteristic 
of a multi-board 
system with bus buffers 
on each board. The 5 MHz parameter values for these 
configurations 
are given in Table 404 and demonstrate 


the relaxed device requirements 
for even a large com- 


plex 
configuration. 
The 
analysis 
assumes 
all 
com- 
ponents are exhibiting 
the specified 
worst case param- 


eter 
values 
and 
are 
under 
the 
corresponding 
tem- 


perature, voltage and capacitive 
load conditions. 
If the 


capacitive loading on the 8282/83 or 8286/87 is less than 
the maximum, graphs of delay vs. capacitive 
loading in 
the respective data sheets should be used to determine 
the appropriate 
delay values. 


(a) Minimum 
Mode 


TAVRL=TCLCL+ 
TCLRLmin- 
TCLAVmax=TCLCL-l00 


TRHAX= 
TCLCL- 
TCLRHmax 
+ TCLLHmin 
= TCLCL-150 


TRLRH = 2TCLCL- 
60= 2TCLCL- 
60 
TALDV = 2TCLCL- 
TCLRLmax 
- TDVCLmin 
= 2TCLCL-195 


TRHDZ= 
TRHAVmin= 
155 ns 
TAVDV= 
3TCLCL- 
TDVCLmin 
- TCLAVmax 
= 3TCLCL-140 


TRLRL = 4TCLCL = 4TCLCL 
TAVWL= 
TCLCL+ 
TCVCTVmin- 
TCLAVmax= 
TCLCL-100 


TWHAX= 
TCLCL+ 
TCLLHmin 
- TCVCTXmax 
= TCLCL-110 


TWLWH = 2TCLCL- 
40= 2TCLCL- 
40 


TDVWH = 2TCLCL+ 
TCVCTXmin 
- TCLDVmax 
= 2TCLCL- 
100 


TWHDX=TWHDZmin=89 
TWLCL = 4TCLCL = 4TCLCL 
TWHDXB=TCLCHmin+(- 
TCVCTXmax+ 
TCVCTXmin)= 
TCLCHmin 
- 50 


Note: 
Delays 
relative 
to chip 
select 
are a function 
of the chip 
select 


decode 
technique 
used and are equal to: equivalent 
delay 


from address 
- chip 
select 
decode 
delay. 


(b) Maximum 
Mode 


TAVRL= 
TCLCL+ 
TCLMLmin 
- TCLAVmax 
= TCLCL-100 


TRHAX= 
TCLCL- 
TCLMHmax+ 
TCLLHmin 
= TCLCL- 
40 


TRLRH = 2TCLCL- 
TCLMLmax+ 
TCLMHmin 
= 2TCLCL- 
25 


TRLDV = 2TCLCL- 
TCLMLmax 
- TDVCLmin 
= 2TCLCL- 
65 


TRHDZ= 
TRHAVmin 
= 155 
TAVDV= 
3TCLCL- 
TDVCLmin- 
TCLAVmax 
= 3TCLCL- 
140 


TRLRL = 4TCLCL= 
4TCLCL 
TAVWLA=TAVRL=TCLCL-100 
TAVI'IL= 
TAVRL+ 
TCLCL= 
2TCLCL-l00 
TWHAX = TRHAX = TCLCL - 40 
TWLWHA 
= TRLRH = 2TCLCL - 25 


TWLWH 
= TRLRH - TCLCL= 
TCLCL- 
25 
TDVWH = 2TCLCL+ 
TCLMHmin 
- TCLDVmax 
= 2TCLCL-100 


TWHDX= 
TCLCHmin 
- TCLMHmax+ 
TCHDZmin 
= TCLCHmin 
- 30 


TWLCL = 3TCLCL = 3TCLCL 
TWLCLA 
= 4TCLCL = 4TCLCL 


Configuration 


Minimum 
Mode 
Maximum 
Mode 


Unbuffered 
Buffered 
Buffered 
Fully 
Buffered 


8251A 
•... 
lW 
•... 
•... 


8253·5 
•... 
lW 
•... 
•... 


8255A·5 
•... 
lW 
•... 
•... 


8257·5 
•... 
1W 
•... 
•... 


8259A 
•... 
•... 
•... 
•... 


8271 
•... 
lW 
•... 
•... 


8273 
•... 
lW 
•... 
•... 


8275 
•... 
lW 
•... 
•... 


8279·5 
•... 
1W 
•... 
•... 


8041A- 
•... 
lW 
•... 
•... 


8741A 
•... 
lW 
•... 
•... 


8291 
•... 
•... 
•... 
•... 


-Includes 
other Intel peripherals 
based on the 8041A (i.e., 8292, 8294, 


8295). 


tI'" 
implies 
full 
operation 
with 
no wait 
states. 


W implies 
the number 
of wait 
states 
required. 


Configuration 


Minimum 
Mode 
Maximum 
Mode 


Unbullered 
Bullered 
Bullered 
Fully Bullered 


TAVRL 
70 
72 
70 
58 
TRHAX 
57 
27 
169 
141 
TRLRH 
340 
320 
375 
347 
TRLDV 
205 
150 
305 
261 


TRHDZ 
155 
158 
382 
380 
TAVDV 
430 
400 
400 
372 
TRLRL 
800 
770 
800 
772 
TAVWL 
70 
72 
270 
258 
TAVWLA 
- 
- 
70 
58 
TWHAX 
97 
67 
169 
141 
TWLWH 
360 
340 
175 
147 
TWLWHA 
- 
- 
375 
347 
TDVWH 
300 
339 
270 
258 
TWHDX 
88 
15 
95 
13 


TWLCL 
800 
772 
600 
572 
TWLCLA 
- 
- 
800 
772 
TSVRL 
52 
54 
52 
40 
TRHSX 
50 
50 
171 
143 
TSLDV 
412 
382 
382 
354 
TSVWL 
52 
54 
252 
240 
I 


TWHSX 
90 
90 
171 
143 
TSVWLA 
- 
- 
52 
40 


- 
Not applicable. 


Peripheral 
compatibility 
is determined 
from the equa- 
tions given for the CPU by modifying 
them to account 
for additional 
delays 
from 
address 
latches 
and data 
transceivers 
in the configuration. 
Once the system con- 
figuration 
is selected, the system requirements 
can be 
determined 
at the 
peripheral 
interface 
and 
used 
to 
evaluate compatibility 
of the peripheral 
to the system. 


During 
this 
process, 
two areas must 
be considered. 


First, can the device operate at maximum 
bus band- 
width and if not, how many wait states are required. Sec- 
ond, are there any problems that cannot be resolved by 
walt states. 


Examples of the first are TRLRH (read pulse width) and 
TRLDV (read access or RD active to output 
data valid). 


Consider 
address access 
time (valid address 
to valid 
data) for the maximum 
mode fully buffered 
configura- 


tion. 


TAVDV = 3TCYC - 140 ns - 
address 
latch delay - 
address buffer delay - 
chip select decode delay - 
2 
transceiver 
delays 


Assuming 
inverting 
latches, 
buffers 
and 
trans- 
ceivers with 22 ns max delays (8283, 8287) and a 
bipolar 
PROM decode with 50 ns delay. the result 


is: 


8- 


The result gives the address to data valid delay required 
at the peripheral (in this configuration) 
to satIsfy zero 
wait state CPU access time. If the maximum delay 
specified for the peripheral is less than the result, this 
parameter is compatible with zero wait state CPU opera- 
tion. If not, wait states must be inserted until TAVDV+ n 
• TCYC (n is the number of wait states) is greater than 
the peripherals maximum delay. If several parameters 
require wait states, either the largest number required 
should always be used or different transfer cycles can 
insert the maximum number required for that cycle. 


The second area of concern includes TAVRL (address 
set up to read) and TWHDX (data hold after write). 
Incompatibilities 
in this area cannot be resolved by the 


insertion 
of wait states and may require either addi· 


tional hardware, slowing down the CPU (if the parameter 
is related to the clock) or not using the device. 


As an example consider address valid prior to advanced 
write low (TAVWLA) for the maximum mode fully buf· 
fered system. 


TAVWLA = TCYC- 100 ns - 
address latch delay - 
address buffer delay - 
chip select decode delay+ 
write buffer delay (minimum) 


Assuming inverting latches and buffers with 22 ns 
delay (8283,8287) and an 8205 address decoder with 
18 ns delay 


TAVWLA=38 
ns which is the time a 5 MHz 8086 
system provides 


4E. 110Exemple. 


1. Consider an Interrupt 
driven procedure for handling 


multiple 
communication 
lines. On receiving an Interrupt 


from one of the lines, the Invoked procedure 
polls the 


lines (reading the status 
of each) to determine 
which 


line to service. The procedure does not enable lines but 
simply 
services 
Input 
and output 
requests 
until 
the 


associated 
output buffer Is empty (for output requests) 


or until an Input line is terminated 
(for the example only 


EOT Is considered). On detection 
of the terminate condl· 
tlon, the routine will disable the line. It is assumed that 
other routines 
will fill a lines output 
buffer and enable 


the device to request output 
or empty the Input buffer 


and enable the device to input additional 
characters. 


The routine begins operation by loading CX with a count 
of the number of lines In the system and OX with the 110 
address of the first line. The I/O addresses are assigned 
as shown in Figure 4E1 with 8251A's as the I/O devices. 
The status of each line Is read to determine 
if it needs 


service. If yes, the appropriate 
routine is called to input 


or output 
a character. 
After servicing 
the line or if no 


service 
Is needed, CX Is decremented 
and OX is in· 
cremented to test the next line. After all lines have been 
tested and serviced, the routine terminates. 
If all Inter· 


rupts from the lines are OR'd together, 
only one inter· 


rupt is used for all lines. If the interrupt 
is Input to the 


CPU through 
an 8259A interrupt 
controller, 
the 8259A 


should 
be programmed 
in the level triggered 
mode to 


guarantee all line Interrupts 
are serviced. 


To service either an input or output request, the called 
routine transfers OX to BX, and shifts BX to form the off- 
set for this device into the table of input or output buf· 
fers. The first entry in the buffer is an index to the next 
character position 
in the buffer and is loaded into the 51 


register. By specifying 
the base address of the table of 


DEVICES 
ARE 
CONNECTED 
TO THE 
UPPER 
AND 
LOWER 
HALVES 
OF THE 
DATA 
BUS. 


ADDRESS 
o 
1 
2 
3 
4 
5 
8 
7 
ETC. 


DEVICE 
0 
DEVICE 
1 


DEVICE 
0 
DEVICE 
1 
DEVICE 
2 
DEVICE 
3 


DEVICE 
2 
DEVICE 
3 


DATA 
DATA 
CONTROUSTATUS 
CONTROUSTATUS 
DATA 
DATA 
CONTROUST 
ATUS 
CONTROU$T ATUS 


buffers 
as a displacement 
into the data segment, 
the 


base + index + displacement 
addressing 
mode 
allows 


direct access to the appropriate 
memory location. 
8086 


code for part of this example is shown In Figure 4E2. 


2. As 
a 
second 
example, 
consider 
using 
memory 


mapped I/O and the 8086 string prlmatlve instructions 
to 


perform 
block transfers 
between 
memory and I/O. By 


assigning 
a 
block 
of 
the 
memory 
address 
space 


(equivalent 
in size to the maximum 
block to be trans· 


ferred 
to the 
I/O device) and decoding 
this 
address 


space to generate the I/O device's chip select, the block 
transfer 
capability 
Is easily 
Implemented. 
Figure 
4E3 


gives an interconnect 
for 16-bit 110 devices while Figure 


4E4 Incorporates 
the 16-bit bus to 8-bit bus multiplexing 


scheme to support 8-bit I/O devices. A code example to 
perform such a transfer 
is shown In Figure 4E5. 


; THIS 
CODE 
DEMONSTRATES 
TESTINO 
DEVICE 


; STATUS 
FOR SERVICE, 
CONSTRUCTING 
THE 


; APPROPRIATE 
LINE 
IUFFER 
ADDRESS 
FOR INPUT 


; AND 
OUTPUT 
AND 
SERVICING 
AN INPUT 


; REQUEST 


MASk 
Eau 
OFFFDH 
CHECK_STATUS: 
INPUT 
AL, ox 
: GET S2151ASTATUS. 


MOV 
AH, At 
TEST 
AH, READ_OR_WRITLSTATUS 


Jl 
NEXT_IO 
CALL 
ADDRESS 
TEST 
AH, READ STATUS 


JZ 
WRITE-SERVICE 
CALL 
READ 
TEST 
AH, WRITE STATUS 
JZ 
NEXT_IO 
WRITE-SERVICE: 
CAll 
WRITE 
NEXT_IO: 
DEe 
ex 
; TEST IF DONE. 


JNC 
EXIT 
; YES, RESTORE. 
RETURN. 


AND 
ox, MASK 
; REMOVE 41 AND 


ADD 
ox, 3 
; INCREMENT 
ADDRESS. 


OR 
ox, 2 
; SelECT 
STATUS FOR 
JMP 
CHECK-STATUS 
; NEXT INPUT. 


ADDRESS: 
AND 
DX, MASK 
; SELECT DATA. 


MOY 
IH, 
Dl 
; CONSTRUCT 
IUFFER 


INC 
IH 
; DISPLACEMENT 
FOR 


SHR 
IH 
; THIS DEVICE. 


XOR 
Il, 
Bl 
; IX 
IS THE DISPLACEMENT. 


RET 


READ: 
INPUT Al, 
DX 
; READ CHARACTER. 


MOV 51, READ_BUFFERS 
IIX] 
; GET CHARACTER 
POINTER. 


MOV READ_IUFFERS 
(BX + SI), Al 
; STORE CHARACTER. 


INC READ_BUFFERS 
IBX) 
; INCR CHARACTER 
POINTER. 


CMP Al, 
EOT 
; END OF TRANSMISSION? 


JNZ CONT _READ 
CAll 
DISABLE 
READ 
; YES, DISABLE 
RECEIVER. 


CONT_READ: 
RET 
; SEND MESSAOETHAT 
INPUT 
; IS READY. 


3605 
A·1 


DECODE 


TRANSFER 
258 BVTE 
BLOCKS 
TO THE 
110 DEVICE 


THE 
ADDRESS 
SPACE 
ASSIGNED 
TO THE UO DEVICE 
IS 


A" 
FROM 
~BASE 


THRU 
j.- 
BASE 
~ 


A7 
~ 
ADDRESS 
0'. 


ADDRESS 
1'. 


CS 


8·BIT 
110 
DEVICE 
_____ I ) 


; DEFINE 
THE 
110 ADDRESS 
SPACE 
110 SEGMENT 
ORG 
BLOCIL..ADDRESS 
' 


IIO_BLOCK: 
OW 128 DUP (1) 
UO ENDS 


; ASSUME 
THE 
DATA 
IS FROM 
THE 
CURRENT 


; DATA 
SEGMENT 


CLD 
; OF = FORWARD 
LES DI,IIO_BLOCK--ADDRESS 
; 110 BLOCK 
ADDRESS 


; CONTAINS 
THE 
ADDRESS 


; OF 110 BLOCK 
MOV 
CX, 
BLOCK_LENGTH 
MOV 
SI, SOURCE--ADDRESS 
MOVS 
110 BLOCK 
; PERFORM 
WORD 
TRANSFERS 


; END 
CODE 
EXAMPLE 


NOTE 
THE 
CODE 
IS CAPABLE 
OF 
PERFORMING 
BYTE 
TRANSFERS 
BY 


CHANGING 
THE 
UO BLOCK 
DEFINITION 
FROM 
128 WORD 
TO 258 BYTES 


5. INTERFACING WITH MEMORIES 


Figure 
5.1 is a general 
block 
diagram 
of an 8086 


memory, The basic characteristics 
of the diagram are 


the partitioning of the 16·bit word memory into high and 
low 8·bit banks on the upper and lower halves of the 
data bus and inclusion of BHE and AOin the selection of 
the banks. Specific implementations depend on the type 
of memory and the system configuration. 


SA. ROM and EPROM 


The easiest devices to interface to the system are ROM 
and EPROM.Their byte format provides a simple bus in· 
terlace and since they are read only devices, AO and 
BHE need not be Included in their chip enable/select 
decoding (chip enable is similar to chip select but addl· 
tlonally determines if the device is in active or standby 
power 
mode). The address 
lines 
connected 
to 
the 


devices start with A1 and continue up to the maximum 


number the device can accept, leaving the remaining ad· 
dress lines for chip enable/select decoding. To connect 
the devices directly to the multiplexed 
bus, they must 


have 
output 
enables. 
The 
output 
enable 
Is 
also 


necessary to avoid bus contention 
in other configura· 
tions. Figure 5A1 shows the bus connections for ROM 
and EPROM memories. No special decode techniques 
are required for generating chip enables/selects. 
Each 


valid decode selects one device on the upper and lower 
halves of bus to allow byte and word access. Byte ac· 
cess is achieved by reading the full word onto the bus 
with the 8086 only accepting the desired byte. For the 
minimum mode 8086, if RD,WR and M/iO are not decod· 
ed to form separate commands for memory and I/O, and 
the I/O space overlaps the memory space assigned to 
the EPROM/ROM then M/IO (high active) must be a con· 
dition of chip enable/select decode. The output enable 
is controlled by the system memory read signal. 


system. The parameters, equations and evaluation tech- 
niques given in the I/O section are also applicable to 
these devices. The relationship of parameters is given in 
Table 5A1. TACC and TCE are related to the same equa· 
tion and differ only by the delay associated with the chip 
enable/select decoder. As an example, consider a 2716 
EPROM memory residing on the multiplexed bus of a 
minimum mode configuration: 


TACC= 3TClCl-140- 
address buffer delay = 430 ns 
(8282= 30 ns max delay) 


TCE= TACC- decoder delay = 412 ns 
(8205decoder delay = 18 ns) 


TOE=2TClCl-195=205 
ns 


TDF= 
= 155 ns 


TOE - 
Output 
Enable 
to Valid 
Data" 
TRLDV 
TACC 
- 
Address 
to Valid 
Data" 
TAVDV 
TCE - 
Chip 
Enable 
to Valid 
Data" 
TSLDV 
TDF - 
Output 
Enable 
High to Output 
Float" 
TRHDZ 


The results are the times the system configuration 
reo 


quires of the component for full speed compatibility 
with 
the system. Comparing these times with 2716 
parameter limits indicates the 2716-2will work with no 
wait states while the 2716 will require one wait state. 
Table 5A2 demonstrates EPROM/ROMcompatibility for 
the configurations 
presented in the I/O section. Before 
designing a ROM or EPROM memory system, refer to 
AP-30 for additional information on design techniques 
that give the system an upgrade path from 16K to 32K 
and 64K devices. 


ConfiguraUon 


Minimum 
Mode 
Maximum 
Mode 


Unbuffered 
Buffered 
Buffered 
Fully Buffered 


2716-1 
.- 
.- 
.- 
.- 


2716-2 
.- 
1W 
1W 
1W 
2732 
1W 
1W 
1W 
1W 


2332 
.- 
.- 
.- 
.- 


2364 
.- 
.- 
.- 
.- 


5B. Static RAM 


Interfacing static RAM to the system introduces several 
new requirements to the memory design. AOand BHE 
must 
be 
included 
in 
the 
chip 
select/chip 
enable 
decoding of the devices and write timing must be con· 
sidered in the compatibility 
analysis. 


For each device, the data bus connections 
must be 
restricted to either the upper or lower half of the data 
bus. Devices like the 2114 or 2142 must not straddle the 
upper and lower halves of the data bus (Fig. 5B1). To 
allow selecting either the upper byte, lower byte or full 
16-bit word for a write operation, BHE must be a condi- 
tion of decode for selecting the upper byte and AOmust 
be a condition of decode for selecting the lower byte. 
Figure 
5B2 gives 
several selection 
techniques 
for 


_c 
CS; 
I/O,- 
es, 


~ 
1102 


\ 
Ao·g 


y 
1103 
_c 
WE 


00 
I/O. 


The first group requires inclusion of AO and BHE to 
decode 
or 
enable 
the 
chip 
selects. 
Since 
these 


memories do not have output enables, read and write 
are used as enables for chip select generation to pre- 
vent bus contention. If read and write are not used to 
enable the chip selects, devices with common input/out· 
put pins (like the 2114)will be subjected to severe bus 
contention 
between chip select and write active. For 


devices with 
separate input/output 
lines (like 2141, 


2147), the outputs can be externally buffered with the 
buffer enable controlled by read. This solution will only 
allow bus contention between memory devices in the ar· 
ray during chip select transition 
periods. These tech- 


niques are considered in more detail in Section 2C. 


For devices with output enables (2142), write may be 
gated with BHE and AOto provide upper and lower bank 
write strobes. This simplifies 
chip select decoding by 


eliminating 
BHE and AO as a condition 
of decode. 


Although both devices are selected during a byte write 
operation, only one will receive a write strobe. No bus 
contention will exist during the write since a read com· 
mand must be issued to enable the memory output 
drivers. 


If multiple chip selects are available at the device, BRE 
and AO may directly 
control 
device selection. 
This 


allows 
normal chip select decoding 
of the address 


space and direct connection of the read and write com· 
mands to the devices. Alternately, the multiple 
chip 


select inputs of the device could directly decode the ad· 
dress space (linear select) and be combined with the 
separate write strobe technique to minimize the control 
circuitry needed to generate chip selects. 


As with the EPROM's and ROM's, if separate commands 
are not provided for memory and I/O in the minimum 
mode 8086 and the address spaces overlap, MliO (high 
active) must be a condition of chip select decode. Also, 
the address lines connected to the memory devices 
must start with A1 rather than AO. 


BHE 


M/iO OR 
ADDITIONAL 
ADDRESS 


CHIP 
SELECTS 
(HIGH 
AND 
lOW 
FOR 
FOUR 
GROUPS) 


For analysis 
of 
RAM compatibility, 
the write 
timing 
parameters listed in Table 5B1 may also need to be con· 
sidered 
(depending 
on the RAM device being consid· 
ered). The CPU clock 
relative timing 
is given in Table 
5B2. The equations 
specify 
the device requirements 
at 
the CPU and provide a base for determining 
device reo 
quirements 
in other configurations. 
As an example con· 
sider the write timing 
requirements 
of a 2142 in a max· 
imum 
mode buffered 
8086 system 
(Figure 
5B4). The 
2142 write parameters 
that must be analyzed are TWA 
advanced 
write 
pulse width, 
TWF! write 
release time, 
TDWA data to write 
time overlap and TDH data hold 
from write time. 


TWA = 2TCLCL - TCLMLmax 
+ TCLMHmin 
= 375 ns. 
TWR= 
2TCLCL- 
TCLMHmax+ 
TCLLHmln+ 
TSHOVmln= 
170 ns. 
ToWA= 
2TCLCL- 
TCLoVmax 
+ TCLMHmln 
- TIVOVmax 
= 265 ns. 
ToH = TCLCH - TCLMHmax+ 
TCHoXmln+ 
TiVOVmln= 
95 ns. 


TW - 
Write 
Pulse Width 
TWR - 
Write 
Release (Address 
Hold From End of Write) 
TOW - 
Data and Write 
Pulse Overlap 
ToH 
- 
Data Hold 
From 
End of Write 
TAW - 
Address 
Valid 
to End of Write 
TCW - 
Chip 
Select 
to End of Writa 
TASW 
- 
Address 
Valid 
to Beginning 
of Write 


(a) Minimum 
Mode 


TW = TWLWH 
= 2TCLCL - 60 = 340 ns 
TWR= 
TCLCL- 
TCVCTXmax+ 
TCLLHmin= 
90 ns 
TOW = 2TCLCL - TCLoVmax 
••.TCVCTXmin 
= 300 ns 
TDH = TWHoX 
= 68 ns 
TAW= 
3TCLCL- 
TCLAVmax+ 
TCVCTXmin 
= 500 ns 
TCW=TAW-Chip 
Select 
Decode 
TASW= 
TCLCL- 
TCLAVmax+ 
TCVCTXmin 
= 100 ns 


(b) Maximum 
Mode 


TW= 
TCLCL- 
TCLMLmax+ 
TCLMHmin 
= 175 ns 
TWR = TCLCL - TCLMHmax 
+ TCLLHmin 
= 165 ns 
ToW=TW= 
175 ns 
ToH = TCLCHmin 
- TCLMHmax 
+ TCHoXmin 
= 93 ns 
TAW= 
3TCLCL- 
TCLAVmax+ 
TCLMHmin= 
500 ns 
TCW = TAW - Chip Select 
Decode 
TASW = 2TCLCL- 
TCLAVmax 
+ TCLMLmin= 
300 ns 
TWA' 
=TW+ 
TCLCL= 
375 ns 
TDWA' 
= 2TCLCL- 
TCLoVmax 
+ TCLMHmin 
= 300 ns 
TASWA' 
=TASW- 
TCLCL= 
100 ns 


Comparing 
these results with the 2142 family' indicates 
the standard 2142 write timing ,is fully compatible 
with 
this 
8086 configuration. 
Read timing 
analysis 
is also 
necessary to completely 
determine compatibility 
of the 
devices. 


5C. Dynamic 
RAM 


Dynamic 
RAM is perhaps the most complex 
device to 
design into a system. To relieve the engineer of most of 
this burden, Intel provides the 8202 dynamic 
RAM con- 
troller 
as part of the 8086 family of peripheral devices. 
This section will disc\Jss using the 8202 with the 8086 to 
build a dynamic memory system for an 8086 system. For 


additional 
information 
on the 8202, refer to the 8202 
data sheet (9800873) and application 
note AP-45 Using 
the 8202 Dynamic RAM Controller 
(9800809A). 


5,C.1 Standard 8086·8202 Interconnect 


Figure 5.C.1.1 shows a standard interconnection 
for an 
8202 into an 8086 system. 
The configuration 
accom· 


modates 64K words (128K bytes) of dynamic 
RAM ad· 
dressable 
as words or bytes. To access the RAM, the 
8086 initiates 
a bus cycle with an address that selects 
the 8202 (via PCS) and the appropriate 
transfer 
com- 
mand (MRDC or MWTC). If the 8202 is not performing 
a 
refresh cycle, the access starts immediately, 
otherwise, 


the 8086 must wait for completion 
of the retresh. XACK 
from the 8202 is connected 
to the 8284 ROY input to 
force the CPU to wait until. the RAM cycle is completed 
before the CPU can terminate 
the bus cycle. This effec· 
tively synchronizes 
the asynchronous 
events of refresh 
and 
CPU 
bus 
cycles. 
The 
normal 
write 
command 
(MWTC) is used rather than the advanced 
command 
(AMWC) to guarantee the data is valid at the dynamic 
RAMS before the write command 
is issued. The gating 
of WE with AOand BHE provides selective write strobes 
to the upper and lower banks of memory to allow byte 
and word write operations. 
The logic which 
generates 
the strobe for the data latches allows read data to prop- 
agate to the system as soon as the data is available and 
latches the data on the trailing 
edge of CAS. 


DETAILED TIMING 


Read Cycle 


For no wait state operation, the 8086 requires data to be 
valid from MRDC in: 


2TCLCL - TCLM L - TDVCL - buffer delays = 291 ns. 


Since the 8202 is CAS access limited, we need only ex· 
amine CAS access time. The 820212118guarantees data 
valid from 8202 RD low to be: 


(tph + 3tp + 100 ns) 8202 TCC delay + TCAC for the 2118 


MRDe 
HIOH 
BYTE 
MWTC 
WRITE 


..,.. 
.HE 
5MHz 


OTHER 
ADo-AD15 


READY 
A~. 
2111 
INPUTS 


OAT, 
Dfo.11 
I 
DATA 


Flgur.SC1.l. 
5 MHz 808618202l128K 
Byt. 
System 
- 
Double 
Data, Control 
and Addre •• 
Buffering 
(Note: 
Bus driver 
on 8202 Is not needed 
Ille.a 


then 64K bytes 
are used) 


For a 25 MHz 8202 and 2118-3, we get 297 ns which is In- 
sufficient 
for no wait state operation. 
If only 64K~tes 
are accessed, the 8202 requires only (tph + 3tp + 8:>ns) 
giving 282 ns access and no wait states required. Refer 
to Figure 5.C.1.2 and 5C.1.3 for timing 
information 
on 
the 8202 and 2118. 


Write Cycle 


An 
important 
consideration 
for 
dynamic 
RAM write 
cycles 
is to guarantee 
data to the RAM is valid when 
both CAS and WE are active. For the 2118, if WE is valid 
prior to CAS, the data setup is to CAS and if CAS is valid 
before WE (as would occur during a read modify write 
cycle) the data setup time is to WE. For the 8202, the WR 
to CAS delay is analyzed to determine 
the data setup 
time to CAS inherently 
provided by the 8202 command 
to RAS/CAS timing. 
The minimum 
delay from WR to' 
CAS is: 


TCCmin= 
tph+ 2tp+25= 
127 ns @ 25 MHz 


SUbtracting 
buffer 
delays and data setup at the 2118, 
we have 83 ns to generate 
valid data after the write 
command 
is Issued by the CPU (In this case the 8288). 
Since 
the 
8086 will 
not 
guarantee 
valid 
data 
until 
TCLAVmax - TCLMLmin = 100 ns from 
the 
advanced 


write signal, the normal write signal is used. The normal 
write MWTC guarantees data is valid 100 ns before it is 
active. The worst case write pulse width is approximate- 
ly 175 ns which is sufficient 
for all 2118's. 


Synchronization 


To force the 8086 to wait during 
refresh the XACK or 
SACK lines must be returned 
to the 8284 ready input. 


The maximum delay from Ro to SACK (if the 8202 is not 
performing 
refresh) is TAC = tp + 40 = 80 ns. To prevent 
a wait state at the 8086, ROY must be valid at the 8284 
TCLCHmin - TCLMLmax- 
TR1VCLmax= 
48 
ns 
after 


the command 
is active. This implies 
that under worst 
case conditions, 
one walt state will be Inserted for every 
read cycle. Since MWTC does not occur until one clock 
later, two wait states may be inserted for writes. 


The XACK from command delay will assert ROY TCC + 
TCX = (tph + 3tp + 100)+ (5tp + 20)= 460 
ns 
after 
the 
command. 
This will 
typically 
insert 
one or two 
wait 
states. 


Unless 2118-3's are used in 64K byte or less memories, 
SACK must not be used since it does not guarantee a 
wait state. From the previous access time analysis we 
saw that other conflguratlQns 
required a walt state. 


tCAS 
----~--------\' 


\ 


lex 
--txw-- 


lww f=" - ---- 
-- 
- --- 


I 


A.C. CHARACTERISTICS 


TA=0·Ct070·C, 
Vcc=5V±10% 


Measurements 
made 
with 
respect 
to RAS, - 
RAS4, 
CAS, 


WE, OUTo- 
OUTe are at 2.4V and 0.8V. All other 
pins 
are 
measured 
at 1.5V. 


Loading: 


64 Devices 


CL= 
30 pF 
CL=320 
pF 
CL=230 
pF 
CL=450 
pF 
CL=640 
pF 


Symbol 
Parameter 
Min 
Max 
Units 


tp 
Clock 
(Internal/External) 
Period 
(See Note 
1) 
40 
54 
ns 


tRC 
Memory 
Cycle 
Time 
10 tp- 
30 
12 tp 
ns 


tRAH 
Row Address 
Hold 
Time 
~ 
tp- 
10 
ns 


tASR 
Row 
Address 
Setup 
Time 
tpH 
-vi., 
ns 


tCAH 
Column 
Address 
Hold 
Time 
5 tp 
ns 


tASC 
Column 
Address 
Setup 
Time 
tp-35 
ns 


tRCO 
RAS to CAS 
Delay 
Time 
2 tp- 
10 
2 tp+ 
45 
ns 


twcs 
WE Setup 
to CAS 
, 
.. 
tp-40 
ns 


tRSH 
RAS Hold 
Time 
5tp-30 
ns 


tCAS 
CAS 
Pulse 
Width 
5tp-30 
ns 


tRP 
RAS Precharge 
Time 
(See Note 
2) 
4 tp- 
30 
ns 


tWCH 
WE Hold 
Time 
to CAS 
5 tp- 
35 
ns 


tREF 
Internally 
Generated 
Refresh 
to Refresh 
Time 
64 Cycle 
548 tp 
576 tp 
ns 
128 Cycle 
264 tp 
288 tp 
ns 


tCR 
RD, WR to RAS Delay 
tpH + 30 
tpH+ 
tp+ 
75 
ns 


tcc 
RD, WR to CAS 
Delay 
•. -- 
-- 
tpH+ 
2 tp+ 
25 
tpH + 3 tp+ 
100 
ns 


tRFR 
REFRQ 
to RAS Delay 
1.5 tp+ 
30 
2.5 tp+ 
100 
ns 


tAS 
Ao-A,5 
to RD, WR Setup 
Time 
(See Note 
4) 
0 
ns 


tCA 
RD, WR to SACK 
Leading 
Edge 
tp+40 
ns 


tCK 
RD, WR to XACK, 
SACK 
Trailing 
Edge 
Delay 
30 
ns 


tKCH 
RD, WR Inactive 
Hold 
to SACK 
Trailing 
Edge 
10 
ns 


tsc 
RD, WR, Pes to X/CLK 
Setup 
Time 
(See Note 
3) 
15 
ns 


tcx 
CAS to XACK 
Time 
5 tp- 
40 
5 tp+ 
20 
ns 


tACK 
XACK 
Leading 
Edge to CAS Trailing 
Edge Time 
10 
ns 


txw 
XACK 
Pulse 
Width 
2 tp- 
25 
ns 


tLL 
REFRQ 
Pulse 
Width 
20 
ns 


tCHS 
RD, WR, 
PCS Active 
Hold 
to RAS 
- 
0 
ns 


tww 
WR to WE Propagation 
Delay 
8 
50 
ns 


tAL 
S, to ALE 
Se.tup Tim~ 
40 
ns 


tLA 
S, to ALE 
Hold 
Time 
2 tp+ 
40 
ns 


tpL 
External 
Clock 
Low Time 
- 
15 
" 
. , 


ns 


tpH 
External 
Clock 
High 
Time 
22 
ns 


tpH 
External 
Clock 
High 
Time 
for Vcc= 
5V ± 5% 
18 
ns 


Not •• : 


1. tp minimum 
determines 
maximum 
oscillator 
frequency. 


tp maximum 
determines 
minimum 
frequen~ 
maintain 
2 ms refresh rate and tAP minimum. 


2. To achieve the minimum time between the RAg" of a memory cycle and the FlAS of a refresh cycle, such as a transparent refresh, REFRQshould be 
pulsed In the previous memory cycle. 
3. tsc Is not required for proper operation which Is In agreementwith the other specs, but can be used to synchronize external signals with XlCLK If It Is 


desired. 
4. If tAS Is less than 0 then the only Impact Is that tASR decreases by a corresponding amount. 


READ CYCLE 
''''' 
'''' 
v•• 
W 
v~ 
..... 


v,. 
w 
® 
V~ 


leA. 
v•• 


ADDREUES 


V" 


v,. 


WE 


V" 


tllAC 


Va. 
HIGH 
IMPEDANCE 
!louT 


V••. 


WRITE CYCLE 
"C 
'''' 
v,. 
W 
CD 
® 
v" 
(!) 
Ie •• 
tco. 


tllCO 
'''. 
v,. 
tcoa 
W 
® 
v~ 


leA. 
v,. 


ADDRESSES 


V" 


tllWL 


leWL 
V,. 
---twC. 


WE 
.W. 


V" 


V,. 
D,. 


V" 


NOTEI: 
1.2. 
VIM MIH AND 
VIL MAX ARE 
REFERENCE 
LEVELS 
FOR 
MEASURINO 
TIMINO 
OF 
INPUT 
SIGNALS. 


3,4. 
Votj 
MIN AND 
VOL MAX ARE 
REFERENCE 
LEVELS 
FOR 
MEASURINO 
TIMINO 
OF DouT. 


5. 
to ••.••.II 
MEASURED 
TO 
lOUT < 110\.1. 


I. 
tot 
AND 
tDM ARE 
REFERENCED 
TO CD OR wr. 
WHICHEVER 
OCCURS 
LAST. 


a. 
lAcM 
II 
REFERENCED 
TO THE 
T!\AILING 
EDOE 
OF m 
OR W, WHICHEVER 
OCCURI 
FIRIT 
• 


•• 
tell" 
REQUIREMENT 
18 ONl.Y 
APPl.ICABLE 
FOR AiDlrn 
CYCl.ES 
PRECEDED 
IY 
A e:o.ONl.Y 
CYCLE 
(I•••• 
FOR 
SYSTEMS 
WHERE 
en 
HAS 
NQT 
IEEN 
DECODED 
WITH 
Mi). 


dP_R~ 
, 
2118-3 
2118-4 
2118·7 


Symbol 
PI",mltlr 
. ~, 
Min. 
MIx. 
Min. 
Max. 
Min. 
MIx. 
Unit 
Notll 


tRAC 
Access 
Time 
From 
RAS 
100 
120 
1SO 
ns 
4,5 


tCAC 
Access 
Time 
from CAS 
55 
65 
80 
ns 
4,5,6 


tREF 
Time 
Between 
Refresh 
2 
2 
2 
ms 


tRP 
RAS Precharge 
Time 
110 
120 
135 
ns 
1 
. 


tCPN 
CAS Precharge 
Time (non-page 
cycles 
SO 
55 
70 
ns 
),. 


tCRP 
CAS to RAS Precharge 
Time 
0 
0 
0 
ns 


tRCO 
RAS to CAS Delay Time 
25 
45 
25 
55 
25 
70 
ns 
7 


tRSH 
RAS Hold Time 
70 
85 
105 
ns 


tCSH 
CAS Hold Time 
100 
120 
165 
ns 


tASR 
Row Address 
Set·Up Time 
0 
0 
0 
ns 


tRAH 
Row Address 
Hold Time 
15 
15 
15 
ns 


tASC 
Column 
Address 
Set-Up Time 
0 
0 
0 
ns 


tCAH 
Column 
Address 
Hold Time 
15 
15 
20 
ns 


tAR 
Column 
Address 
Hold Time to RAS 
80 
70 
90 
ns 


tT 
Transilion 
Time (Rise and Fall) 
3 
SO' 
3 
SO 
3 
SO 
ns 
8 


tOFF 
Output 
Buffer 
Turn Off Delay 
0 
45 
0 
SO 
0 
80 
ns 


TRC 
Random 
Read Cycle 
Time 
235 
270 
320 
ns 


tRAS 
RAS Pulse Width 
115 
1סס oo 
140 
1סס oo 
175 
1סס oo 
ns 


tCAS 
CAS Pulse Width 
55 
1סס oo 
65 
1סס oo 
95 
1סס oo 
ns 


tRCS 
Read Command 
Set-Up Time 
0 , 


0 
0 
ns 


tRCH 
Read Command 
Hold Time 
0 
0 
0 
ns 


tRC 
Random 
Write 
Cycle 
Time 
235 
270 
320 
ns 


tRAS 
RAS Pulse Width 
115 
1סס oo 
140 
1סס oo 
175 
1סס oo 
ns 


tCAS 
CAS Pulse Width 
55 
1סס oo 
65 
1סס oo 
95 
1סס oo 
ns 


twcs 
Write 
Command 
Set-Up Time 
0 
0 
- 0 
ns 
9 


tWCH 
Write 
Command 
Hold Time 
25 
30 
45 
ns 


tWCR 
Write 
Command 
Hold Time, 
to RAS 
70 
85 
115 
ns 


twP 
Write 
Command 
Pulse Width 
25 
30 
SO 
ns 


tRwL 
Write 
Command 
to RAS Lead Time 
80 
65 
110 
ns 


tCWl 
Write 
Command 
to CAS Lead Time 
45 
(j() 
100 
ns 


tos 
Data-In Set-Up Time 
0 
0 
0 
ns 


tOH 
Data-In 
Hold Time 
25 
30 
I 
45 
ns 


tOHR 
Data-In 
Hold Time, 
to RAS 
70 
85 
115 
ns 


tRWC 
Read-Modlty·Wrlte 
Cycle 
Time 
285 
320 
410 
ns 


tRAW 
RMW Cycle 
RAS Pulse Width 
165 
1סס oo 
190 
1סס oo 
265 
1סס oo 
ns 


tCAW 
RMW Cycle 
CAS Pulse Width 
105 
1סס oo 
120 
1סס oo 
185 
1סס oo 
ns 


tAWO 
RAS to WE Delay 
100 
120 
1SO 
ns 
9 


!cwo 
CAS to WE Delay 
55 
65 
. 


80 
ns 
9 


NOTES: 
1. All yoltages 
referenced 
to VSS' 
_ 


2. 
Eight 
cycles 
afe required 
after 
power-up 
or prolonged 
periOds 
(greater 
than 
2 ma) of RA$ 
Inactivity 
before 
proper 
device 
operation 
Is achieved. 
Any 8 cycles 
which 
perform 


refresh 
are adequate 
for 
this 
purpose. 
3. 
A.C. 
Characteristics 
assume 
IT = 5 ns. 


4. 
Assume 
that 
tRCO 
" 
tACO 
(max.). 
If tRCO 
is greater 
than 
tRCO 
(max.) 
then 
tRAC 
will 
Increase 
by the 
amount 
that 
tRCO 
exceeds 
tRCO 
(max.). 
5. 
Load = 2 TIL 
loads 
and 
100 pF. 


~: ~:~~7~~~r.o.:':~fi~d(~::·);eference 
poln' only; If tACO I. I••• 
Ihan tACO (max.) access time Is tAAC' If tACO I. grealar than 'ACO (max.) ace••• 
time Is tACO+ tCAC. 


8. 
tT Is measured 
between 
VIH 
(min.) 
and 
VIL (max.). 


9. 
twcs, 
tcwo 
and 
tRWO 
are specified 
as reference 
points 
only. 
If twcs 
•• twcs 
(mln.) 
the 
cycle 
is an early 
write 
cycle 
and 
the 
data 
out 
pin 
will 
remain 
high 
Impedance 
throughout 
the 
entire 
cycle. 
If tcwo 
•• tcwo 
(min.) 
and 
tRWO 
•• tRWO 
(min.), 
the 
cycle 
is a read-modify-write 
cycle 
and 
the 
data 
out 
will 
contain 
the 
data 
read 
trom 
the 
selected 
address. 
It neither 
of the 
above 
conditions 
is satisfied, 
the 
condition 
of the data 
out 
is indeterminate. 


1) SACK timing from command will not allow reliable 
operation while XACK is not active early enough to pre- 
vent wait states. 


2) The normal write command required to guarantee 
data setup is not enabled until the CPU has sampled 
READY thereby forcing multiple wait states during write 
operations. 


The first problem could be resolved if an early command 
could be generated that would guarantee SACK was 


the maximum mode status. The early command is en- 
abled from the trailing edge of ALE and disabled on the 
trailing edge of the normal command. 
The command 


provides 
an 
additional 
TCHCLmin- 
TCHLLmax+ 


TCLMLmax-circuit 
delays=53 
ns of access time and 


time to generate ROYfrom the early command. If we go 
back to our previous equations, early command to valid 
data at the CPU is now: 


TCHCLmin - TCHLLmax + 2TCLCL - TDVCLmax - buf- 
fer and circuit delays = 333 ns 
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We can now use the slowest 2118 which gives 8202 and 
2118 access of 320 ns. Early command to ROY timing is 
TCLCL - TCHLLmax - circuit 
delays - TR1VCLmax = 


115 ns and provides 35 ns of margin beyond the 8202 
command to SACK delay. 


The write timing of the 8202 and write data valid timing 
of the 8086 do not allow use of an early write command. 
However, if the 8202 clock is reduced from 25 MHz to 20 
MHz and WE to the RAM's is gated with CAS, the ad- 
vanced write command (AMWC) may be used. At 20 MHz 
the minimum command to' CAS delay is 148 ns while the 
maximum data valid delay is 144 ns. 


The reduced 8202 clock frequency still satisfies 
no wait 


state read operation 
from early read and will insert no 


more than one wait state for write (assuming no conflict 
with 
refresh). 20 MHz 8202 operation 
will 
however re- 
quire using the 2118-4 to satisfy 
read access time. 


Note that slowing the 8202 to 22.2 MHz guarantees valid 
data within 10 ns after CAS and allows using the 2118-7. 
Since 
this 
analysis 
is totally 
based 
on worst 
case 


minimum 
and maximum 
delays, 
the designer 
should 


evaluate 
the timing 
requirements 
of his specific 
im- 
plementation. 


It should be noted that the 8202 SACK is equivalent 
to 


XACK timing if the cycle being executed was delayed by 


refresh. 
Delaying 
SACK until 
XACK time 
causes 
the 


CPU to enter wait states until the cycle is completed. 
If 


the cycle is a read cycle, the XACK timing 
guarantees 


data is valid at the CPU before ROY is issued to the CPU. 


The use of the early command 
signals 
also solves a 


problem not mentioned 
previously. The cycle rate of the 


8202 @ 20 MHz requires that commands 
(from leading 


edge to leading edge) be separated by a minimum of 695 
ns. The maximum mode 8086 however may issue a read 
command 600 ns after the normal write command. 
For 


the early read command and advanced write command, 
725 ns are guaranteed 
between commands. 


APPENDIX 
I 
BUS CONTENTION 
AND ITS EFFECT ON SYSTEM INTEGRITY 


As higher performance 
microprocessors 
have become 
available, 
the architecture 
of microprocessor 
systems 
has been. evolving, again placing demands on memory. 
For many years, system 
designers 
have been plagued 
with 
the problem 
of bus contention 
when connecting 
multiple 
memories 
to a common data bus. There have 
been various 
schemes 
for avoiding 
the problem, 
but 
device manufacturers 
have been unable to design inter- 


nal circuits 
that 
would 
guarantee 
that 
one 
memory 
device would 
be "off" 
the bus before another 
device 
was selected. With small memories (512x8 and 1Kx8), it 
has been traditional 
to connect all the system address 
lines together 
and utilize 
the'difference 
between tACC 
and tco to perform a decode to select the correct device 
(as shown in Figure 1). 


With the 1702A, the chip select to output delay was only 
100 ns shorter than the address access time; or to state 
it another way, the tACCtime was 1000 ns while the tco 
time was 900 ns. The 1702A tACCperformance of 1000 ns 
was suitable 
for the 4004 series microprocessors, 
but 
the 
8080 processor 
required 
that 
the 
corresponding 
numbers be reduced to tACC= 450 ns and tco= 
120 ns. 
This allowed a substantial 
improvement 
in performance 
over the 4004 series of microprocessors, 
but placed a 
substantial 
burden 
on 
the 
memory. 
The 
2708 was 
developed 
to be compatible 
with the 8080 both in ac- 
cess time and power supply requirements. 
A portion of 
each 8080 machine cycle time had to be devoted to the 
architecture 
of the system decoding scheme used. This 
devoted portion of the machine cycle Included the time 
required for the system controller 
(8224) to perform its 
function 
before the actual decode process could begin. 


Let's 
pause 
here 
and 
examine 
the 
actual 
decode 
scheme that was used so we can understand 
how the 
control 
functions 
that 
a memory 
device 
requires 
are 
related to system architecture. 


The 2708 can be used to illustrate 
the problem of having 
a single control 
line. The 2708 has only one read control 


function, 
chip select (eS), which is very fast (tco= 
120 
ns) with respect to the overall access time (tACC= 450 
ns) of the 2708. It is this time difference 
(330 ns) that is 
used to perform 
the decode function, 
as illustrated 
in 
Figure 
2. The scheme 
works 
well and does not limit 


system performance, 
but it does lead to the possibility 
of bus contention. 
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BUS CONTENTION 


There 
are 
actually 
two 
problems 
with 
the 
scheme 
described 
in the previous section. First, if one device in 
a mUltiple memory system has a relatively long deselect 
time, and a relatively fast decoder is used, it would be 
possible 
to have another device selected 
at the same 
time. If the two devices thus selected were reading op- 
posite data; that is, device number one reading a HIGH 
and device number two reading a LOW, the output tran- 
sistors of the two memory devices would effectively 
pro- 


duce a short circuit, as Figure 3 illustrates. 
In this case, 


the current 
path is from Vcc on device number one to 
GND on device number two. This current is limited only 
by the "on" 
impedance 
of the MOS output 
transistors 
and can reach levels in excess of 200 mA per device. If 
the MOS transistors 
have a lot of "extra" 
margin, the 
current 
is usually 
not destructive; 
however, an instan- 


taneous load of 400 mA can produce "glitches" 
on the 
Vcc supply-glitches 
large enough to cause standard 
TIL devices to drop bits or otherwise 
malfunction, 
thus 
causing incorrect address decode or generation. 


The second problem with a single control line scheme is 
more subtle. As previously mentioned, there is only one 
control function 
available on the 2708 and any decoding 
scheme must use it out of necessity. In addition, any in· 
advertent changes in the state of the high order address 
lines that are inputs to the decoder wiil cause a change 
in the device that is selected. The result is the same as 
before-bus 
contention, 
only from a different 
source. 


The deselected 
device cannot get "off" 
the bus before 
the selected one is "on" the bus as the addresses rapid· 
Iy change state. One approach to solving this problem 
would be to design (and specify as a maximum) devices 


with tOF time less than tco time, thereby assuring that if 
one device is selected while another Is simultaneously 
being deselected, there would be some small (20 ns) 
margin. Even with this solution, the user would not be 
protected from devices which have very fast tco times 
(tco is specified as a maximum). 
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The only sure solution appears to be the use of an exter· 
nal 
bus driver/transceiver 
that 
has an 
independent 
enable function. 
Then that function, 
not the "device 
selecting 
function," 
or addresses, could control 
the 
flow of data "on" and "off" 
the bus, and any contention 
problems would be confined to a particular card or area 
of a large card. In fact, many systems are Implemented 
that way-the 
use of bus drivers Is not at all uncommon 


In large systems where the drive requirements of long, 
highly capacitive interconnecting 
lines must be taken 
into consideration-It 
also may be the reason Whymore 
system designers were not aware of the bus contention 
problem until they took a previously large (multlcard) 
system and, using an advanced mlcorprocessor 
and 
higher density memory devices, combined them all on 
one card, thereby eliminating 
the requirement for the 
bus drivers, but experiencing the problem of bus con- 
tention as described above. 


THE MICROPROCESSOR/MEMORY 
INTERFACE 


From the foregoing discussion, 
it becomes clear that 
some new concepts, both with regard to architecture 
and performance are required. A new generation of two 
control line devices is called for with general require· 
ments as listed below: 


1. Capability 
to control 
the data "on" 
and "off" 
the 
system bus, independent of the device selecting func- 
tion identified above. 


2. Access time compatible with the high performance 
microprocessors 
that are currently available .. 


Now let's examine the system architecture 
that Is re- 


quired to Implement the two line control and prevent 
bus contention. 
This is shown In the form of a timing 
diagram (Figure 4). As before, addresses are used to 


generate the unique device selecting 
function, 
but a 


separate and Independent Output Enable (OE)control Is 
now used to gate data "on" and "off" 
the system data 
bus. With this scheme, bus contention 
is completely 


eliminated as the processor determines the time during 
which 
data must 
be present on the 
bus and then 


releases the bus by way of the Output Enable line, thus 
freeing 
the 
bus 
for 
use 
by 
other 
devices, 
either 
memories or peripheral devices. This type of architec- 
ture can be easily accomplished If the memory devices 
have two 
control 
functions, 
and the system 
Is Im- 


plemented according 
to the block diagram shown In 
Figure 5. It differs 
from the previous block diagram 
(shown In Figure 1) In that the control bus, which Is con- 
nected to all memory Output 
Enable pins, provides 
separate and Independent control over the data bus. In 
this way, the microprocessor Is always In control of the 
system; while In the previous system, the microproc- 
essor passed control to the particular memory device 
and then waited for data to become available. Another 
way to look at It Is, with a single control line the sytem Is 
always asynchronous with respect to microprocessor/ 
memory communications. 
By using two control 
lines, 


the memory Is synchronized to the processor. 
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Multitasking 
For the 8086 


PROCEDURES 


ACTIVATE$TASK Procedure 
ACTIVATE$DELAY Procedure 
DECREMENT$DELAY Procedure 
CASE$TASK Procedure 
PREEMPT Procedure 
DISPATCH Procedure 


PLlM·86 
PROCEDURES 


Initialization 
and the Main Loop 
Additional 
Ideas 
Source Code 


Real-time software systems differ markedly from batch 
processing systems. An external signal indicating that 
it is time for an hourly log or an interrupt caused by an 
emergency condition is an event usually not encoun- 
tered in batch processing. Because real-time control 
systems of all types share a number of characteristics, 
it is possible to develop flexible 
operating systems 


which will meet the needs of a great majority of real· 
time applications. Intel Corporation has developed such 
a system, the RMXl80TMsystem, for the iSBCTMline of 
8080/85based single board computers. Thus, the user is 
released from the chore of designing 
an operating 


system and is free to concentrate his efforts on the 
applications 
software 
for the 
individual 
tasks 
and 


merely integrate them into a pre·existing system. 


But what if a user does not needall the capabilities of an 
RMX/80™ system or wants a different hardware con- 
figuration than an iSBC™ computer? This application 
note contains a set of PLlM-86 procedures designed to 
be used in medium-complexity 8086 real·time systems. 


A normal control system can be broken down into a 
number of concurrently executable tasks. The CPU can 
be running only one task at any instant of time but the 
speed of the processor often makes concurrent tasks 
appear to be running simultaneously. Breaking the soft- 
ware functions into separate concurrent tasks is the job 
of the designer/programmer. Once this is done there re- 
mains the problem of integrating these tasks with a 
supervisory program which acts as a traffic cop in the 
scheduling and execution of the separate tasks. This 
note discusses a set of PLlM-86 procedures to imple- 
ment the supervisory program function. 


A minimum operating system might (like Its batch proc- 
essing cousin) have only a queue for ready tasks (tasks 
waiting to be executed). Any task that becomes ready is 
put on the bottom of the queue and when a running task 
is finished, the task on the top of the queue is started. 
Any interrupt causes the state of the system to be 
saved, an interrupt routine to be executed, the state of 
the system to be restored, and execution of the inter- 
rupted program to continue. The interrupt routine 'might 
(or might not) put a new task on the ready queue. This 
approach has worked well for many simple control 
systems, especially in the single-chip computer area. 
But what features are lacking in this approach that are 
necessary (or at least nice)? 


1. A system of priorities is often needed. All waiting 
ready tasks must be executed sooner or later but some 
tasks need immediate attention while others can be run 
when there is nothing else to do. If a midnight monthly 
report, due for completion by 8 a.m. the next day, is in 
the process of printing at 1 a.m. and a fire alarm occurs, 
it is reasonable to assume that the fire alarm has higher 
priority 
since the fire could conceivably 
render the 


monthly report irrelevant. 


There are a number of ways in which to assign priorities. 
Tasks are usually 
numbered and may be assigned 


priorities according to their ascending (or descending) 
numbers. They could .instead be grouped into a number 
of priority levels, with tasks on the same level having 
equal priorities. The latter approach is taken in this 
application note. 


Ass'ume'that a monthly report is being printed and an 
alarm occurs in the external world that, because of its 
importance, must be attended to immediately. The inter- 
rupt routine, executed as a result of the alarm input, 
should not automatically return to the interrupted log· 
ging routine but instead should call a preempt routine 
which checks to see if a higher priority task is ready for 
execution. The reason for this is that the monthly report 
routine, if returned to, has no way of "knowing" 
that a 


higher priority task is waiting to be executed. The alarm 
output task has been readied by the interrupt routine 
and since it is known to be higher priority than the log· 
ging task, it 
is executed first, 
thereby immediately 
signaling the system operator that there has been an 
alarm. It then returns to the loggin!) task provided that 
there are no further high priority tasks waiting to be exe· 
cuted. The logging printer may not have even paused 
during the alarm output task. The computer appears to 
human 
beings 
to 
be 
executing 
concurrent 
tasks 


simultaneously. 


Of course, the alarm output function could be performed 
inside the interrupt procedure. But sooner or later, the 
designer will encounter a worst case situation in which 
there is not enough time to execute all required tasks 
between interrupts, and the system will fall behind in 
real-time. It is much cleaner to make the interrupt pro· 
cedures as short as possible and stack up tasks to be 
executed than to stack up interrupt. procedures. 


2. Another feature that might be necessary is a capabil- 
ity to put a task to sleep for a known period of real time. 
Assume a relay output must remain closed for one sec- 
ond. Most real·time systems cannot tolerate the dedica- 
tion of the CPU to such a trivial task for that length of 
time so a system of programmable dynamic delays 
could be implemented. This application 
note imple- 


ments such a system. 


Although the PLlM-86 procedures here have been de- 
bugged and tested, it is assumed that the user will want 
to change, add, or delete features as needed.This appli· 
cation note is 'intended to present ideas for a logical 
structure of procedures that, because they are written in 
PLlM-86, can be easily modi'fied to user requirements. 
Each procedure will be discussed in detail and integra- 
tion and optional features will be presented. 


PLlM-86 


PLM-86 is a block structured high level language that 
allows 
direct 
design 
of 
software 
modules. 
Using 


PLlM-86, designers 
can forget 
their assembly 
level 


PLM-86 
STRUCTURE 


PLlM-86 automatically 
keeps track of the level of the dif- 


ferent 
software 
blocks. 
(See Chapter 
10, "PLlM-86 
Pro- 


gramming 
Manual"). 
There 
are 
methods 
of 
writing 
PLlM-86 
which 
contribute 
to 
the 
understandability 
of 


the source 
code without 
adding 
to the amount 
of object 
code 
generated. 
For 
instance, 
the 
following 
three 
IF/THEN/ELSE 
blocks 
generate 
identical 
object 
code 


but are compiled 
from 
different 
source 
statements. 


Line 
Level 
Statement 


3 
IFA=BTHENC=D;ELSEE=F;G=H; 


7 
IFA=BTHEN 


8 
C=D; 
ELSE 


9 
1 
E= F; 


10 
1 
G= 
H; , 


11 
1 
IF A= 
B THEN 
DO; 


13 
2 
C = 
D; 


14 
2 
END; 
15 
1 
ELSE DO; 


16 
2 
E=F; 
17 
2 
END; 
18 
1 
G= 
H; 


It is not instantly 
apparent 
from the code on line 3 or the 
code 
starting 
at line 
7 which 
statements 
will 
be exe- 


cuted. 
However, 
adding 
the 
DO; and 
END; statements 


(starting 
at 
line 
11) 
remove 
any 
doubt. 
Either 
the 


statements 
starting 
at line 11 or the statements 
starting 


at line 15 will 
be executed 
and the statement 
on line 18 


will 
be executed 
in either 
case. Why? 
Because 
all these 


lines are at level 1 in the block 
structure. 
The other 
lines 


are at level 
2 because 
of the 
DO;/END; 
combinations. 
When one refers 
to the relatively 
complex 
structures 
of 


the task multiplexer 
procedures, 
the usefulness 
of such 


an approach 
is obvious, 
as the procedures 
have been in- 
dented 
according 
to 
the 
level 
numbers 
generated 
by 
PLlM-86. 
In particular, 
if 
the 
designer 
is 
not 
careful, 


nested 
IF/THEN/ELSE 
'statements 
can 
generate 
im- 
proper 
results. 
Using a proper 
number 
of DO;/END; 
com- 
binations 
avoids 
the 
possible 
ambiguity 
in 
nested 
IF/THEN/ELSE 
statements 
as can be seen 
in the ACTI- 
VATE$TASK 
procedure 
listed 
in 
the 
PLlM-86 
source 


code later in this 
note. The DO;lEND; 
construct 
naturally 


must 
be used 
when 
multiple 
statements 
are required 


within 
the 
IF/THEN/ELSE 
blocks. 
Following 
are exam- 
ples of the possible 
primary 
structures 
of PLlM-86: 


DO; 
A=B; 
C=D; 
END; 


DO 
1= 1 TO 5; 
A=I; 
C=D+I; 
END; 


DO CASE A; 


A= B; 
A=C; 
A=D; 
END; 


IF 
A= 
B THEN 
DO; 


C=D; 
END; 


ELSE DO; 


E= F; 
END; 


IF 
A = B THEN 
DO; 
C=D; 
END; 


ELSE IF A = C THEN 
DO; 
D=E; 
END; 


ELSE IF A= 
D THEN 
DO; 


E= F; 
END; 


ELSE DO; 


F=G; 
END; 


A 
complete 
tutorial 
on 
structured 
programming 
is 


beyond 
the scope 
and intent 
of this.application 
note and 


the reader 
is referred 
to the appropriate 
references 
ap· 


pearing 
in the bibliography. 


Once 
a decision 
is made 
on the details 
of the kind 
of 


data 
structure 
that 
is 
needed 
to 
implement 
the 
task 


multiplexer, 
the procedures 
that 
manipulate 
the struc- 


ture 
are relatively 
simple 
to write. 
The following 
char- 


acteristics 
are assumed 
for the task multiplexer 
appear- 


ing in this 
application 
note. 


There 
are two 
levels 
of priority, 
high 
and 
low. 
All 
high 


priority 
tasks 
that 
are ready 
to run will 
be dispatched, 


executed, 
and completed, 
on a FIFO 
basis, 
before 
any 


low priority 
task 
is dispatched. 


Any 
task 
can 
be interrupted. 
No task 
multiplexer 
pro- 
cedure 
can be interrupted. 


If a high priority 
task is interrupted, 
it will 
be completed 


before 
any other 
task is dispatched. 
If a low priority 
task 


is interrupted, 
all 
ready 
high 
priority 
tasks 
will 
be dis- 


patched, 
executed, 
and completed 
before 
program 
con· 


trol 
is returned 
to the low priority 
task. 


(top) poi nter and a tai I (bottom) 
poi nter and tasks on any 
queue 
are link-listed 
from 
head to tail. 
Tasks 
are "dis- 


patched" 
(taken 
off 
the queue) 
at the 
head and "acti- 
vated" 
(put on the queue) at the tail on a FIFO basis. 


Link-listed 
queues 
are 
chosen 
for 
simplicity. 
All 
dis- 
patch 
and activate 
information 
is contained 
in the head 
and tail 
pointers. 
Tasks 
located 
in the middle 
of these 


link-lists 
are of no concern 
for activating 
and dispatch- 


ing. This 
means, 
of course, 
that 
tasks 
are executed 
in 


the 
order 
that 
they 
appear 
on the 
queue, 
i.e., first-in, 
first-out. 


There 
is a pointer 
byte 
associated 
with 
each 
task. 
If a 


task 
is on either 
the low priority 
or high 
priority 
ready 
queue, 
its associated 
pointer 
byte will 
point 
to the next 
task 
number 
on the list. These 
pointer 
bytes enable 
the 
task ready lists to be linked. 
Note that the pointer 
byte is 
o for the last task on a list. 


There is a status 
(flag) byte associated 
with each task. If 
a task is on a ready list or a delay list, bit 7 will 
be a "1" 
indicating 
that that particular 
task is busy. If a task is on 


either 
high priority 
or low priority 
ready queues, 
bit 6 will 
be a "1" 
indicating 
that the task 
is on one of the ready 
queues. 
If the task 
is listed 
on the delay 
list, 
(see next 
item), 
bit 5 will 
be a "1" 
indicating 
that 
this 
particular 


task 
has a delay 
in progress. 
If a task 
is unlisted, 
bits 


5-7 
will 
be "0." Bits 
0-4 are 
not 
used 
by 
the 
task 
multiplexer 
procedures 
and are available 
to the user, giv- 


ing 5 user defined 
flags 
per task. 


There 
is a delay 
byte 
associated 
with 
each 
task. 
This 
feature 
allows 
tasks 
to be "put 
to sleep" 
for a variable 
length 
of 
time, 
from 
1 to 255 "ticks" 
of the 
interrupt 


clock. 
If a task does 
not need an associated 
delay then 


this 
byte 
is available 
to the user as a utility 
byte to be 
used for any purpose. 
These delays 
will 
be discussed 
in 
detail 
later 
in the application 
note. 


The following 
diagram 
is a representation 
of the 
task 
multiplexer 
data structure: 


TASK NUMBER 
POINTER 
BYTE 
STATUS BYTE 
DELAY BYTE 


a 
n 
n+1 
n+2 
1 
n+3 
n+4 
n+5 
2 
n+6 
n+7 
n+8 


3 
n+9 
n+ 10 
n+ 
11 


4 
n+ 12 
n+ 13 
n+ 14 


5 
n+ 15 
n+ 
1£ 
n+17 


m-1 
n+ 3m-6 
n+3m- 
5 
n+ 3m-4 
m 
n + 3m - 2 
n+3m- 
1 
n+3m 


3m + 3 TOTAL 
RAM BYTES 
n = FIRST RAM ADDRESS 
OF ARRAY 


Following 
is a chart 
of 
what 
a task 
multiplexer 
data 


structure 
might 
look 
like at a given 
moment 
in time: 


TASK NUMBER 
TASK(n).PNTR 
TASK(n).STATUS 
TASK(n).DELAY 


a 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 


1100 0000 
1010 0000 
1100 0000 
1010 0000 
1100 0000 
0000 
0000 
1010 0000 
1lOa 0000 
0000 
0000 


1100 0000 


What information 
can one ascertain 
from observation 
of 


the above chart?-The 
ready-to-run 
high priority 
tasks, 
in 
order, are 5,1,3. This can be seen by following 
the high 


priority 
ready linked 
list from 
head to tail. The ready-to- 


run 
low 
priority 
tasks, 
in 
order 
are 
8, 
10. 
The 


TASK(n).PNTR 
byte=O 
for the last listed 
task. Tasks 
4, 


7, 2 are 
listed, 
in 
order, 
on 
the 
delay 
list 
and 
have 


associated 
delays 
of 4,10,13 ticks 
respectively. 
Tasks 6 


and 9 are not 
listed 
and 
therefore 
idle. 
The 
• for 
the 


TASK (0) bytes 
indicate 
a special 
condition. 
There 
is no 


TASKOO allowed 
and a zero condition 
is treated 
as an er- 


ror 
condition. 
TASK(O).PNTR 
byte 
is 
used 
for 
the 


DELAY$HEAD 
byte 
to 
minimize 
code 
in 
the 
ACTI- 


VATE$DELAY 
procedure. 
TASK(O).STATUS 
and 


TASK(O).DELAY 
are unused 
bytes. 


NEW$TASK 
is the 
number 
of the 
task 
that 
will 
be in- 


stalled 
on 
a ready 
list 
or 
the 
delay 
list 
when 
ACTI- 


VATE$TASK 
or ACTIVATE$DELAY 
is called. 


NEW$DELAY 
is the value 
of the delay 
that 
will 
be in- 


stalled 
on 
the 
delay 
list 
when 
ACTIVATE$DELAY 
is 


called. 


A task is defined 
as RUNNING 
if it is in the act of execu- 


tion 
or if an interrupt 
routine 
is executing 
which 
inter- 


rupted 
a RUNNING 
task. 


A task 
is defined 
as PREEMPTED 
if it has been 
inter- 


rupted 
and a higher 
priority 
task 
is being 
executed. 


A task is defined 
as READY if it is contained 
within 
one 


of the ready queues. 


A task 
is defined 
as IDLE if its BUSY$BIT 
(bit 7) is not 


set, 
i.e., it is not 
listed 
anywhere 
else. 
Note 
that 
it is 


possible 
to completely 
disable 
an IDLE task 
simply 
by 


setting 
its BUSY$BIT. 
In that case, it is not and cannot 


be listed 
anywhere 
else. 
This 
feature 
is useful 
during 


system 
integration. 


STATE DIAGRAM 


The state diagram indicates the relationships 
among 


the possible task states and the procedures involved in 
changing states. 


The state diagram looks somewhat complicated 
and a 


discussion of the possible change of states is in order. 
Assuming 
a certain 
existing 
state, 
future 
possible 


states will be discussed including the procedures which 
can cause the change of state. 


From the unlisted (idle) state, the ACTIVATE$TASK pro· 
cedure will put the NEW$TASK on either the high priori- 
ty ready queue or the low priority ready queue at the tail 
end of the queue. The number of the task automatically 
assigns the priority and therefore the proper queue. All 
task numbers below FIRST$LOW$PRIORITY$TASK are 
assumed 
to 
be high 
priority 
tasks. 
Also, 
from 
the 


unlisted state the ACTIVATE$DELAY procedure will put 
the NEW$TASK and NEW$DELAY at the proper position 
on the delay list. 


After a task has been put on either high priority ready 
queue or low priority ready queue it eventually will go to 
the RUNNING$TASK state. The DISPATCH procedure 
accomplishes this action. 


From the delay list a task can only go to one of the ready 
queues. When a task's associated delay goes to zero the 
DECREMENT$DELAY 
procedure 
calls 
the 
ACTI- 


VATE$TASK procedure and installs the NEW$TASK on 
the proper ready queue. 


From the RUNNING$TASK state a task may use the' 
CASE$TASK procedure to put itself on the ready list tail 
by 
setting 
NEW$TASK= RUNNING$TASK. 
It 
may 
instead 
put 
itself 
on 
the 
delay 
list 
by 
setting 


NEW$TASK= 
RUNNING$TASK 
and 
also 
setting 


NEW$DELAY equal to something other than zero. Other- 
wise, it will progress to the unlisted state upon comple- 
tion. 


The CASE$TASK procedure 
unlists 
tasks when they 


have 
completed 
execution. 
A 
low 
priority 
RUN- 


NING$TASK will go to the preempted state if a high 
priority task is on the ready list following 
an interrupt 


during execution of the low priority task if the PREEMPT 
procedure is called. 


And finally, a PREEMPTED$TASK will return to a RUN- 
NING$TASK state when all high priority 
ready tasks 


have completed execution. This is accomplished by the 
DISPATCH procedure which then returns to the PRE- 
EMPT procedure. 


Some lockouts are necessary to avoid chaos in the task 
multiplexer. These are as follows: 
The unused bits in the STATUSbyte are available to the 
user. 


The BUSY$BIT= 1 in the TASK(n).STATUS byte will 
abort the ACTIVATE$TASKand the ACTIVATE$DELAY 
procedures and return an indication of the aborting by 
selling 
the STATUS byte equal zero. A task must be 


unlisted to be able to be installed on a list. 


A RUNNING$TASK may put itself on a list after it has 
executed but it is not allowed to re-list any listed tasks 
(i.e., no task may ever be listed twice at the same time!). 
A task that tries to activate another task that is already 
busy can wait (via the delay feature) for the required task 
to complete execution, become idle, and therefore be 
available to be activated. A PREEMPTED$TASKmay not 
be listed. If the ACTIVATE$TASKor ACTIVATE$DELAY 
procedure 
is 
called 
and 
NEW$TASK= 
PRE- 
EMPTED$TASK, the procedure will 
be aborted and 


return with STATUS=O. Otherwise, the STATUSbyte is 
returned with the new task status. 


Only one task may be preempted as there are only two 
levels of priority. The user may desire to implement 
many levels of priority in which case a linked-list of 
preempted tasks could be declared in a structure which 
includes the number of the first task in each priority 
level group of tasks. This obviously complicates the 
PREEMPTand DISPATCH procedures. 


The tasks themselves are made into reentrant proce- 
dures because of the necessary forward references of 
the CASE$TASK procedure. 


PLlM-86 allows structures and arrays of structures. The 
structure needed for the task multiplexer is a link-list 
pointer byte, a task status byte, and a task delay byte. 
Each task has an associated pointer byte, status byte, 
and delay byte. These are combined into an array of up 
to 255 tasks. For purposes of this discussion, 
the 


number of tasks is chosen as an arbitrary 10, leading to 
the following array declaration. 


DECLARETASK(10)STRUCTURE 
(PNTR BYTE,STATUSBYTE,DELAYBYTE); 


Thus the delay byte associated with task number 7 can 
be accessed by using the variable TASK(7).DELAYand 
the status of task number 5 can be examined through 
the use of TASK(5).STATUS.The TASK(n).PNTR byte 
contains the task number of the next listed task on the 
same list as TASK(n),i.e., if TASK(n)is on the delay list, 
then TASK(n).PNTRwill contain the number of the next 
task on the delay list or 0 indicating the end of the list. 


TASK(n).STATUSis a byte with the following reserved 
flags: 


BIT 7 
BUSY$BIT,"1" IF TASK IS BUSY 
BIT 6 
READY$BIT,"1" IF ON READY LIST 
BIT 5 
DELAY$BIT,"1" IF ON DELAY LIST 
BIT 4 - 
BIT 0 
UNUSED 


The TASK(n).DELAY byte is a number which can put 
TASK(n) to sleep for up to 255 system clock ticks. The 
system clock tick is interrupt driven from the user's 
timer and'its period is chosen for the particular applica- 
tion. A one millisecond timer is popular and assuming 
such a time, delays of up to 255 ms are available in the 
task multiplexer as it is written. If this delay range is not 
wide 
enough, 
the 
user 
may 
want 
to 
define 
his 


TASK(n).DELAY as a word instead of a byte in the 
PLlM-86 declare statement, giving delays of up to 65 
seconds from the basic one millisecond clock tick. 


Linked 
lists 
are useful 
for 
a number of 
reasons, 


However, a treatise on linked lists would defeat the pur- 
pose of this application note and the reader is referred 
to the references listed in the bibliography. 


The linked lists used in this application. note have a 
head byte associated with each list, i.e., the head byte 
contains the number of the first task on the list. The first 
task pointer byte points to the second task on the list, 
etc. The pointer of the last task on the list is set at zero 
to indicate that it is the last task. Two of the linked lists 
are readyqueues and require a tail byte as well as a head 
byte, The tai I byte points to the last entry on the list. 
Tasks are put on the bottom, or tail, of the ready lists 
and are taken off the top, or head, of the ready lists, The 
delay list has no tail but does have a head, called a 
DELAY$HEAD.The delay list is not a queue, as delays 
are installed on the list in order of delay magnitude for 
reasons to be explained later. 


There are two ready lists, one for high priority tasks and 
one for low priority tasks. The head and tail pointers 
associated with these two lists are: HIGH$PRIORITY$ 
HEAD, HIGH$PRIORITY$TAIL,LOW$PRIORITY$HEAD, 
and LOW$PRIORITY$TAIL.Obviously, the structure can 
be expanded to any number of priority levels by expand- 
ing the head and tail pointers and the historical record 
of the preempted tasks. 


A task multiplexer can have a number of simultaneous 
delays active and it would be efficient 
if there were a 


way to keep from decrementing all delays on every clock 
tick, which is most time consuming. One way to accom- 
plish 
this 
feat 
is 
to 
move the 
problem 
from 
the 


DECREMENT$DELAYroutine to the ACTIVATE$DELAY 
routine. The delays are arranged in a linked-list 
of 


ascending sizes such that the value of each delay in- 
cludes the sum of all previous delays, This allows the 
decrementing of only one delay during each clock tick 
interrupt routine. An example will further illuminate this 
approach. Suppose the following conditions exist: 


Task 7 has a 5 millisecond delay 


Task 3 has an 8 millisecond delay 


Task 9 has a 14 millisecond delay 


DELAY$HEAD= 07 
TASK(7).PNTR= 03 
TASK(3).PNTR= 09 
TASK(9).PNTR= 00 
TASK(7).DELAY= 05 (FIRST DELAY= 5) 
TASK(3).DELAY=03 (5+3=8) 
TASK(9).DELAY= 06 (5+ 3 + 6 = 14) 


The linked-list 
is arranged so that the delays are in 
ascending order and each delay is equal to the sum of 
all previous delays up through that point. Since this is 
true, all delays are effectively decremented merely by 
decrementing the first delay. Of course, something for 
nothing is impossible and the speed gained by arrang- 
ing the delays in the above manner is paid for by the 
complexity of the ACTIVATE$DELAY routine. But since 
the ACTIVATE$DELAY routine 
is executed 
less fre- 


quently than the DECREMENT$DELAYroutine, the sav- 
ings in real time is worth the added complexity. 


Suppose a new delay is to be activated in the above 
scheme. Task 5 with a delay of 10 milliseconds is to be 
added. A before and after chart will indicate what the 
ACTIVATE$DELAY procedure must accomplish. 


BEFORE 


TASK NUMBER 


POINTER 


DELAY 


07 
03 


07 
03 
09 


05 
03 


AFTER 


TASK NUMBER 
07 
03 
05 
09 


POINTER 
07 
03 
05' 
09@ 
00 


DELAY 
05 
03 
02@ 
04' 


FIRST POINTER IS THE DELAY$HEAD 
CHANGES ARE MARKED WITH AN ' 
ADDITIONS ARE MARKED WITH AN @ 


Note that the pointer before the added task has changed 
and the delay after the added task has changed. The 
function 
of the ACTIVATE$DELAY procedure is to ac- 


complish these changes and additions. 


The following 
procedure explanations 
reference the 
PLlM·86 source code listing which follows the applica- 
tion note text. 


This procedure is initiated by a call instruction with the 
byte NEW$TASK containing the number of the task to 
be put on the proper ready queue. 


Interrupts must be disabled whenever the link·lists are 
being changed. If interrupts 
are enabled when this 


procedure is called, they should be re-enabled upon 
returning. 


The assignment of priority is a simple matter. A declare 
statement, 
DECLARE 
FIRST$LOW$PRIORITY$TASK 


LITERALLY 'N,' (where N is the actual number of the 
first low priority task) indicates to the procedures that 
tasks 1 to N are high priority tasks and tasks N or higher 
are low priority tasks. 


This procedure checks the busy bit in the status byte to 
see if this particular task is already busy and if so, 
returns a STATUS of zero. Otherwise, it returns the new 
STATUS of the task. It then checks the priority to see if 
this particular task is a high or low priority. If it is high 
priority, then the task pointer pointed to by the HIGH$ 
PRIORITY$TAIL pointer 
is changed from zero to the 


number of the NEW$TASK. The HIGH$PRIORITY$TAIL 
pointer 
is 
then 
changed 
to 
the 
number 
of 
the 


NEW$TASK and the pointer 
associated 
with 
NEW$ 


TASK is made equal to zero. This completes the ACTI- 
VATE$TASK functions. If the new task is a low priority 
task, then the same functions are performed using the 
LOW$PRIORITY$TAILpointer. 


This procedure is initiated by a call with the byte NEW$ 
TASK containing the number of the task to be put on the 
delay list and the byte NEW$DELAY containing 
the 


value of the associated delay. 


Interrupts are disabled and the busy bit of this particular 
task is checked. If the busy bit is set the STATUSbyte is 
set to zero and the procedure returns without activating 
the delay. If the busy bit is not set the integer value DIF- 
FERENCE is set 
equal to 
the 
NEW$DELAY value. 


POINTER$Ois set equal to the DELAY$HEAD. POINT· 
ER$1 is set to zero. The DO WHILE loop executes until 
POINTER$Oequals zero or DIFFERENCE is less than 
zero. Remember that the proper place to insert the new 
delay is being searched for, and that will be either at the 
end of the list (POINTER$O=0) or when the sum of the 
previous delays do not exceed the new delay value. The 
DOWHILE loop has POINTER$O,POINTER$1,OLD$DIF- 
FERENCE, and DIFFERENCE keeping track of where 
the procedure is in the loop, while searching for the 
proper place to insert the new delay. The existing delays 
are sequentially subtracted from the remains of NEW$ 
DELAY according to the link-listed order until the end of 
the list or a negative result is encountered indicating 
that the proper delay insertion point has been reached. 
At this point POINTER$Ocontains the task number to be 
assigned to TASK(NEW$TASK).PNTR.POINTER$1 con- 
tains 
the 
task 
number 
immediately 
preceding 
the 


NEW$TASKsuch that TASK(POINTER$1).PNTR= NEW$ 
TASK and our link list is fully updated, with the actual 
delays yet to go. If POINTER$O= 0 it means that the new 
delay is larger than any of the other delays and therefore 
should 
go on the 
end of 
the 
list 
so TASK(NEW$ 


TASK).DELAY is set equal to 
the 
DIFFERENCE. If 


POINTER$Ois not equal to zero then if POINTER$O 
equals POINTER$1(indicating that there were not any 
delays previously listed), then TASK(POINTER$1).PNTR 
is 
set 
equal 
to 
zero. TASK(NEW$TASK).DELAY is 
set 
equal 
to 
the 
OLD$DIFFERENCE 
and 
TASK 
(POINTER$O).DELAYis set equal to the negative of.DIF· 
FERENCE which 
at this 
point 
is negative, thereby 
resulting in a positive unsigned number. The reader is 
encouraged to implement an example (see Delay Struc· 
ture section) to prove that the above approach is vaiid. 
Particular attention should be paid to the contents of 
the two pointers, as they are the key to the procedure. 
The final 
function 
of this 
procedure is to set the 
BUSY$BIT 
and 
DELAY$BIT 
in 
the 
TASK(NEW$ 
TASK).STATUSbyte. The byte named STATUSwhich is 
returned by this procedure is set equal to the status of 
the new task. If it is desired to have interrupts enabled, 
they must be enabled after the procedure return instruc- 
tion. The reason for such a complex method of ac- 
tivating a delay will become apparent in the following 
section. 


The first delay on the linked-list is decremented and, if it 
is zero, the associated task is put on the appropriate 
ready queue. The next delay (if any) is checked to see if 
it is zero and if so, that task is put on the appropriate 
ready queue, etc. A loop is performed until either no 
delay or a non-zero delay is found. The procedure then 
returns. 


It is assumed that this procedure is part of an interrupt 
routine and that the interrupts are disabled during its 
execution. Interrupts cannot be enabled during changes 
to any of the linked-lists or else recovery may not be 
possible. 


This procedure begins by checking to see if there are 
any active delays. If DELAY$HEAD= 0 then this pro- 
cedure returns immediately. Otherwise it decrements 
the first 
delay. If this delay goes to zero then the 
associated task number is passed to the ACTIVATE$ 
TASK procedure as the OFF$DELAY byte. A new 
DELAY$HEAD is chosen from the next link-listed delay 
and that delay checked for a value of zero which will 
happen if the first two or more delays are equal. This 
loop 
is 
accomplished 
by the 
DO WHILE DELAY$ 
HEAD <> 0 AND TASK(DELAY$HEAD).DELAY= 0; This 
procedure is designed to require very little CPU time 
unless a delay times out. The DO WHILE loop is by- 
passed if the resulting delay value is not zero. A certain 
amount of care should be exercised to insure that many 
delays do not all time out at the same time. One method 
would be to modify the ACTIVATE$DELAYprocedure to 
insure that there are no zero entries in the delay bytes. 
The basic procedure, however, assumes that the clock 
"tick" 
timing 
will 
be chosen to minimize the above 
potential problem. 


This procedure performs the function of calling the task 
indicated by the contents of the RUNNING$TASK byte. 
All 
listed 
tasks 
are 
called 
in 
this 
manner. 
The 
CASE$TASKprocedure is called by the DISPATCH pro- 
cedure. When a particular task has completed execution 
it returns to the CASE$TASK procedure which then 
resets the BUSY$BITand the READY$BITand returns to 
the DISPATCHprocedure after setting RUNNING$TASK 
equal to zero. This procedure allows a task to relist itself 
immediately upon returning from execution. 


The PREEMPT procedure is called whenever it is pos- 
sible that a high priority task has been put on the ready 
queue while a low priority task was in the process of 
execution. An example will illustrate: 


Assume that the control system is being interrupted by 
the 60 Hz line frequency and a register is being in- 
cremented each time this 16.67ms edge occurs. When 
the register gets to 60 (indicating that one second has 
passed),the register is zeroed and the high priority time- 
keeping task is put on the ready queue. Assume also 
that a low priority data logging task was running when 
this interrupt occurred. The interrupt routine calls PRE- 
EMPT. If a high priority 
task is running, PREEMPT 
simply returns. But in our example, a low priority task is 
running 
so PREEMPT transfers 
RUNNING$TASK to 
PREEMPTED$TASKand calls DISPATCH, which calls 
CASE$TASK, which calls the time-keeping task. When 
the tillie-keeping 
task has completed, 
it 
returns to 
CASE$TASKwhich returns to DISPATCHwhich returns 
to the PREEMPTprocedure which returns to the inter- 
rupt routine which returns to the interrupted low priority 
data logging task if no other high priority tasks are on 
the ready queue. If the high priority ready queue is not 
empty, any and all high priority tasks will be completed 
before the interrupted routine is returned to. PREEMPT 
refuses to return to the interrupt routine until HIGH$ 
PRIORITY$HEADis equal to zero. It is important to note 
that a low priority task will not be preempted unless the 
PREEMPTprocedure is called. As noted above, it is nor· 
mally called from the interrupt routine which interrupted 
the low priority task, but there is nothing to prohibit 
PREEMPTfrom being called from inside a low priority 
task procedure. 


This procedure calls a high priority 
task if 
HIGH$ 
PRIORITY$HEADis not equal to zero, restores a pre- 
empted task if PREEMPTED$TASKis not equal to zero, 
calls a low priority task if LOW$PRIORITY$HEADis not 
equal to zero, and simply returns if there is nothing to 
do, all in order of priority. The DISPATCH procedure is 
called from the main program loop which must enable 
interrupts as DISPATCHdisables interrupts as soon as 


Because the block structure and let-els are so important 
to the understanding of the following procedures, they 
have been indented according to level. This was a sim- 
ple task accomplished 
by no indenting for level one, 
indenting once for level two, etc. The resulting attrac- 
tive, easy to follow 
format 
was worth 
the effort 
to 


increase the initial level of understanding for readers of 
this application 
note who are not intimately 
familiar 


with PLiM. 


Everything except the very simple main program loop 
has been made into procedures. Interrupt routines and 
tasks are also procedures. Keeping track of interrupts, 
calls, and returns is easy for PLiM and a violation of the 
block structure through such devices as GOTO targets 
outside the procedure body is the best way the author 
knows to crash and burn. Honor the power of the struc· 
ture, accept the limitations 
involved, and checkout and 


debugging will be a pleasure. 


Since CASE$TASK references the individual tasks, the 
task procedure structure was included in the PLlM-86 
compilation. 
All the user has to do is insert the par- 
ticular task code in place of the /"TASKnn CODE"/ com- 
ment, define the interrupt procedures and the system 
should be ready to run. Obviously, the user will desire to 
change the total number of tasks and the number of the 
FIRST$LOW$PRIORITY$TASK. 


The last entry in the PLlM·86 program is the initialization 
process which essentially 
zeros the task multiplexer 


data and the main loop which loops until TRUE= FALSE, 
I.e. forever, with 
interrupts 
enabled. The STATUS= 


STATUS instruction simply insures that the loop can be 
interrupted as the instruction 
following an ENABLE in- 


struction is not interruptible. 


These few instructions are included for information only 
and will need to be expanded considerably for use in a 
real-world 
system. 
The task 
multiplexer 
procedures 


were checked out on an iSBC 86/121M computer running 
under random interrupt control and these instructions 
were the minimum necessary to cause the system to 
run. As was stated earlier, the following 
source code 


does not include any interrupt procedures and these will 
have to be generated following the format explained in 
the PLlM-86 programming manual. 


Hesource allOCatiOnIS a TeaTureThatcould be added to 
the task multiplexer. To keep it simple and yet avoid the 
deadlock problem (two tasks each grab a resource that 
the other needs), an extra array can be added to the 
TASK(n).XXXstructure in which each bit in the byte (or 
word), represents a resource necessary for the execu· 
tion of a task. A RESOURCES$STATUSbyte can then 
keep the dynamic busy status of the system resources 
(printers, terminals, floating point math packages, etc.). 
When 
the 
CASE$TASK 
procedure 
is 
called, 
the 


resources 
required 
by 
the 
next 
RUNNING$ 


TASK can be compared to the RESOURCES$STATUS 
byte to see if the required resources are available. If they 
are, the following 
PLlM·86 statement 
will update the 


new status of the resources: 


RESOURCES$STATUS= 
RESOURCES$STATUS 
OR 


T ASK(RU 
N N ING$T ASK).RESOURCES; 


However, if the resources are not available, the CASE$ 
TASK procedure can return the task to the ready or delay 
list and try again later. When the task has completed, 
the 
following 
PLlM-86 
statement 
will 
update 
the 
resources status byte: 


RESOURCES$STATUS 
= RESOURCES$STATUS 
AND 
NOT 


T ASK(RU 
N N ING$T ASK). 
RESOU 
RCES; 


Message 
passing 
from 
task 
to 
task 
may also 
be 


necessary. Assuming 
that a task will 
have only one 


message at a time to deliver or receive, another byte 
could 
be 
added 
to 
the 
task 
structure 
such 
that 


TASK(RUNNING$TASK).MESSAGE could 
represent 
a 


byte containing 
the number of the task wishing 
to 


deliver a message to the RUNNING$TASK. Since a task 
can call CASE$TASKwhich in turn will call another task, 
message block parameters can be passed directly from 
one task to another. The task that calls CASE$TASK 
must handle the necessary housekeeping involved in 
recovering after 
the message has been passed. Of 


course, the data structure would have to be expanded to 
accommodate the message parameters and blocks. For 
further ideas involving message handling refer to the 
RMX/80™user's guide. 


Two additional 
relatively simple procedures could be 


added to obtain the SUSPENDand RESUMEfeatures of 
the RMX/80™ system. Remember that if the BUSY$BIT 
is set in a TASK(n).STATUSbyte and the task is unlisted, 
then it cannot be listed. If it is desired to dynamically 
enable and disable a task, this bit could be set by a 
SUSPEND procedure and reset by the RESUME pro- 
cedure. 


OECLARE 
TOTAL$TASKS 
LITERALLY' 
10', 
DECLARE 
TRUE 
LITERALLY 
'0FFH', 
DECLARE 
FALSE 
LITERALLY 
'0', 
DECLARE 
BUSY$BIT 
LITERALLY 
'10000000B', 
DECLARE 
REAO~$8IT 
LITERALLY 
'010000008', 
DECLARE 
DELAY$BIT 
LITERALLY, 
'001000008', 
DECLARE 
FIRST$LOW$PRIORIT~$TASK 
LITERALLY 
:6', 


DECLARE 
TASK(TOTAL$TASKS) 
STRUCTUR~(PNTR 
B~TE, 
STATUS 
BYTE, 
DELAY 
BYTE); 


DECLARE 
HIGH$PRIORIT~$HEAD 
BYTE, 
HIGH$PRIORIT~$TAIL 
8YTE; 
DECLARE 
LOW$PRIORITY$HEAO 
BYTE, 
LOW$PRIORIT~$TAIL 
BYTE; 
DECLARE 
RUNNING$TASK 
BYTE, 
PREEMPTED$TASK 
BYTE, 
OECLARE 
STATUS 
8~TE, 
NEW$TASK 
BYTE, 
NEW$DELAY 
BYTE; 
DECLARE 
DELAY$HEAD 
BYTE 
A'r (@TASK(0).,PNTR); 


ACTIVATE$TASK: 
PROCEDURE; 
/* 
ASSUMES 
NEW$TASK<>0 
*/ 
DISABLE; 
; 


IF 
(TASK(NEW$TASK) 
,STATUS 
AND 
BUSY$8IT)<>0 
THEN 
STATUS~0; 
ELSE 
/* 
SINCE 
TASK 
IS 
NOT 
BUSY 
*/ DO; 
IF 
NEW$TASK 
< 
FIRST$LOW$PRIORITY$1ASK 
THEN 
00; 
IF 
HIGH$ PRIORITY$'rAI LO 
0 THEN: DO; 
TASK(HIGH$PRIORIT~$TAIL) 
,PNTR=NEW$TASK;, 


END; 
ELSE 
/* 
SINCE 
HIGH$PRIORITY$TAIL=0 
THEN 
*/ 00; 
HIGH$PRIORITY$HEAD=NEW$TASK; 
END; 
HIGH$PRIORITY$TAIL=NEW$TASK; 
END; 
ELSE 
/* SINCE 
TASK 
IS 
LOW 
PRIORITY 
THEN 
*/ 
00; 
If 
LOW$PRIORITY$TAIL<>0 
THEN 
00; 
TASK(LOW$PRIORITY$TAIL) 
,PNTR=NEW$TASK;, 


END; 
ELSE 
/* 
SINCE 
LOW$PRIORITY$TAIL=0 
THEN 
*/ 
DO; 
L'OW$PRIORII'Y$HEAD=NEW$TASK; 
END; 
LOW$PRIORITY$TAIL=NEW$TASK; 
END; 
TASK (NEW$TASK) 
,PNTR=0; 
TASK(NEW$TASK) 
,STATUS=TASK(NEW$TASK) 
,STATUS 
OR 
BUSY$BIT 
OR 
READY$BIT; 
STATUS=TASK(NEW$TASK) 
,STATUS; 
END; 
NEW$TASK=0; 
RE'rURN ; 
END 
ACTIVATE$TASK; 


ACTIVA'rE$DELAY: 
PROCEDURE;/*ASSUMES 
NEW$TASI<, 
NEW$DELAYOeJ*/ 


DECLARE 
POINTER$0 
BYTE, 
POINTER$l 
BYTE; 
DECLARE 
OLD$DIFFERENCE 
INTEGER, 
DIFFERENCE 
INTEGER; 
DISABLE; 
IF 
(TASK(NEW$TASK) 
.STATUS 
AND 
8USY$BIT)<>eJ 
THEN 
STATUS=eJ; 
ELSE 
/* 
SINCE 
TASK 
IS 
NOT 
BUSY 
*/ 
DO; 
DIFFERENCE=INT(NEW$DELAY) 
; 
POINTER$eJ=DELAY$HEAD; 
POIN'rER$l=eJ ; 
DO 
WHILE 
POINTER$eJ<>eJ AND 
DIFFERENCE>eJ; 


OLD$DIFFERENCE=DIFFERENCE; 
DIFFERENCE=DIFFERENCE-INT(TASK(POINTER$eJ) 
.DELAY) ; 
If 
DIFFERENCE>eJ 
THEN 
DO; 
POINTER$I=POINTER$0; 
POINTER$eJ=TASK(POINTER$l) 
.PNTR; 
END; 
END; 
TASK(NEW$TASK) 
.PNTR=POINTER$eJ; 


TASK(POINTER$I) 
.PNTR=NEW$TASK; 
IF 
POINTER$eJ=eJ THEN 
TASK(NEW$TASK) 
.DELAY=LOW(UNSIGN(DIFFERENCE)); 


ELSE 
/* 
SINCE 
DIFFERENCE<eJ 
THEN 
*/ 
DO; 
If 
POINTER$eJ=POINTER$l 
THEN 
TA3K(POINTER$l) 
.PNTR=eJ; 
TASI«NEW$TASK) 
.DELAY=LOW(UNSIGN(OLD$DIFFERENCE)); 
TASK(POINTER$0) 
.DELAY=LOW(UNSIGN(-DIFFERENCE)); 
END; 
TASI«NEW$TASK) 
.STATUS=TASK(NEW$TASK) 
.STATUS 
OR 
BUSY$BIT 
OR 
DELAY$BIT; 
STATUS=TASK(NEW$TASK) 
.STATUS; 
END; 


NEW$'rASK=eJ; 
NEW$DELAY=eJ 
; 
RE'rURN; 
END 
ACTIVATE$DELAY; 


DECREMENT$OELAY: 
PROCEDURE; 
/* 
ASSUMES 
INTERRUPTS 
DISABLED 
*/ 


OECLARE 
OFF$DELAY 
BYTE; 
IF 
DELAY$HEAD<>eJ 
THEN 
DO; 
TASK(DELAY$HEAD) 
.DELAY=TASK(DELAY$HEAD) 
.DELAY-I; 
DO 
WHILE 
DELAY$HEAO<>eJ 
AND 
TASK(DELAY$HEAD) 
.DELAY=eJ; 
OFF$DELAY=DELAY$HEAD; 
DELAY$HEAD=TASK(DELAY$HEAD) 
.PNTR; 
TASK(OFF$DELAY) 
.STATUS=TASK(OFF$DELAY) 
.STATUS 
AND 
NOT(BUSY$BIT 
OR 
DELAY$BIT); 
NEW$TASK=OFf$DELAY; 
CALL 
ACTIVATE$TASK; 
END; 
END; 


RETURN; 
END 
DECREMENT$DELAY; 


CALL 'rASK01; 
CALL TASK02; 
CALL'rASK01; 
CALL TASK04; 
CALL 'rASK05; 
CALL 'rASK06; 
CALL 
TASK07; 
CALL 'rASK08; 
CALL 
TASK09; 
END; 
TASK(RUNNING$TASK) .STATUS=TASK(RUNNING$TASK) 
.STATUS AND 
NOT 
(BUSY$8IT OR 
READY$BIT); 
TASK(RUN~ING$TASK) .PNTR=0; 
IF RUNNING$TASK=NEw$TASK 
THEN DO; 
IF NEW$DELAY<>0 
THEN 
DO; 
CALL 
ACTIVATE$DELAY; 
END; 
ELSE /* SINCE NEW$OELAY=0 
*/ DO; 


CALL ACTIVATE$TASK; 
END; 
END; 
RUNNING$'rASK=0; 
RETURN ; 
END CASE$TASK; 


PREEMPT:PROCEDURE 
REENTRANT; 
/* ASSUMES 
INTERRUPTS 
DISABLED 
*/ 


If PREEMPTED$TASK=0 
THEN 
DO; 
IF 
(HIGH$PRIORITY$HEAO<>0) 
AND 
(RUNNING$TASK>= 
FIRST$LOW$PRIORITY$TASK) 
THEN 
DO; 
PREEMPTED$TASK=RUNNING$TASK; 
RUNNING$TASK=0; 
DO WHILE 
PREEMPTED$TASK<>0; 


CALL 
DISPATCH; 
END; 
END; 
END; 
RETURN ; 
END PREEMPT; 


--_._---, 
IF HIGH$PRIORITY$HEAD<>0 
THEN 
DO; 
RUNNING$TASK=HIGH$PRIORI~Y$HEAD; 
aIGH$PRIORITY$HEAD=TASK(RUN~ING$TASK) 
.PNTR; 
IF HIGH$PRIORITY$HEAD 
= 0 THEN 
HI~H$PRIORITY$TAIL 
0; 
CALL CASE$'rASK; 
END; 
ELSE 
IF PREEMPTED$TASK<>0 
THEN 
DO; 
RUNNING$TASK=PREEMPTED$TASK; 
PREEMPTED$TASK=0; 
END; 
ELSE IF LOW$PRIORITY$HEAD<>0 
THEN 
DO; 
RUNNING$TASK=LOW$PRIORI~Y$HEAD; 
LOW$PRIORITY$HEAD=TASK(RUNNI~G$TASK) 
.PNTR; 
IF LOw$PRIORITY$HEAD 
= 0 THEN LOW$PRIORITY$TAIL 
0; 
CALL CASE$'rASK; 
END; 
ELSE RETURN; 
RETURN ; 
END DISPA'rCH; 


TASK01: 
PROCEDURE 
REENTRANT; 
ENA8LE; 


DISABLE; 
RE'rURN; 
END 
TASK01; 


TASK02: 
PROCEDURE 
REENTRANT; 
ENABLE; 


DISAt3LE; 
RETURN 
; 
END 
TASK02; 


TASK01: 
PROCEDURE 
REENTRANT; 
ENA8LE; 


DISA8LE; 
RE'fURN ; 
END 
'I'ASK01; 


'rASK04: 
PROCEDURE 
REENTRANT; 
ENABLE; 


DISABLE; 
RETURN 
; 
END 
'fASK04; 


TASK05: 
PROCEDURE 
REENTRANT; 
ENABLE; 


DISABLE; 
RE'rURN; 
END 
'rASKl?J 
5; 


TASK06: 
PROCEDURE 
REENTRANT; 
ENABLE; 


DISABLE; 
RE'fURN ; 
END 
TASK06; 


TASK07: 
PROCEDURE 
REENTRANT; 
ENABLE; 


DISABLE; 
RETURN 
; 
END 
TASK07; 


rASK08: 
PROCEDURE 
REEN~RANT; 
ENABLE; 


DISABLE; 
RE'rURN; 
END 
TASKI!l8; 


'rASKI!l9:PROCEDURE 
REENTRANT; 
ENABLE; 


DISABLE; 
RETURN ; 
END 
'rASKI!l9; 


DISABLE; 
DO 
STATUS=1!l TO 
9; 
TASK(STATUS) 
.PNTR=I!l; 
TASK(STATUS) 
.STATUS=I!l; 
TASK(STATUS) 
.DELAY=I!l; 
NEW$TASK,NEw$DELAY=I!l; 
HIGH$PRIORITY$HEAD,HIGH$PRIORITY$TAIL=I!l; 
LOW$PRIORITY$HEAD,LOW$PRIORITY$TAIL=I!l; 
RUNNING$TASK,PREEMPTED$TASK=I!l; 
END; 


DO 
WHILE 
TRUE<>FALSE; 


CALL 
DISPATCH; 
ENAdLE; 
STA'rOS=STA'rUS; 
END; 


3. Wirth, Nicklaus, Algorithms + Data Structures = Programs, Prentice·Hall, Englewood, N.J., 1976. 


4. "PLlM·86 Programming Manual," Intel Corporation, 
1978, manual order number 9800466A. 


5. "RMXl80 User's Guide," Intel Corporation, 
1977, manual order number 98005228. 
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available. This 1/0 processor (lOP) makes available the 
power of 1/0 channels, as used in mainframes and mini- 
computers, in a microcomputer form. Designed as part 
of the MCS-86™ family, the lOP can be interfaced with 
the MCS-80™and MCS-85™families as well. 


An 1/0 channel is basically a processor remote from the 
main CPU, which 
independently 
runs I/O operations 


upon command of the CPU. To relate the 8089 to ex- 
isting LSI components, it is similar to a microprocessor 
that is time-multiplexed with a DMA controller, but with 
two channels available. However, since the 8089 proc- 
essor is optimized for 
1/0 
and multiprocessor 
opera- 


tions, and the DMA has been made much more flexible 
than existing DMA controllers, a truly general purpose 
and powerful I/O control system is available on one chip. 


Due to the uniqueness of the 8089, this application note 
was written to review debugging strategies and point 
out possible pitfalls 
when developing an lOP system. 
Debugging an lOP system is very similar to debugging 
microprocessorlDMA 
controller systems, and many of 


the techniques described here are standard microproc- 
essor techniques. However, several factors are present 
which can complicate the debugging process: 


1. Multiprocessor Operation 


Although 
usable by itself, the lOP is designed to be 


used with other processors. All factors 
normally en- 


countered with multiprocessor operation, including bus 
arbitration, processor communication, critical code sec- 
tions, etc., must be addressed in the design and debug 
of an lOP system. 


2. DMA Tle·ln to lOP Program Execution 


The relationship 
between lOP program execution and 


DMA transfers and termination is different from earlier 
DMA controllers and should be fully understood to prop- 
erly run the system. 


3. Dependency of Programs on Real·Tlme 1/0 


Operations 


Requirements by I/O devices for maximum data rates 
and minimum latency times force the software program- 
mer to be aware of hardware timing constraints and can 
complicate program debugging. 


4. Dual Channel Operation 


Related 
to 
multiprocessor 
operation 
and 
real-time 


dependencies, the two independent channels available 
on the 8089 may have to be coordinated with each other 
to make the whole system function. Dependence of one 
channel on the other can also complicate debugging. 


Due to the complexities 
of running in a real-time envi- 


ronment, as many steps as possible should be taken to 
facilitate debugging. A major help here is to make sure 
as much of the hardware and software as possible is 
working before running real-time tasks. This is a good 
practice anyway, but it should be reemphasized that a 
complex multichannel 
system can quickly get out of 


hand if more than a few things are not right. 


modular software Interfaces to tne nost l,;I"U, via me 
linked-list 
initialization 
structure, and parameter com- 


munication 
through 
the 
parameter 
block 
(PB) area. 


Some of the aspects of structured programming that aid 
debugging are: 


• 
Top Down Programming 
- 
The functions done by 


low-level routines 
are well 
understood, 
and the 


number of program fixes, which can cause more 
errors, is minimized. 


• 
Program Modularity 
- 
Small, easy to manage sub· 


programs can be debugged independently, increas- 
ing the chance that the entire system will work the 
first time. 


• Modular 
Remoteness 
- 
By having all program 


modules communicate only through a well-defined 
interface, one module's knowledge of the "inner 
workings" 
of another is minimized. System soft- 


ware complexity 
is reduced. Updates to program 


modules are more reliable, too. 


Two major areas of debugging will be outlined here - 
static (or functional) debugging in which the hardware 
and software are not tested in a real·time environment, 
and real-time debugging. Applying a logic analyzer to 
lOP debugging will also be explained, and a review of 
lOP operation and potential problems will be done. 


STATIC (OR FUNCTIONAL) DEBUGGING 


The predominant errors in a system, when first tried out, 
are,either errors in implementation (i.e., wrong hookups 
or coding errors), or an incorrect 
implementation 
(a 


wrong assumption somewhere). Most of these bugs can 
be found through static debugging techniques that are 
usually easier to work with than real·time testing. 


Hardware Testing 


Static hardware testing is done mainly to see if all indi- 
vidual parts of the system work, so the whole system 
will "play" when run. The level of testing can run from 
checking for continuity 
and shorts (which finds only 


hookup errors) to tryi ng to move data around and run- 
ning I/O devices from a monitor or special test programs 
(which can also find incorrect circuit design). In all but 
the simplest 
systems, the latter approach Is recom- 


mended since it is a step towards software debugging. 


Several approaches to hardware testing will be covered. 
Running diagnostic programs (such as a monitor) out of 
the lOP's host system, in both the LOCAL and REMOTE 
modes, will be covered. The case where the host system 
cannot support diagnostic software and must have an 
external processor to exercise the lOP and its perlph· 
erals will also be explained. 


The case where the host system can run diagnostics or 
test programs that have interactive user I/O, such as a 
CRT terminal or teletype, provides the most straightfor- 
ward way to test the lOP. Naturaliy, before these pro- 
grams can be run, the basic hardware must be correct 
enough to run programs. When this point is reached, a 
monitor program can be used to exercise memory and 
I/O controllers on the system bus. 


It should be mentioned that aids, other than just testing 
with 
software, 
are 
helpful 
for 
hardware 
debugging. 


While 
a necessity 
for 
real-time 
debugging, 
a logic 
analyzer 
is also 
a definite 
help" for 
static 
hardware 
debugging. 
Its main use in hardware debugging is show- 
ing timing relationships 
between address or data paths 


and other signals. 
It is especially 
useful for functional 
software 
debugging, 
to be described 
shortly. 
The last 


\lebugglng 
se'ction outlines 
the use of an analyzer with 


the lOP. Of course, an oscilloscope, 
logic probes and 


pulsers, etc., can be used to trace out specific 
logic or 


timing problems. 


LOCAL Mode 
, 


When the lOP is running in the LOCAL Mode, all I/O con- 
trollers 
and memory are accessible 
by the host or con· 
trolling 
CPU. Thus a standard monitor, such as the one 


supplied 
with 
the SDK-86 or available 
for the 
ISBC- 


86112™ development 
kit, can exercise 
all hardware on 


the bus.· 
The breakpoint 
routines, 
however, will 
not 


work due to the different 
Instruction 
set. The 8086 or 


8088 is best suited for running the lOP in the LOCAL 
mode due to identical 
status 
lines and bus timing, 
as 


well as the-Reque~tIG~ant line, which eliminates 
bus ar· 


bitration 
hardware. Figure 1 shows the general LOCAL 


mode configuration. 


·The SDK·86 serial monitor 
is a good basis for a general 8086 monitor. 


The lOP cannot 
be used directly 
with 
the SDK-86, since the 60861s run- 


ning in the minimum 
mode. The SDK-86 can be converted to run in the 
maximum 
mode, 
if desired. 


REMOTE 
Mode 


From a system 
design 
standpoint, 
running 
the lOP in 


the REMOTE Mode is advantageous 
in that it removes 


the I/O bus cycles from the system bus. Normally, the 
remote I/O is not accessible 
to the host CPU. Until the 


lOP is able to run its own test programs to transfer data 
from the REMOTE bus to the system bus, I/O controllers 
and memory on the REMOTE bu.s will be invisible to the 
host. To get around this problem 
during 
prototyplng, 


either an external processor 
interface can be used (see 


next section), or a temporary bypass can be made to ac- 
cess the REMOTE bus from the system bus. 


Bypassing 
the normal REMOTE/SYSTEM interface 
is a 


handy technique 
for doing preliminary 
debugging on the 


REMOTE bus. This can be done by memory-mapping 
the 


lOP's 
I/O space 
into 
an unused 
portion 
of the 
host 


CPU's 
system 
memory' space. 
When 
accessing 
this 


space, the lOP access to its own I/O space Is disabled, 
and a separate set of address buffers, transceivers 
and 


bus control signal buffers are enabled. Reads and writes 
can then be done to the formerly inaccessible 
REMOTE 


bus by the host CPU. 


A simple system (Figure 2) implements 
this bypassing 


scheme. 'It was designed 
for just forcing 
or examining 


devices on the REMOTE bus and may not read or write 
correctly 
if the lOP is simultaneously 
trying to do bus 


cycles. A more sophisticated 
arbitration 
system would 


permit reliable Tun-time checking 
also. 
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Running the lOP in the REMOTE mode, particularly 
if 
the MULTIBUS™ protocol is adhered to, has the advan- 
tage that the lOP can be exercised with any MULTIBUS- 
compatible 
processor. 
If the main processor 
is not 
amenable to ,being used as a debugging tool, another 
processor could be used to debug the hardware inter- 
face. If the microprocessor 
is of the same type as the 
intended host processor, software debugging 
can be 
done as well. A generalized REMOTE mode configura- 
tion using the MULTIBUS is shown in Figure 3. 


External Processor Interface 


A technique thai can be used if the host processor can· 
not run any debugging or monitor routines is to have an 
external processor tie into the host processor's 
bus. 
This is useful if the main system CPU cannot run an in- 
teractive 
monitor 
or other debugging 
programs. If a 
MULTIBUS interface is being used, an 8289 bus arbiter 
and a set of address/data/control buffers can be used. A 
somewhat simpler system, similar to the remote bus ac· 
cess system mentioned above, could be used for static 
debugging of non-MULTIBUSsystems. Again, if true bus 
arbitration is added (which brings us nearly to a MULTI· 
BUS interface), 
it could 
also 
be used for 
run·time 
testing. 
Intel 
processors 
that 
have the 
MULTIBUS 
interface include the iSBC·80/20™, iSBC-86/12™, iSBC· 


80/10™, 
iSBC-80/0S™, 
the 
Intel lee'" development 


systems, among others. 


In the previously described systems, the external proc· 
essor would disable the host CPU's access to the bus, 
either by some form of bus request or by a "brute force" 
disabling of the CPU's buffers. In the latter case, the ex· 
ternal processor could only control 
the bus during a 


time that the CPU is halted, without destroying the pro· 
gram flow. Mapping the processor's memory space into 
the external processor memory space is the simplest 
method, but can impact programs being run on the 
external processor. If the processor under test utilizes 
the MULTIBUS interface (with bus arbitration), 
then a 


processor like the iSBC-80/30™ or iSBC·86/12™ could 
be used as the debug vehicle with no special hardware. 
A more flexible interface that would have less impact on 
the system memory space would have the addresses for 
the system under test generated from latches loaded by 
the I/O instructions 
from the external processor. This 


case must have software routines to interface to the I/O 
ports and handle the desired debugging routines (see 
Figure 4). 


Software Testing 


It is desirable to check as much of the lOP program as 
possible statically, 
since various tools and techniques 


are available which may not be usable during real·time 
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testing. This "static" 
software testing is not applicable 
to heavily I/O-dependent or DMA-dependent routines, 
but is best suited to longer computational or data han- 
dling routines. The idea is to test the correctness of 
algorithms, rather than seeing if the whole system runs. 


There are two main approaches to functional software 
testing. One is to essentially run the program in real 
time and monitor program flow on a logic analyzer. The 
difference between this and real-time testing is that pro- 
gram subsections can be tested separately by using dif- 
ferent 
TP (Task Pointer) starting 
addresses. If it is 
necessary to set up certain registers or parameters in 
memory, a small "setup" 
program can be run after ini- 
tialization, which can load up registers or memory, then 
jump to the program section desired. 


Another technique is to run the programs with break- 
point 
routines 
so that one can step through 
code 
segments 
and follow 
program execution. 
Software 
breakpoints 
are usually implemented 
by inserting 
a 
jump or restart to a monitor routine at the breakpoint 
location. 
This jump 
or restart 
is machine language 
dependent so, unfortunately, 
the existing 
breakpoint 
routines within monitors for the 8080 or 8086 are not 
applicable. 


New routines tailored to the 8089 can be used, and, if 
done properly, can even be used to examine programs 
running on a REMOTEbus. Using breakpoints is some- 
what complicated on the 8089 because the minimum in- 
struction length is two bytes. There is no absolute CALL 
instruction, 
only a relative one (which would have to 
have its displacement 
recalculated each time it was 
used). But, with a several-byte absolute jump inserted at 
each place a breakpoint 
is desired, full 
breakpoint 
capabilities can be obtained. 


There are many ways the breakpoints can be imple- 
mented. When a breakpoint is reached, the 8089 itself 
could output the machine state to a c'onsole through its 
own routines. Better suited to debugging, though, is a 
system that has the 8089 place its machine state in 
memory, alert the host processor, and then halt. The 
host then picks up the 8089's state and can treat it in the 
same way it runs its own breakpoint routines. Since the 
host processor is more likely to be running a monitor or 
some other kind of debugging routine (and most likely 
has at least temporary console I/O),it is the logical sys- 
tem to initiate and examine 8089 breakpoints. If the lOP 
is running in the REMOTEmode, and the host processor 
has access to the 1/0 bus via the scheme mentioned in 
the hardware debugging section, then lOP programs 
running on the REMOTEbus can be examined. 


The breakpoint itself can consist of an escape sequence 
that is used to save the TP value and jump to the save 
routine, or just a jump to the save routine. This routine 
saves all register contents for the channel the break- 
point is in, signals the host processor, and stops the 
lOP. All user programmable registers (GA, GB, GC, IX, 
MC, BC, TP),as well as the pointer tags, are accessible. 
The PP (Parameter Pointer) and PSW are not normally 
accessible, but if the generation of the CA is such that 
the lOP can send itself a CA, then by sending a CA 
HALT, the PSW will appear at PP+ 3. Remember that 


since the lOP doesn't have arithmetic or logical condi- 
tion codes, the PSW is not as important as in other 
machines. 


The most straightforward way to pass data from the lOP 
to the host processor is through the PB (Parameter 
Block) area since the PP will normally remain relatively 
fixed throughout the lOP program. In order not to in- 
fringe on the PB areas used by the programs, an area 18 
bytes long should be allocated at the end of the PB 
block to hold the register contents. Using other areas to 
store the register data requires saving and reloading a 
pointer 
register 
as part 
of 
the 
breakpoint 
escape 
sequence. 


The data returned from the breakpoint save routine will 
appear to the host processor as a sequential block of 
data in the PB area. Sixteen-bit data can easily be ex- 
tracted, 
but 
20-bit 
pointer 
data 
will 
have 
to 
be 
reconstructed from the move pointer (MOVP)format: 


7 
07 
07 
0 


HIGHEST ~~---------I 
LOWEST 


ADDRESS 
~ 
__ 
D_15_,,_,D_8 
0_7_,,_,0_0_ 
ADDRESS 


TAG 
BIT 


O=SYSTEM 
l=UO 


Several means are available to signal the host processor 
that a breakpoint has been reached. A bit could be set in 
memory or an interrupt sent to the CPU. The best way, 
though, is to use the BUSY flag (at CP+ 1 or CP+ 9). 
After starting the lOP, the BUSYflag is set to FF. When 
a breakpoint 
is reached, the lOP performs 
its save 
routine and does either a software or CA HALT. These 
result in clearing the BUSYflag, which then signals the 
CPU to obtain valid breakpoint data. The CPU can then 
restart the lOP by either a CA START or CA CONTINUE. 


The breakpoint routine outlined above will work for a 
"one-shot" 
test. 
However, to be more useful 
as a 
general purpose debugging 
tool, 
some refinements 
must be added. To keep from destroying the program 
whenever a breakpoint is placed, the supervisory pro- 
gram running from the host processor must save the 
lOP code that is occupied by the escape sequence. 
When the breakpoint is completed and lOP execution is 
to resume, the host program restores the lOP code, sets 
the TP in the CB area back to where the breakpoint was 
placed, and sends a CA START,Since the length of each 
instruction can be easily found from bits 1-4 of the op- 
code, a single stepping function can also be done.· By 
the time this 
is implemented, 
the host program is 


becoming a full-fledged debugging routine. Appendix 3 
describes a debugging program that makes use of tlie 
ideas presented here. 


Breakpoint 
routines 
can be quite 
useful, 
but some 
restrictions 
and limitations 
should be mentioned. The 


processor examining the breakpoints must have access 
to the lOP program memory, either directly, or through 
lOP programs that simulate direct access. The program 
memory must be in RAM. The breakpoint 
must be 


placed on an instruction boundary, and multiple break· 
points must not be placed so that they overlap. There 
may be some impact on the PBarea. CA generation may 
have to be different 
than usual. But, despite 
these 


limitations, the breakpoints offer a useful and more con· 
ventional software debugging tool than analyzers. 


REAL·TIME TESTING 


Running an lOP program in its final environment with 
real I/O devices is the true test of dynamic operation. 
The program is no longer in a static, isolated environ· 
ment. The demands of DMA and multiprocessing 
may 


reveal unplanned timing dependencies or critical sec- 
tion problems. There may also be sections of hardware 
or software, which couldn't 
be tested statically, 
that 


may have bugs. The whole purpose of static or func· 
tional testing is to dig these problems out while con- 
venient debugging tools can be used. Since there are no 
simple techniques for real·time debugging, the use of a 
logic state analyzer and techniques to fully understand 
the lOP's real-time operation will be emphasized. 


Multiprocessing operations and real·time asynchronous 
I/O requests can cause the timing complexity 
of the 


system as a whole to rise beyond the point of complete 
comprehension by an individual. It is then essential that 
techniques to ensure correctness are used. These in· 
elude good design methods, especially a clean, well- 
structured design, as well as good testing. A thorough 
test requires the attitude 
that the system should be 


tested for failures, rather than tested for correctness. In 
other words, one should try to make the system fail, 
tests should be chosen that will put the worst stress on 
critical timing areas. 


The best way to do this is to write a diagnostic program 
that puts the CPU, lOP, and I/O devices through the 
worst conceivable timing and program combinations. 
Ideally, the program should be self·checking so that it 
can be run without supervision, printing any data or pro- 
gram errors that occur, much like a memory test. 


The two main real-time problem areas are insufficient 
data rates or latency, and critical section problems. To 


test for data rate problems, run the system clock at its 
lowest expected frequency and use memory and I/O 
with 
maximum 
expected 
wait 
states. 
Identify 
the 


tightest program timings and try to have these sections 
coincide with worst case DMA or other heavy bus utiliza· 
tion (see dual channel operation later). Critical section 
problems can occur when two independent processors 
communicate with each other with improper "handshak· 
ing." This can result in one processor missing another's 
message, or even having both processors 
hang up, 


waiting for each other to go ahead. The 8089 provides 
aids to these problems, including the TSL instruction (to 
implement semaphores) and the BUSY flag. However, 
any interprocessor communication (including one chan· 
nel of the lOP to the other) should be checked. Beware 
of cases when one processor is running considerably 
slower than the other (due to DMA overhead or chained 
instruction sequences). 


The techniques 
for real-time debugging 
evolve from 


functional testing using a logic analyzer. For all but the 
simplest systems, an analyzer is essential, since it can 
graphically show program execution and timing 
rela· 


tionships during real-time execution. Another aid is a 
delayed oscilloscope. 
Triggering the scope from the 


logic analyzer, the delay can be adjusted so that any 
signal in the system can be monitored. 


To facilitate the use of the logic analyzer, especially if 
its memory is not very deep or when using it to trigger 
an oscilloscope, a repetitive system can be used to con· 
tinually 
update the display. Using a repetitive 
reset 


helps to debug the software-hardware interface, since 
oscilloscope 
or logic analyzer probes can be readily 


moved around the 
circuit 
to 
observe new signals 


without 
manually 
retriggering 
the 
display. 
At 
its 


simplest, the reset to the host processor can be strob· 
ed, say every 10 ms. The processor will then provide the 
two channel attentions 
(CAs) that are needed to in· 


itialize the lOP.Where this isn't feasible, the CAs can be 
externally forced by either a string of one·shots or a sim· 
pie processor with timing loops (such as a SDK·85 or 
SDK·86).See Figure 5 for initialization timing. 
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Memory protection of the lOP and system programs is 
helpful when debugging DMA operation. It is quite easy 
for runaway DMA to wipe out mem,ory.Another precau:, 
lion to avoid this problem is to set an upper limit on the. 
number of bytes transferred by always specifying a byte 
count termination. 


Logic Analyzer Techniques 


In the absence of other powerful debugging systems, 
the logic analyzer has shown to be an extremely useful 
tool. Because of its importance in debugging an lOP 
system, some basic techniques and' observations that 
relate to monitoring lOP operation will' be reviewed here. 
The particular brand or type of analyzer used is not too 
important, but would be desirable to have the following 
features: 


• At least a 24·bit data width 
• Flexible triggering and qualification 
control 
• Display after triggering on a sequence of states 
• Capability for hexadecimal data display 


It is best to hook up to the address/data lines at the lOP, 
as opposed to looking at the separate address and data 
lines, since 39 lines would be required just to look at ad· 
dress, data and status lines. The three lower status lines 
should be monitored to show the type of bus cycle be· 
ing run. Other lines can be connected where needed, at 
places like the DRQ lines, the EXT lines or other lines' 
related to the system, 


For general purpose debugging, triggering the analyzer 
on the rising edge of the lOP clock shows the most 
useful data concerning bus cycles. Of course, using the 
falling edge may be necessary to check certain signals, 
particularly ones that are active only while the clock is 
low., The following 
discussion 
is based on sampling 
data on the clock's rising edge. 


One should be careful when setting up the triggering 
for the analyzer that the desired event is what is dis· 
played and not a later event with the same trigger word. 
This can happen when the logic analyzer is in the repet· 
itive trigger mode. It may retrigger before the system ac· 
tually resets. A sequence restart feature is helpful. 


The basis of following program execution and DMA on a 
logic analyzer is to follow an 8089 bus cycle, which is 
identical to a 8086 and 8088 bus cycle. The following 
diagram shows a typical 8089 bus cycle. 


For general purpose debugging, displaying every clock 
is usefUl, but for quickly finding one's way around a pro· 
gram, the analyzer can be qualified so that only instruc· 
tion fetches (status = 100 or 000), with ALE active, are 
trapped. A much more compact display of execution 
flow results. 
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As mentioned earlier, on a 16·bit bus, most instructions 
starting on odd addresses won't show the first fetch, 
since the internal q4eue is in use. It is a good idea in 
that case to use only even instruction 
boundaries as 
trigger words. When following dual channel operation, 
one should keep an eye on the upper status bits (53-56), 
since 53 indicates which channel is running (0=CH1, 
1= CH2), and 
54 
indicates 
DMA/non-DMA 
transfer 
(0= OMA, 1= non·DMA). 


A REVIEWOF lOP OPERATION 
(With things to look out for) 


When trying to get an unfamiliar system going for the 
first time, it is too easy to stumble on apparent prob· 
lems that are really just unexpected operation modes or 
peculiarities 
of the machine. For this reason the basic 
principles of lOP operation will be reviewed here with 
special emphasis on possible problem areas or pitfalls 
that a user might encounter when aebugging a 8089 sys· 
tem. The topics are covered generally in the order en- 
countered when bringing 
up a system. For complete 
details of operation and some design examples, see the 
8086 Family User's Manual. 


RESET 


RESET must be active (HIGH) for at least four clocks in 
order to fully initialize all internal circuitry. On power up, 
RESET should be held high for at least 50 microsec- 
onds. The chip is only ready to accept a Channel Atten- 
tion (CA) one clock after RESET goes inactive. 


Note that the SEL pin is sampled on the falling edge of 
the first CA after RESETto tell the 8089 whether it is a 
master (0)or a slave (1)for its request/grant circuitry. If a 
master, it will assume it has the bus from the beginning. 
If a slave, it will strobe the RQ/GT Line to request the 
bus back and will not start any bus transfers until it has 
been granted the bus. If the RQ/GT line is not being 
used, make sure the lOP comes up in the master mode. 


Initialization 


Upon the first CA after reset, a sequence of instructions 
is executed from an internal ROM. These instructions 
pick up parameters and load data from the linked list 
sequence (Figure 6). The instruction sequence is essen· 
tially: 


MOVB SYSBUS from FFFF6 
LPD System Configuration 
Block (SCB) from PFFF8 
MOVB sac from (SCB) 
LPD Control Pointer (CP) from (SCB)+ 2 
MOVBI "00" to CP+ 1 (clears BUSY flag) 


Remember that four bytes must be fetched during an 
LPD. If on a 16·blt bus, with even addressed boundaries, 
only two fetches are needed. Otherwise (8-blt bus or odd 
boundaries), four fetches are needed. 


Even though no bus cycles are run to fetch these in- 
structions, the CH1 Task Pointer (TP)appears on the ad- 
dress latches during the short internal fetch periods. 
On power up, this value is meaningless, but if a repeti- 
tive RESETIs used, the TP remains unchanged from the 
end of the last program run. See Figure 6 for the start of 
a typical initialization 
sequence as viewed on a logic 
analyzer. 


Bit 0 in the SYSBUS field sets the actual (or physical) 
system bus width that the lOP expects. In the 8-bit 
mode, only byte accesses are made, and all 8-bit data 
should appear on the lower eight data lines. In the 16-bit 
mode, word accesses can be made (if the address is 
even), all data on even addresses appears on the lower 
eight data lines, and all data at odd addresses appears 
on the upper eight. 


Bit 0 in the SOC field sets the physical width for the I/O 
bus. The same rules for the system bus apply here. Note 
that these bits should reflect the actual hardware imple- 
mentation and are not to be confused with the oMA logi- 
cal widths set by the WID instruction. 


The R bit (bit 1) in the SOC field is used to change the 
mode of the RQ/GT circuitry. When the lOP is on the 
same bus as an 8086, it is required to havethe R bit be 0, 
with the 8086 as the master and the 8089 as the slave. 
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T1 
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14 


The master (8086 or 8088) can never tab 
the-bus away 
from the slave (8089);only the slave can give back the 
bus. In other words, during oMA transfers, the 8089 
would not have the bus taken away. This is the only 
mode compatible with the 8086 or 8088. 


When two lOPs are being used on the same bus, the 
RQ/GT circuity can be put into an equal priority mode 
by setting the R bit to one. A slave can only be granted 
the bus if the master is doing unchained instructions or 
running idle cycles. The master can request the bus 
back from the slave at any time. The slave grants it if do- 
ing unchained instructions 
or if it is idling. The master 
and slave are put on essentially the same priority. 


At the end of initialization, 
the "BUSY" flag of CH1 is 
cleared. For systems where the 8086 is waiting for the 
initialization sequence to end before giving another CA, 
it can set the BUSY flag high prior to initialization. The 
BUSY flag going low is a sign that the lOP is ready for 
another CA. It is important to remember that the lOP will 
not respond to, nor latch, a CA during an initialization 
sequence. 


Channel Attentions 


The main system processor initiates communications 
with the lOP through the Channel Attention (CA)line. As 
mentioned earlier, the first CA after system RESET in- 
itializes the lOP. All subsequent CAs cause the lOP to 
do a two-step process. It first fetches the Channel Con- 
trol Word (CCW)from the appropriate channel at (PP)for 
channel 1 or (PP+ 8) for channel 2. (SEL at the time of 
CA falling determines the channel for all following ac- 
tions.) The lower three bits of the CCW Command Field 
(CF) are examined and then cause the lOP to execute 
the desired function. 


Command Field (CF) 


Control 
of 
task 
block 
programs 
is 
accomplished 
through the command field. The various CF functions 
are: 


CF 


000 - 
Examine other field only and set BUSY flag 
001 - 
Start task program in I/O space 
011 - 
Start task program in system memory 


The start command causes the following instructions 
to be executed out of the internal ROM: 


LOP CP from (CP)+ 2 (CH1) or + 10 (CH2) 
LOP TP from (PP)(for TP in system) or 
MOVB TBP from (PP)(for TBP in I/O) 
MOVBI "FF" to (CP)+ 1 or + 9 (set BUSY flag) 
111 - 
HALT channel. BUSY flag cleared to "00" 


110 - 
HALT channel. Save state of machine and 
clear BUSY flag by executing: 
MOVP TP to (PP) 
MOVB PSW to (PP)+ 3 
MOVBI "00" to (PP)+ 1 or + 9 


The channel will 
HALT and the machine will con- 
tinue execution on the other channel or go to Idle if 
the other channel is idle. 


101 - 
Continue 
channel. 
The channel 
is 
revived 
after a HALT by executing: 
MOVP TP from (PP) 
MOVB PSW from (PP)+ 3 
MOVBI "FF" to (CP)+ 1 or +9 


(set BUSY flag) 


Do not do a CONTINUE after initialization without doing 
a CA STAAT first since the (PP)register in CH1 is used 
as a temporary register (to hold SCB)and is only correct- 
ly loaded by a CA STAAT. 


The upper 5 bits in the CCW will have affect if CF = 000 
or upon a CA STAAT. Some things to note about these 
upper fields are: 


• Priority Bit - 
If both channels are doing tasks of 


the same overall priority, the tasks with the higher 
priority bit will run. If the priority bits are the same, 
execution will alternate between the two channels. 


• BLL Bit (Bus Load Limit) - 
Keeps nonchained in- 
structions 
from occurring 
more often than once 


every 128clocks. However, channel attention or ter- 
mination cycles, even on the other channel, may 
disrupt 
the 
exact 
time 
interval 
to 
the 
next 


instruction. 


It should be noted that the setting or clearing of the 
BUSY flag 
occurs 
after 
the 
loading 
or 
storing 
of 


registers, so that in a system where the main CPU uses 
the BUSY flag as a form of semaphore to tell when the 
lOP is truly finished, there is no danger that the SCB, 
CP, PP or TP could be changed before the lOP loads 
them. 


Also since DMA termination cycles and chained instruc- 
tion execution have a higher priority than CA, it is possi- 
ble for 
CA to be "shut·out" 
by these higher priorities 


runn'ing on the other channel. However, since CA is 
always latched (except during initialization), it won't be 
forgotten. 


How Can a Channel be Halted? 


Sometimes 
a channel may stop its operation 
unex- 
pectedly. To see what could cause this, and to show the 
impact of halting a channel, the various ways of stop· 
ping a channel are explained: 


HALTED CHANNEL - 
If the channel has never started 


after initialization, 
if it has received a CA HALT com- 


mand or a software 
HALT, channel operation is sus· 


pended. If the other channel can run, it will, otherwise 
idle cycles will run. Only a CA STAAT or CONTINUE can 
resume operation. 


WAITING FOA A DMA AEQUEST- 
If the channel is in a 


source or destination synchronized DMA transfer mode, 
it will wait until DAQ is active before running its syn- 
chronized 
transfer. 
To minimize 
the impact 
on the 


overall throughput of the chip, the other channel can run 
during these DAQ wait periods. 


WAITING TO GET THE BUS BY AQ/GT - 
If the lOP has 


given the bus away via I'm/err, it won't initiate any bus 
transfers until it has the bus back. The machine will run 
up to just before T1 of a bus clock cycle and will three- 
state its address/data and status pins until It has been 
granted the bus. 


WAITING FOA AEADY - 
When running bus transfers, 


"READYis sampled at T3 of a busy cycle. If inactive, the 
whole chip will walt until AEADY goes active. 


The last two cases of waiting (or "wait" states) stop the 
whole chip and do not permit the other channel to run. 
However, with AEADY inactive or with the bus not ac- 
quired, there Is not much that can be done on the other 
channel anyway. These two cases only stop the chip 
when running bus cycles. Any internal operations can 
proceed without having the bus or with the system not 
AEADY. 


Note the difference between when the chip is HALTed 
when using AQ/GT and an external arbiter (8289) for 
bus arbitration. 
Not having the bus due to AQ/GT will 
inhibit the bus cycle from even starting. Since the 8289 
stops the chip by forcing 
AEN inactive, which goes 


through the 8284 clock generator to force AEADY inac· 
tive to the lOP (or 808618088), 
a bus cycle has already 


been started, with ALE asserted, and the address on the 
address/data lines. When the bus is obtained, operation 
proceeds at T3 of the bus cycle. 


As will be mentioned later, many invalid opcodes will 
cause the machine to hang up. In these cases the 
address/data lines will point to where the bad opcode 
was fetched. 


Task Execution 


Although optimized for fast and flexible DMA operation, 
the lOP is also a fUll-fledged microprocessor. The 8086 
Family 
User's 
Manual 
deals 
with 
programming 


strategies and other details. Some of the things to be 
noted during debugging will be mentioned here. 


InstructIon FetchIng 


Unlike the 8085 (but like the 8086), the 8089 labels all 
fetches from the instruction stream, whether OPCODE, 
offset, displacement, or literal data, as an instruction 
fetch on the status lines. In some cases, such as MOV 
A,I and ADD A,I, the instruction 
fetch time greatiy ex- 


ceeds execution time because literals are treated as in- 
struction fetches. When following programs on a logic 
analyzer, triggering on status = 100 or 000 (instruction 
fetch) and a known program address is the handiest way 
to trace the flow of the program. 


When running programs on a 16·bit bus, a 1-byte queue 
register comes into play, saving the upper byte fetched 
from 
the 
last instruction 
fetch, 
If not used by the 


previous Instruction. This reduces fetch time and bus 
utilization since the odd byte doesn't need to be fetched 
again. An internal four·clock cycle fetches data from the 
queue. Like the internal AOM fetches, the task pointer Is 
put out on the address/data lines, but no bus cycle Is run. 


The queue can have some possible unexpected 
affects 
that have to be taken into account 
during debugging. 


These apply only to 16·bit systems and are: 


1. Instructions 
that 
start 
on odd boundaries 
will 
not 
likely 
have bus cycles 
run to fetch 
the odd byte 
unless jumped 
to, unless 
preceded by LPDI (which 
clears the queue), or an instruction 
that modifies 
the 
task pointer'is 
executed. The laVer causes the'queue 
to be cleared so that part of an old instruction 
won't 
become part of the new one. 


2. There is a queue register for each channel so loading 
or clearing the queue on one channel has no affect on 
th.e other channel's 
queue. 


3. The second 
word of immediate 
data fetched 
by a 
LPDI is done during a pseudo-instruction 
fetch cycle 
that cannot make use of the queue or already fetched 
data. Thus, If on an odd boundary, fetching 
an LPDI 
will be byte, word, byte, byte, byte, and the queue will 
not be loaded. 


When Can the Other Channel In~errupt Instruction 
Execution? 


This will be explained more in the "dual channel" 
opera· 
tlon section, but a few points will be mentioned here. All 
Instructions 
are made up of internal 
cycles, with each 
cycle composed 
of two to eight clocks. Each bus cycle 
is one Internal cycle, but there can be internal 
cycles 
with 
no comunications 
to outside 
the chip. 
Internal 
cycles will be extended by the number of wait states in 
each bus cycle, Between any of these cycles, DMA from 
the other channel can intervene if the priorities 
permit it. 


Instruction 
fetching and execution can only interrupt in- 
structions 
on the other channel 
when the instruction 
has been completed, 
not between internal cycles, 


Registers 


All the registers 
have some special purpose use in the 
Instruction, 
Execution 
or DMA, but all except 
the CC 
register can be used as general purpose registers during 
instruction 
sequences. A few are loaded specially: 


• CP - 
Is only 
loaded during 
an initialization 
se- 
quence. There is one CP register that handles both 
channels. jAil 
others 
are duplicated, 
one set for 
each channel.) 


• 
PP - 
Is only properly 
loaded during a CA START 
command. It holds the SCB value after the initializa· 
tion sequence. 


• 
TP - 
This is included as part of the registers in the 
RRR field, 
but cannot 
be operated on unless you 
plan 
on 
having 
your 
program 
execution 
jump 
around. Everytime this is operated on, the queue is 
cleared. The TP is loaded from two words (address 
and displacement) 
on a CA START, LPD, or LPDI, 
and loaded from 3-byte MOVP format (see illustra· 
tion On page 5) on a CA CONTINUE, and can be op· 
erated on using any register oriented 
instructions. 


The following 
registers are loaded during program exe- 
cution, 
but can have special effects: 


• CC - 
The only thing that affects instructions 
in the 
CC register is the' chaining 
bit. If chaining 
doesn't 
matter (if only one channel is being used without 
channel attentions, 
for example), then the CC reg· 
ister can be general purpose. However, for portabil- 
ity of programs, it is strongly suggested 
not to use 
the CC register except for altering DMA parameters 
and chaining. 


• MC - 
Is a general purpose 
16·blt register, 
but Is 
also used to do a masked comparison 
either 
for 
DMA search/match termination 
or forthe 
JMCE and 
JMCNE instructions. 


• 
BC,./X 
- 
Both general purpose 16·bit registers. 
In 
instructions 
that reference 
memory using the AA 
field, if AA = 11, the IX register 
is incremented 
by 
the number of bytes fetched or stored. 


• Pointer Registers (GA, GB, GC and TP) - 
Are 20-bIt 
registers, 
but can also be used as 16-blt registers. 
Adds 
will 
carry 
into 
the upper 4 bits, 
but other 


.operations 
(COMP, OR, AND) are done only on the 


- 
lower 16 bits. Note that when used as pointers 
to 
system memory, It is possible to add a large 16-blt 
number to the pointer and to put the pointer 
Into 
another 64K block of memory. 


Sign Extens/on 


All program data brought into the chip, either literals or 
displacements 
in opcodes, 
or program 
data fetched 
from 
memory, 
is 
sign-extended. 
Offsets 
used 
for 
calculating 
addresses are not sign extended. Any 8·blt 
data brought 
in has bit 7 sign-extended 
up to bit 19. 


Sixteen·bit 
data is sign-extended 
from bit 15 to bit 19. It 
is important 
to note this, because it can affect 
logical 
operations. 
For example, 
if one wanted 
to OR 0084H 
with 
1234H in register 
GC, you couldn't 
do ORBI GC, 
84H, because bit 7 would 
sign-extend 
Into the upper 
byte. Instead, you should 
code ORI, 0084H to do this 
properly 
(note that this has a word for the immediate 
data). The non-ADD operations 
will cause the upper four 
bits of the pointer registers to be invalid since the upper 
four bits of the ALU come only from the adder. 


Tags 


It should be noted that the way the lOP knows which 
bus to access (system or I/O) is via the Tag bit associ· 
ated with the pointer register used. The TAG can only be 
set in these ways: loading as a 16-bit register (MOV R,M, 
MOV R,I) sets TAG to I/O space, loading 
as a pointer 
(LPD, LPDI) sets TAG to a system space), or bringing the 
TAG in from memory by a MOVP instruction. 


Effects 
0# Invalid Opcodes 


The upper 6 bits of the 2-btye opcode actually determine 
which opcode will be executed. 
If these bits are a valid 
opcode, but lower bits are invalid, the chances are good 
that the bad bits will be ignored. But if the upper six bits 
are invalid, there is a very good chance that the chip will 
hang up and stop execution 'in that channel. The only 
way to get out of this mode is to reset the chip. If this 
hang-up occurs, 
it can usually 
be traced 
because the 
last address of the instruction 
fetch will stili be on the 


Going from Instruction 
Execution 
Into DMA 


The XFER Instruction 
places the current 
channel 
Into 
the OMA mode after the next Instruction. 
This permits 
one last instruction 
to start up an 1/0 device (start CRT 
display on an 8275, for example). However, in order for 
the lOP to get setup 
for OMA, the GA, GB, and CC 
registers 
should not be altered during this last Instruc· 
tlon. 
Failure to observe this will 
probably 
result 
in an 
improper 
first 
OMA fetch. The WID instruction 
can be 
placed after XFER. 


DMA Trans'ers 


Incrementlng/Non·lncrementlng 
pointers 


A memory or 1/0 pointer can be made to increment 
for 
each byte transferred 
during OMA or It can remain fixed. 
Incrementing 
is 
used 
primarily 
for 
memory 
block 
transfers, 
and non·lncrementing 
Is used to access 
1/0 
ports. 


BIW Mode 


Each OMA transfer 
is composed 
of separate fetch and 
store cycles so that 8/16-bit data can be assembled and 
disassembled, 
and translation 
and termination 
may also 
be easily handled. There are four possible transfers 
or 
BIW modes. They are: 


B - B-1 
byte fetched, 1 byte stored 
BIB - W - 
2 bytes fetched, 1 word stored 
W - BIB - 
1 word fetched, 2 bytes stored 
W - W - 
1 word fetched, 1 word stored 


The BIW mode used depends on the logical 
bus width 
(selected 
by the WID Instruction), 
address 
boundary, 
and Incrementlng 
mode. 


All systems with 8·bit physical buses will run in the BIB 
mode. On 16-bit physical 
buses the other 
modes are 
possible, 
dependirg 
on the 
logical 
widths 
selected. 


Note that the logical bus width can be different 
than the 
physical bus width since there are cases where an 8·bit 
peripheral may be used on a 16-bit bus. The selection of 
the logical 
width, 
and not the physical 
width, 
is what 
determines 
the BIW mode. Thus it is the responsibility 
of the programmer 
not to program an invalid'combina· 
tion (i.e., don't specify a 16-bit logical width on an 8·bit 
physical 
bus). 


Any transfer on an odd boundary will be BIB but if the 
pointer is incrementing 
and on a 16·bit logical bus, after 
the first transfer, the pointer will be on an even bound· 
ary. The lOP will then try to maintain word transfers 
in 
order to transfer data as effeciently 
as possible. See the 
user's manual for details. The change in BIW mode oc· 
curs only after the first transfer or, as explained 
in the 
termination 
section, 
upon 
certain 
byte 
count 
ter· 
minations. 


Synchronization 


In the unsynchronzied 
mode, transfers 
occur as fast as 
priorities 
will 
allow. 
This 
is the 
lOP's 
"block·move" 
mode. Most I/O peripherals only want a OMA transfer on 
demand; 
the 
ORO lines, 
along 
with 
synchronization 
specified, 
will handle this need. Source synchronization 


IT me IUt" 
IS wailing 
lor a UMA reques., II Will run pro- 


grams or OMA on the other channel, 
or execute 
Idle 
cycles If nothing Is pending. If running Idle cycles when 
the ORO comes, 
the transfer 
starts 
five clocks 
after 
ORO Is recognized. 
If running 
OMA or Instructions 
on 
the other channel, the ORO cannot be serviced until the 
current 
internal 
cycle is done, and may require a max- 
Imum 
of 
12 clocks 
(without 
bus arbitration 
or wait 
states). 


Consecutive 
ORO·synchronized 
OMA transfers 
on the 
same channel are separated by four idle clocks (assum- 
ing no other delays) by an internal sampling mechanism. 
This happens 
between 
the 2·byte fetches 
on source· 


synchronized 
B/B·W cycles, and between the two stores 
on destination·synchronized 
W·B/B cycles. 
This delay 
between consecutive 
OMA cycles allows adequate time 
for proper 
acknowledgement 
of the current 
OMA reo 


quest 
before 
the 
next 
request 
is 
processed. 
On 
destination·synchronized 
OMA, this isn't a problem, but 
on source·synchronized 
OMA, there will 
be four extra 
clocks 
per transfer. 
Unless one is running 
right at the 
speed limit, this won't be a 'problem. Near the maximum 
data rate, unsynchronized 
transfers 
can be used, with 
synchronization 
done by manipulating 
the READY line. 


Translate Mode 


When the translate 
bit is set, the data fetched 
during 
OMA will be added to the GC register. This new pointer 
will in turn be used to fetch, via a seven clock extra fetch 
cycle, new data, which will then be stored. Translate 
is 
only defined for byte transfers. 
The bytes are added to 
GC as a positive offset, so a lookup table for translating 
data can be a maximum of 256 bytes iong. Even if the 
data to be translated 
falls within a smaller range (such 
as ASCII code), a full 256·byte lookup table is recom- 
mended so that erroneous data can be flagged and con· 
trolled. 


Translate can be run on any of the,BIB transfer modes, 
so it is useful for doing block translation 
within program 
execution as well as translation 
directly to or from an I/O 
port. 


DMA Termination 


One of the powerful 
features 
of the lOP Is Its varied 
OMA termination 
conditions 
and their close tie·in with 
resuming 
Instruction 
Block 
programs. 
However, 
be- 


cause of the multitude 
of OMA modes, care must be 
taken In predicting 
the exact termination 
parameters. 
Various things to be careful about will be outlined 
Mre. 


Byte Count (BC) Termination 


The BC register 
is decremented 
for every byte trans- 
ferred whether or not BC termination 
is set. If BC ter· 


mination 
is set, the last transfer 
done is the one that 
results in BC being zero. To avoid the problem of miss· 
ing BC = 0 on word transfers, 
if BC is odd between every 
transfer, 
the lOP detects 
when BC is 1, and forces the 
last transfer to be in the BIB mode. Since both the fetch 
and store cycles are complete, 
the source and destina- 
tion pointers point exactly to the next byte or word that 
would have been fetched. 


Masked 
Compare 
(MC) Termination 


An MC termination occurs when a pattern matches (or 
doesn't match, depending on mode selected) the lower 
half of the MC register (the match pattern) with only the 
bits that are enabled by the upper half of MC (the mask 
pattern) contributing 
to a match. Thus the masked bits 
can be "don't careS" in both the data byte and the match 
byte. 


The masked comparison is only done on store (deposit) 
cycles. Any bytes transferred (in BIB or W·BIB mode) will 
be compared. But, since the MC comparison is done on 
only one byte, any words stored (W·W or B·B/W) have 
only their lower byte compared. This may be fine, but if 
not, make the destination logical width 8 bits. 


Just like BC termination, the pointers will point to the 
next data to be transferred. The BC will also be decre- 
mented correctly, except if the termination occurs on 
the first byte of a W·B/B transfer. In this case the BC will 
be decremented as if the entire transfer (both bytes) hl\d 
taken place. 


The store cycle that causes an MC termination will be 
lengthened by two extra clocks (or by one extra clock if 
there are wait states), to allow time to set up the ter· 
mination cycle. 


External (EXT) Termination 


External termination -allows the I/O device or controller 
to use its own conditions 
to generate 'a termination. 
Basically, the lOP will halt DMA as soon as it recognizes 
an EXTterminate, even if a transfer is only partially com· 
plete. There might be concern that multibyte 
cycles 
(W·B/B or B/B·W) might have data lost if an EXT ter· 
minate stopped .the stQrecycle. In unsynchronized DMA 
this would happen, but this mode is typically not used 
with 
1/0 controllers 
that could generate external ter· 
minations_ In synchronized DMA mode.s, it is assumed 
that the 1/0 controller will only do·a DRQ for valid data 
transferred, and that it won't give an EXTterminate with 
its 
DRQ active. 
In destination 
synchronization, 
the 
possible problem occurs in the W·B/B mode, where EXT 
terminate comes after the first store but before the sec· 
ond. This is fine, since even though datl\ was over· 
fetched, the proper amount was actually transferred. In 
source synchronization, 
the B/B·W mode raises prob· 
lems since if an EXT terminate came after the first byte 
fetched and before the second byte fetched, normally 
no store cycles would be done at all, thus losing the first 
byte fetched. In this case (i.e., source synced, DRQinac- 
tive, and 1 byte already fetched), a single byte store 
cycle is run before the termination cycle, ensuring data 
integrity. 


In orderto prevent an invalid signal level from becoming 
trapped from the asynchronous 
EXT term lines, two 
clocks of delay and signal conditioning 
are done on 
these lines. In addition, a termination cycle can only be 
started at certain limes during DMA (or TB on the other 
channel - 
see dual channel operation section). The EXT 
terminate lines should be valid eight clocks before the 
start of the DMA cycle to be stopped. 


EXT is sampled even when the lOP is running something 
on the other channel. Remember though, that despite 
the h'igh priority of termination, the current instruction 
on the other channel has to finish before the termination 
cycle is run. Simultaneous EXTson both channels result 
in CH1 termination being done first. 


In order to have enough time to process a byte count ter- 
mination, the BC register is always decremented during 
DMA fetch cycles. Because of this, external or MC ter· 
minations that occur during W·B/B cycles will result in 
the byte count always being decremented by two, even 
if only one byte is stored. This also occurs in the block· 
to-block or block·to-port B/B-W modes. To find the exact 
number of bytes transferred, the source pointer address 
can be checked in the block·to-port and block·to·block 
modes during 
B/B-W cycles and in the block-to-port 


W-B/B mode. The destination 
pointer address can be 
used to find the number of bytes transferred in the port- 
to-block and block-to-block modes during W-B/B cycles. 


Termination 
Cycles and Multiple 
Terminations 


Upon termination, the user can run different task block 
programs, depending on.which type of termination has 
occurred, by specifying an appropriate termination off- 
set. That 
is, instruction 
fetching 
will 
begin after 
a 
termination cycle starting at either the TP value before 
the DMA started, TP + 4 or TP+ 8. These offsets permit 
long or short jumps to termination routi'nes. 


The termination cycle is an add immediate instruction 
that runs from the internal ROMand adds the proper off· 
set '0 the TP. It is 15 clocks long for TP+ 4 and TP+ 8 
termination and 12 clocks long for TP+ 0 termination. 


As mentioned earlier, EXT terminate must come a cer· 
tain time before the end of a transfer to ensure that the 
next transfer doesn't start_ If it 'comes in time and MC 
termination also occurs on tbe current transfer, then the 
termination 
cycle 
with 
the largest 
offset 
is run. A 
simultaneous BC terminate cycle wili have priority over 
MC and wili result in the running the BC termination 
program. 


PrloritleslDual Channel Operation 


The lOP can share its internal and external hardware 
between two separate channels. The user sees two 
identical lOP channels with all register~, machine flags, 
etc., independent of the other channel. The only register 
in common is the CP register, loaded by the initializa- 
tion sequence. The mechanism for achieving dual chan- 
nel operation 
is time multiplexing 
between the two 
channels. 


Since interleaving two channels affects their response 
time to external events and since interfacing to these 
events is the prime purpose of the lOP, several means of 
adjusting the priorities of the channels are provided. 


Before going into the priority algorithms in detail the 
four types of cycles that are affected by the priorities 
will be outlined: 


1. DMA Cycles - 
Any type of DMA transfer cycle, 
including single transfers and translate cycles. DMA 
can be interrupted after any bus transfer by the other 
channel. 


2. Instruction 
Cycles,,- 
Any instructions 
that have 
been fetched out of I/O or system memory. Instruc- 
tion cycles are made up of internal cycles, each two 
to eight clocks long (assuming no wait states). Some 
cycles may not run bus transfers. Instructions can be 
interrupted 
by DMA after anyone 
of the internal 
cycles, but can only be interrupted by.instructions on 
the other channel (normal ones or ones from internal 
ROM)after the current instruction is completed. 


3. Termination Cycle - 
Performed when DMA transfers 
end and 
instructions 
resume (except 
on single 
transfers). 


4. Channel Attention Cycles - 
Performed when chan- 
nel attention is given, performs actions specified in 
the CCW field. Both termination and CA cycles can 
be interrupted by DMA after'any internal cycle, but 
can only be interrupted by instruction 
cycles after 
the complete sequence of internal cycles is done. 


Termination and channel attention cycles as well as the 
initialization cycle (which never runs concurrently with 
other operations) are sequences of instructions fetched 
from an internal ROM. 


Recognizing the higher importance in doing DMA, ter- 
mination and (to a lesser extent) CA cycles, the follow· 
ing priority scheme is built into the lOP. Any channel 
that has a higher-priority operation will run continuously 
until done. If both'channels are running the same priori- 
ty, execution will alternate between them. 


Highest Priority 


1. DMA transfers, termination, chained instructions 
2. Channel attention cycles 
3. Instruction cycles 
4. Idle cycles 


Lowest Priority 


Two ways exist to alter the priority scheme. One way is 
to utilize the priority bits for each channel. If one is 
greater than the other, that channel will run at the ex- 
pense of the other if both channels are otherwise run- 
ning at the same priority. Thus the P bit only has effect 
on channels running at the same priority level. 


If one wants to run instructions along withor in place of 
DMA on the other channel, the other technique is to set 
the chaining bit (in the CC register) which brings the 
instruction priority up to the level of DMA. Care should 
be taken with this since now CAs are at a lower priority 
than instructions and will not be serviced unless that 


channel goes idle. Chaining will also lock out normal in- 
structions on the other channel. Chaining should thus 
be used with care. 


In order to reduce the possibility of shutting out channel 
attentions, an exception is made to the above priority 
scheme. After every DMA transfer, whether synchro- 
nized or unsynchronized, the lOP will service any pend- 
ing CA. However, chained task block execution will still 
shut out CAs on the other channel. 


What 
is the importance 
of 
priorities? 
Well, as an 
example, let's say that we are running long periods of 
non-time-critical block moves (via DMA)on one channel 
and running short bursts of DMA that must be serviced 
promptly 
on 
the 
other 
channel. 
With 
the 
default 
priorities, the short DMA channel bursts would be in- 
terleaved with the longer DMA, reducing the maximum 
transfer rate for both channels. If, however, the priority 
bit was one on the burst mode DMA and zero on the 
other, the bursts would be serviced continuously at the 
fastest possible data rate. 


An even more critical case would be the same low prior- 
ity, long DMA transfers on one channel with DMA on the 
other channel that must terminate, run a short instruc- 
tion sequence, and resume DMA again within a short, 
fixed time. (This might be the case in running a CRTdis- 
play with linked list processing between lines.) Normal- 
ly, the low priority, long DMA could indefinitely 
block 
the short TB sequence. By setting the high-priority chan- 
nel's priority bit to one and putting it into the chained 
instruction 
mode, the low priority channel would stop 
its DMA entirely so that the terminationlinstruction 
se- 
quence could run. 


When establishing the priorities to be run, care should 
be taken that both channels will run successfully under 
a worst case combination. This can be tricky when the 
channels are running asynchronously with fast data 
rates andlor 
short latencies, but must be taken into ac- 


count. Of course, running only one channel on the lOP is 
an easy solution, but if more than one lOP is being used 
in the system, the priorities and delays of the bus ar- 
bitration used (either RQ/GT 
or an 8289bus arbiter) must 
be taken into account. It may be found that the on·chip 
arbitration between the two channels is faster and more 
powerful than external arbitration. 


It is hoped that the material presented here will aid 
those who are putting together and debugging an 8089 
lOP system, and help them in understanding the opera- 
tion of the lOP. Many of the debugging techniques 
should be familiar to those who haveworked with micro- 
and minicomputer 
systems before. Other debugging 
techniques not mentioned here, which work well with 
microprocessor systems, could be just as applicable to 
the 8089. The unique nature of the lOP among LSI 
devices warrants special consideration for its I/O func· 
tions and multiprocessor capabilities. 


Appendix I 


CHECKLIST 
OF POSSIBLE PROBLEMS 


HARDWARE 
PROBLEMS 


• Is RESETat least four clocks long? 


• Are both Vss lines connected to ground? 


• Does the first CA falling edge come at least two clocks 
after RESETgoes away? 


• Does the second CA come at least 150 clocks (16-bit 
system, no wait states) after the first CA? 


• Is 
READY correctly 
synchronized 
and 
gated 
by 
local/system bus lines? 


• Is SEL correct for first CA so that lOP comes up cor- 
rectly as master or slave? 


• If two lOPs are local to each other, is a 2.7K pull-up re- 
sistor used on RQIGT? 


• Are the initialization 
parameters in the initialization 
linked-list correct? 


• Is BUSY flag being properly tested by host CPU soft- 
ware before modifying 
PB or providing a new com- 
mand? 


• Has the chaining, 
translate, 
or lock bit in the CC 
register been erroneously set? 


• Have DMA termination conditions been met? The lOP 
could be trying to do endless DMA. 


Appendix II 


BREAKPOINT 
ROUTINE 
AND 
CONTROL PROGRAM 


The debugging program described here is an example of 
the kind of software 
development 
tool 
that 
can be 
developed for the 8089 lOP. It was written to tryout 
various 
breakpoint 
schemes, and has been used to 
debug an engineering application test system. The pro- 
gram is not meant to be the ultimate debugging tool, but 
is an example of what can be put together to utilize the 
breakpoint routine described earlier in the application 
note. 


The debugging program was tested on a 8086-based 
system that emulates the SDK-86I/O structure, and uses 
the SDK~6 serial monitor. This enables it to use the 
SDK-86 
Serial 
Downloader 
to 
interface 
to 
an 
Intelle~ 
development system on which the software 
was 
created. The 8086 system 
is 
interfaced 
via a 
MULTIBUS™ interface to an lOP running in the REMOTE 
mode. The remote bus access technique, mentioned 
earlier in this note, is implemented on this system, but 
was not used in the software debugging program. 


The breakpoint 
routine uses a simple jump to a save 
routine. The PUM-86 supervisory 
or control 
program 
handles the placement of the jump within the users pro- 
gram. Since it can not normally access the remote bus, 
all lOP programs to be tested must run out of system 
memory. 


When the control program starts, it assumes the lOP has 
just been reset. It then prompts the user for the CP 
and PPvalues. After this, it sends the first (initialization) 
channel attention. It then asks the user for the channel 
to be run, and the starting and stopping addresses. After 
the stopping address has been entered, a Channel At· 
tention 
Start is given. If the breakpoint 
is reached, a 
HALT is executed, and the control program prints the 
register contents. If the breakpoint hasn't been reached, 
the user can type any character, and a Channel Atten- 
tion Halt will be sent to the lOP. If the lOP responds 
within 
50 ms, the TP where it was halted is printed. 


Otherwise, 
the 
control 
program 
issues 
an 
error 
message. If, at any time, the user wants to get out of the 
program, typing an ESC will pass control back to the 
SDK-86 monitor. Figure 9 shows the flow of the control 
program. 


Note that, unlike a single CPU debugging routine, hav- 
ing the 8086 supervise the 8089 enables a clean exit 
from crashed lOP programs. The program code where 
jumps had been placed are always restored. The control 
program is a good example of how the power of dual 
processors can be put to good advantage. 


Comments 
within 
the 
control 
program 
indicate 
parameters that need to be changed to run on different 
systems. It should be noted that channel attentions are 
invoked by the recommended method of using an I/O 
write to a port to generate CA and using AOfor SEL. 


Source and object files of this program are available 
through Intel's INSITeM 
User's Program Library as pro- 


gram 8089 Break. 89 (number AD6). 


MASTER 
DATA 
STORAGE 
LOCATIONS: 
=l 


INCREASING 
ADDRESS 
PP 
TP 
- 
TP 


GA 
r 
GA 


GB 


GC 
GB 


GC 
- 
BC 


IX 
i- 
cc 
I- 


Me 
I- 


PP+ 245 
PP+ 248 
PP+250 


pp + 252 


PP+2$ot 


lOAD 
PP 
WITH 
STARTING 
POINT, 


BUSY 
FlAG 
WITH 
OFFH 


ISIS-II 
PL/M-·86 
XID3 
COMPILATION 
OF 
MODULE 
BREAKPOINT 


OBJECT 
MODULE 
PLACED 
IN 
BREAK.oBJ 


COMPILER 
INVOKED 
BY: 
:Fl:PLM86 
BREAK. SRC 
PAGEWIDTH 
(IDD) 


8D89 
BREAK 
POINT 
PROCEDURE 
WRITTEN 
BY 
DAVE 
FERGUSON 
2/2/79 


INTEL 
CORPORATION 


BREAK$PoINT: 


00; 
DECLARE 
I BYTE; 


DECLARE 
SAVECoDE 
(4) 
WORD; 
I*BUFFER 
FOR 
SToRAGE*1 


DECLARE 
oNEPP 
POINTER; 
1* CHAN 
ONE 
PP 
*1 


DECLARE 
TWoPP 
POINTER; 
1* CHAN 
TWO 
PP 
*1 
DECLARE 
STARTBYTES 
(4) 
BYTE; 
1* 
BUFFER 
FOR 
START 
ADDRESS 
*1 


DECLARE 
STARTPoINTER 
POINTER; 
1* POINTER 
FOR 
START 
ADDR. 
*1 
DECLARE 
ENDPoINTER 
POINTER; 
1* POINTER 
FOR 
END 
ADDR. 
*1 
DECLARE 
PRESENT 
POINTER 
AT 
(@INPNTR); 
1* POINTER 
BUFFER 
*1 


DECLARE 
TRUE 
LITERALLY 
'DFFH', FALSE 
LITERALLY 
'DDDH'; 


1* 
YOU 
MUST 
CONFIGURE 
YOUR 
liD STRUCTURE 
AND 


SYSTEM 
TO 
MATCH 
THE 
PROGRAM 
OR 
VISA 
VERSA 
*1 
DECLARE 
CRTSTATUS 
LITERALLY 
'D~FF2H', 
1* 8251 
STATUS 
PORT 
*1 


CRTDATA 
LITERALLY 
'DFFFDH', 
1* 8251 
DATA 
PORTS 
*1 


CHANATTEN 
LITERALLY 
'DFAH', 
1* CHANNEL 
ONE 
CHANNEL 
ATTENTION 
PORT 
*1 


1* 
CHANNEL 
TWO 
CHANNEL 
ATTENTION 
PORT 
s 
CHANATTEN 
+ 
1 *1 


CHANNELoNE 
LITERALLY 
'DDH' , 
CHANNEL 
TWO 
LITERALLY 
'DIH', 


1* ASCII 
IS 
A 
STRING 
OF 
HEX 
CHARACHTERS 
IN 
ASCII 
FORM 
*1 
ASCII 
(*) 
BYTE 
DATA 
('DI23456789ABCDEF'), 
TITLE$STRING 
(*) 
BYTE 
DATA 
<DAH, DOH, '8D89 
BREAKPOINT 
VER 
I.D', 
DAH, DOH, 'TYPE 
ESCAPE 
TO 
RETURN 
TO 
MONITOR. 
" 


DAH, DOH, D), 


CHANGIVEN 
(*) BYTE 
DATA 
('CHANNEL 
ATTENTION 
GIVEN 
TYPE 
ANY 
KEY 
TO 
ABORT. 
' 


,DAH, DOH, D), 


BKREACHED 
(*) 
BYTE 
DATA 
(DAH, DOH, 'BREAKPOINT 
REACHED',DAH,DDH,D), 
GETCP 
(*) 
BYTE 
DATA 
('INPUT 
CP 
IN 
HEX',DAH,DDH,DD), 


GET$PP 
(*) BYTE 
DATA 
('INPUT 
PP 
IN 
HEX 
FOR 
',DDH), 


GETSTART 
(*) 
BYTE 
DATA 
(DAH,DDH, 
'INPUT 
STARTING 
ADDRESS 
IN HEX',DAH,DDH,DDH), 
SToPADDR 
(*) 
BYTE 
DATA 
('INPUT 
END 
ADDRESS 
IN 
HEX', DAH, DDH,ODHl, 
CHANNUMBER 
(*) 
BYTE 
DATA 
(DAH, DOH, 'CHANNEL 
ONE 
OR 
TWO? 
',DDH), 
ABORT 
(*) BYTE 
DATA 
(' FATAL 
ERROR 
- 
lOP 
DOES 
NOT 
RESPOND 
TO 
CHANNEL', 


, ATTENTION. 
RE-INITIALIZE 
SYSTEM 
',D), 


ABoRTAT 
(*) 
BYTE 
DATA 
(' TP 
WAS 
',D), 


ONE 
<*) 
BYTE 
DATA 
(' CHANNEL 
ONE',DAH,DDH,DDH), 


TWO 
(*) 
BYTE 
DATA 
(' CHANNEL 
TWo',DAH, 
DOH, DOH), 
GASTRING 
(*) BYTE 
DATA 
< 'GA = 
',DDH), 


18 


19 
2 
20 
2 
22 
2 


23 
2 


24 


25 
1 
26 
2 
28 
2 


29 
2 
30 
2 
32 
2 
33 
2 


34 


35 
2 
36 
2 
37 
3 
39 
3 
40 
:2 
41 
:2 


GBSTRING 
<*) 
BYTE 
DATA 
< 'GB = 
',OOH), 


GCSTRINQ 
<*) 
BYTE 
DATA 
< 'GC 
= 
',OH), 


BCSTRINQ 
(*) 
BYTE 
DATA 
(OAH,ODH, 
'BC = 
',OOH), 


IXSTRINQ 
<*) 
BYTE 
DATA 
(OAH,ODH, 
'IX 
',OOH), 


CCSTRING 
(*) 
BYTE 
DATA 
<OAH,ODH, 
'CC 
',OOH), 


MCSTRING 
<*) 
BYTE 
DATA 
<OAH,ODH, 
'MC 
',OOH) 


DECLARE 
CHAR 
BYTE; 


DECLARE 
ONETWO 
BYTE; 


1* SDKMON 
IS 
A 
PLM 
TECHNIGUE 
USED 
TO 
FORCE 
THE 
CPU 
INTO 
AN 
INTERUPT 
LEVEL 
3. 
IN 
ORDER 
TO 
USE 
THIS 
THE 
PROGRAM 
MUST 
BE 
COMP IL.ED (LARGE). 
*1 


SDKMON: 
PROCEDURE; 


DECLARE 
HERE 


1* 
THIS 
IS 
AN 
WHERE 
WORD 


CALL 
WHERE; 


END; 


<*) 
BYTE 
OATA 
<OCCH), 


INT. 
3 
*1 


DATAl. HERE); 


1* CO 
SENDS 
A 
CHAR 
TO 
THE 
CONSOLE 
WHEN 
READY 
*1 


1* 
THIS 
ROUTINE 
IS 
WRITTEN 
TO 
RUN 
VIA 
THE 
SERIAL 
PORT 
OF 
AN 
SDK86 
*1 


CO: 
PROCEDURE 
(C); 


DECLARE 
C 
BYTE; 


00 
WHILE 
(INPUT<CRTSTATUS) 
AND 
OIH) 
OUTPUT 
(CRTDATA) 
= C; 


END; 


1* CI 
GETS 
A·CHARACHTER 
FROM 
THE 
USER 
VIA 
THE 
SERIAL 
PORT 
*1 
1* CI 
AUTOMATICALLY 
ECHOS 
THE 
CHARACHTER 
TO 
THE 
USER 
CONSOLE 
*1 


DECLARE 
tSCAPE 
LITERALLY 
'IBH'; 


CI: 
PROCEDURE 
BYTE; 


DO 
WHILE 
(INPUT(CRTSSTATUS) 
AND 
O:2H) = 0; 
END; 


CHAR 
= 
INPUT 
(CRTDATA) 
AND 
07FH; 


CALL 
CO <CHAR); 


IF 
CHAR 
= ESCAPE 
THEN 
CALL 
SDKMON; 
1* GO 
TO 
SDK 
MONITOR 
*1 


RETURN 
CHAR; 


END; 


VALIDHEX: 
PROCEDURE 
(H) 
BYTE; 
DECLARE 
H 
BYTE; 


DO 
1=0 
TO 
LAST(ASCII); 
IF 
H=ASCII(I) 
THEN 
RETURN 
TRUE; 
END; 
RETURN 
FALSE; 


END; 


42 


43 
2 
44 
2 
46 
2 
47 
3 
49 
3 


50 
2 


51 


52 
2 
53 
2 
54 
2 
55 
2 


56 


57 
2 


58 
2 


59 
2 


60 
2 


61 


62 


63 
2 


04 
2 
65 
2 
06 
2 


67 
2 


68 
2 


69 
2 


/* HEXCONV 
CONVERTS 
A HEX 
CHARACTER 
TO BINARY 
FOR 
MACHINE 
USE. 


IF THE 
CHARACTER 
IS NOT 
A VALID 
HEX 
CHAR. 
THE 
PROCEDURE 
RETURNS 
THE 
VALUE 
OFFH 
*/ 


HEXCONV: 
PROCEDURE 
(OAT) 
BYTE; 
DECLARE 
OAT 
BYTE; 


IF VALIDHEX(DAT) 
<> OFFH 
THEN 
RETURN 
TRUE; 
DO 
1=0 TO LAST(ASCII), 


IF OAT 
= ASCII(I) 
THEN 
RETURN 
I; 
END; 
END; 


HEXOUT: 
PROCEDURE (C); 


DECLARE 
C BYTE; 


CALL 
CO(ASCII(SHR(C.4) 
AND 
OFH»; 


CALL 
CO(ASCll (C AND 
OFH»; 


END; 


/* WORDOUT 
CONVERTS 
A VALUE 
OF 
TYPE 
WORD 
TO AN ASCII 
STRING 
AND 
SENDS 
IT TO 
THE 
CONSOLE 
*/ 


WORDOUT: 
PROCEDURE 
(W); 


DECLARE 
W WORD; 
CALL 
HEXOUT(HIGH(W»; 


CALL 
HEXQUT (LOW (W) ); 


END; 


/* GETAODRESS 
IS A PROCEDURE 
TO GET 
AN ADDRESS 
FROM 
THE 
CONSOLE. 


THIg 
PROCEDURE 
WILL 
ONLY 
CONSIDER 
THE 
LAST 
5 CHARACHTERS 
ENTERED 


*/ 


DECLARE 
INPNTR 
(4) BYTE; 


GEHADDRESS· 
PROCEDURE 
POINTER; 


DECLAR~ 
BUFF 
BYTE; 
/*CLEAR 
ALL 
VALUES 
TO 
ZERO 
*/ 


INPNTR(O) 
0; 


INPNTR(I) 
= O. 


INPNTR (2) 
O. 


INPNTR(3) 
= 0; 


BUFF 
~ 0, 


DO WHILE 
BUFF 
<> 
TRUE; 


/* THIS 
SEQUENCE 
OF 
SHIFTS 
ALLOW 
THE 
USER 
TO 
TYPE 
IN FIVE 
OR 
MORE 
CHARACHTERS 
TO BECOME 
THE 
ACTUAL 
POINTER 
FOR 
8089 
OR 
8086. 
THIS 
PROCEDURE 
RETURNS 
THE 
LAST 
FIVE 
IN PROPER 
SEQUENCE 
STORED 
IN INPNTR(0-3). 
THE 
STORAGE 


IS AS FOLLOWS: 


1.THE 
LAST 
CHARACTER 
INPUT 
GOES 
INTO 


THE 
LOW 
FOUR 
BITS 
OF 
INPNTR(O). 


2.THE 
NEXT 
TO LAST 
CHARACTER 
GOES 
INTO 
THE 
LOW 
FOUR 
BITS 
OF 
INPNTR(2). 


3.THE 
THIRD 
CHARACTER 
INPUT 
GOES 
INTO 
THE 
HIGH 
FOUR 
BITS 
OF 
INPNTR(2) 
4.THE 
SECOND 
CHARACHTER 
INPUT 
GOES 
INTO 
THE 
LOW 
FOUR 
BITS 
OF 
INPNTR(3) 
5.THE 
FIRST 
CHARACTER 
INPUT 
GOES 
INTO 
THE 
UPPER 
FOUR 
BITS 
OF 
INPNTR(3). 


THE 
86 SHIFTS 
INPNTR 
(2,AND3) 
LEFT 
FOUR 
BITS 
AND 
ADDS 
THIS 
TO 
INPNTR(O) 
RESULTING 
IN THE 
ADDRESS 
THE 
USER 
TYPED 
IN. *1 


INPNTR(3) 
= 
(SHLI INPNTR(3). 4) OR 
(SHR( 
INPNTR(2),4) 
AND 
OFH». 


INPNTR(2) 
= 
(SHL(INPNTR(2).4) 
OR 
(INPNTRIO) 
AND 
OFH». 
INPNTR(O) 
= 
BUFF. 
BUFF 
= CI. 


BUFF 
= HEXCONV(BUFF). 


END. 
CALL 
COlOAH). 
I*LINE 
FEED 
TO CRT*I 
CALL 
COlODH). 
I*CARRIAGE 
RET 
TO CRT*I 


RETURN 
PRESENT. 
1* 
PRESENT 
IS A POINTER 
TO THE 
ARRAY 
INPNTR. 
*1 


END. 


STR INGfOUT: 
PROCEDURE (PTR )• 


DECLARE 
PTR 
POINTER,STR 
BASED 
PTR 
II) BYTE. 


I = 0; 
DO WHILE 
STR(I) 
<> O. 
CALL 
COlSTRlI». 


I = I + 
I. 


END. 
END. 


DECLARE 
TAGIS 
1*) BYTE 
DATA 
(' OPERATING 
IN 
',0), 
TAGISONE 
1*) BYTE 
DATA 
('10 SPACE',OAH,ODH,O), 
TAGISZERO 
1*) BYTE 
DATA 
('SySTEM 
SPACE'.OAH.ODH,O). 
1* 
TAGTEST 
TESTS 
THE 
TAG 
BIT 
AND 
SENDS 
A MESSAGE 
TO 
THE 
CONSOLE 
THE 
TAG 
IS LOCATED 
IN BIT 
THREE. 
A TAG 
BIT 
OF 
ONE 
MEANS 
THE 
POINTER 
IS TO 
110 SPACE, 
AND 
A TAG 
BIT 
OF 
ZERO 
MEANS 
THE 
POINTER 
IS TO SYSTEM 
SPACE 
*1 


1* 
THE 
CALLER 
MUST 
DECIDE 
WHICH 
BYTE 
HAS 
THE 
TAG 
AND 
PASS 
IT TO 
TAGTEST 
*1 


TAGTEST: 
PROCEDURE(TEST); 


DECLARE 
TEST 
BYTE; 


CALL 
STRINGOU~(~TAGIS); 


IF 
(TEST AND 
01000B) 
<> 0 
THEN 
DO. 


CALL 
STRINGOUT(ITAGISONE). 


END. 
ELSE 
DO. 


CALL 
STRINGOUT(~TAGISZERO)I 


END; 


1(;3 
1.)4 


:')5 


lC·6 
2 


107 
2 


:08 
2 


109 
2 
110 
;: 


111 
2 


112 
~ 
113 
2 
114 
2 


115 


110 
117 


118 
119 


120 


END; 
DECLARE 
SAVESADDR 
LITERALLY 
'2000H'. 
SAVESSEG 
LITERALLY 
'OOCOH'; 


DECLARE 
BREAK89 
(4) WORD 
DATA 
(9BB1H,OB91H.SAVESADDR.SAVESSEG); 


1* 
BREAK89 
IS AN 
4 WORD 
ESCAPE 
SEQUENCE 
TO ADDRESS 
2000H 


CONSISTING 
OF AN 
LPDI 
TP,SAVESADDR 
WITH 
SEGMENT· 


LOCATED 
AT OCOOH. 
. ~I 


1* 
BRKRTN 
IS 33 BYTES 
OF 
CODE 
THAT 
STORES 
ALL 
REGISTERS 


AS FOl LOWS. 


GA 
STORED 
AT 
PP 
.•. 239 


GB 
STORED 
AT PP 
.•. 242 


GC 
STORED 
AT 
PP 
+ 245 
BC STORED 
AT PP 
.•.24B 
IX STORED 
AT 
pp 
.•. 250 


CC 
STORED 
AT 
pp 
.•. 252 
!"ICSTORED 
AT 
pp 
.•.254 


*1 


DECLARE 
BRKRTN 
(33) BYTE 
AT 
(02COOH) 


i * 
02C OOH 
IS AC TUALL Y 
(SAVESADDR 
.•.(SHL( SAVESSEG )•4) )• AND 
SHOULD 


MATCH 
ADDRESS 
AND 
SEGMENT 
WHERE 
BREAK 
ROUTINE 
IS WANTED 
*1 


INITIAl.. 


,03H.09BH.OEFH.023H.09BH.OF2H.043H.09BH.OF5H.063H.OB7H. 
OFBH.OA3H.OB7H. 


OF~H.OC3H.OB7H.OFCH.OE3H.OB7H.OFEH.020H.048H) 


DECLARE 
PP POINTER; 


DECLARE 
PPP 
BASED 
PP 
(1) BYTE; 


STARTSPRGM 


PROCEDUR[(ONESTWO.PPP); 
DECLARE 
ONESTWO 
BYTE.PPP 
POINTER. 


WHERE 
BASED 
PPP 
(1) BYTE, 


WHERE(O) 
= STARTSBYTES(O); 


WHERE() 
p. 


WHERE(2) 
= STARTSBYTES(2); 


WHERE(]) 
~ STARTSBYTES(3); 


CPDAT«ONESTWO) 
* B) = 3. 


1* 
IF ONETWO 
= 
1 THEN 
OUTPUT 
TO PORT 
OFBH. 
IF ONETWO 


IS 0 THEN 
OUTPUT 
TO PORT 
OFAH 
*1 
OUTPUT(CHANATTEN 
.•.(ONETWO 
» 
= 0; 
CALL 
STRINGOUT(@CHANGIVEN); 
END. 


1* 
THIS 
PART 
OF 
THE 
PROGRAM 
ALLOWS 
THE 
USER 
TO DEFINE 
THE 


CP.PP 
OF EACH 
CHANNEL 
*1 


DECLARE 
BREAKOUT 
BASED 
ENDPOINTER 
(1) WORD; 


121 
122 
123 
124 
125 
126 
127 
128 
129 


130 


131 
132 


134 
2 
135 
2 
136 
2 


137 
1 
138 
2 
139 
2 
140 
2 


141 


142 
1 
143 
1 
144 
2 
145 
2 
146 
1 


147 
1 
148 
1 
149 
2 
150 
2 
151 
1 
152 
2 
153 
2 
154 
1 
155 
1 
156 
1 
157 
2 
158 
2 
159 
2 
160 
1 


CALL 
STRINGOUT(@GETCP); 
CP = GETADDRESS; 
CALL 
STRINGOUT(@GETPP); 
CALL 
STRINGOUT(@ONE); 
ONEPP 
= GETADDRESS; 
CALL 
STRINGOUT(@GETPP); 
CALL 
STRINGOUT(@TWO); 
TWOPP 
= GETADDRESS; 
OUTPUT 
(CHANATTEN) 
= 0; 
1* 
INITIALIZATION 
CA 
*1 


MAIN: 
CALL 
STRINGOUT(@CHANNUMBER); 
CHAR 
= CI; 
1* GET 
CHANNEL 
NUMBER 
*1 
IF 
(CHAR 
AND 
01H) <> 0 
1* 
CHECK 
BIT' ZERO 
TO DEFINE 
CHANNEL 
NUMBER 
*1 
THEN 
DO; 
CALL 
STRINGOUT(@ONE); 
ONE TWO 
CHANNEL$ONE; 
END; 
ELSE 
DO; 
CALL 
STRINGOUT(@TWO); 
ONETWO 
= CHANNEL$TWO; 
END; 


STARTPOINTER 
= GET ADDRESS; 
DO 
I = 0 TO 3; 
1* MOVE 
STARTING'ADDRESS 
INTO 
CP AREA 
*1 
STARTBYTES(I) 
= 
INPNTR(I); 
END; 
CALL 
STRINGOUT(@STOPADDR); 
1* 
GET 
STOP 
ADDRESS 
FROM 
USER 
*1 


ENDPOINTER 
= GET ADDRESS; 
DO 
I = 0 TO 3; 
1* MOVE 
CODE 
TO 
SAFE 
AREA 
*1 
SAVECODE(I) 
= BREAKOUT(I); 
END; 
DO 'I = 0 TO 
3; 
BREAKOUT(I) 
= BREAK89(I); 
1* MOVE 
ESCAPE 
SEQUENCE 
INTO PLACE 
*1 
END; 
CPDAT(I) 
= OFFH; 
1* SET 
CHANNEL 
ONE 
BUSY 
FLAG 
*1 
CPDAT(9) 
= OFFH; 
1* SET 
CHANNEL 
TWO 
BUSY 
FLAG 
*1 
DO CASE 
ONETWO; 
PP = ONEPP; 
PP = TWOPP; 
END; 
CALL 
START$PRGM(ONESTWO,PP); 
1* 
WAIT 
FOR 
ONE 
OF 
THE 
FOLLOWING 
I.CPDAT(I) 
= 0 
CHI 
NOT 
BUSY 
2.CPDAT(9) 
= 0 CH2 
NOT 
BUSY 
3. THE 
8251 
REC. 
BUFFER 
IS FULL 
BECAUSE 
USER 
HAS 
DEPRESSED 
A KEY 


PL/M-86 
COMPILER 


162 
2 
163 
1 


164 
1 
165 
2 
166 
2 
167 
3 
168 
3 


169 
2 


170 
2 
171 
2 
172 
3 
173 
3 


174 
2 


175 
2 
176 
3 
177 
3 


178 
2 
179 
3 
180 
3 


181 
3 


182 
3 


183 
3 
184 
2 
185 
2 
186 
2 
187 
1 


188 
2 


189 
2 
190 
2 
191 
2 
192 
2 
193 
2 


194 
2 
195 
2 
196 
2 


END, 
IF 
(INPUT(CRTSSTATUS) 
AND 
02H) (> 0 
THEN 
DO, 
CHAR 
= CI, 
DO 
I = 0 TO 3, 
BREAKOUT(I) 
·SAVECODE(!), 
END, 
1* 
IF ONETWO 
= 0 THEN 
PUT 
CHA 
HLT 
IN CPDAT(O) 
IF ONETWO 
1 THEN 
PUT 
CHA 
HLT 
IN CPDAT(8) 


CPDAT(ONESTWO 
*8) 
= ObH, 
1* 
IF ONETWO 
= 0 THEN 
OUTPUT 
TO PORT 
OFAH, 
IF ONETWO 
15 1 THEN 
OUTPUT 
TO PORT 
OFBH. 


OUTPUT(CHANATTEN 
+ ONETWOl 
= 0, 
DO 
I ~. 0 TO 
5, 
CALL 
TIME<lOO), 
END, 


IF CPDAT(SHL(ONETWO.3) 
+ 
1) (> 0 
1* 
CHECK 
BUSY 
FLAG 
*1 


THEN 
DO, 


CALL 
STRINGOUT(@ABORT), 


END, 
ELSE 
DO, 
CALL 
STRINGOUT(@ABORTAT), 
CALL 
CO(ASCII(SHR(PPP(2),4»); 
1* 
UPPER 
NIBBLE 
OF 
ADDR 
STORED 
BY HALT 
*1 


END, 
CPDAT(ONETWO 
* 8) = 3H, 
1* 
CA START 
IN CPDAT(O) 
OR CPDAT(8) 
*1 


GO 
TO MAIN, 
END, 
DO, 


CALL 
STRINGOUT(@BKREACHED), 


CALL 
STRINGOUT(@GASfRING), 


CALL 
CO(ASCII(SHR(PPP(241),4»), 
CALL 
HEXOUT(PPP(240», 
CALL 
HEXOUT(PPP(239», 
CALL 
TAGTEST(PPP(241», 


CALL 
STRINGOUT(@GBSTRING), 
CALL 
CO(ASCII(SHR(PPP(244).4»), 
CALL 
HEXOUT(PPP(243», 


PL/M-86 
COMPILER 
8089 
BREAKPOINT 
ROUTINE 


197 
2 
CALL 
HEXOUTIPPP(242), 


198 
2 
CALL 
TAGTEST(PPP(244», 


1<;9 
2 
CALL 
STRINGOUT(@GCSTRING), 


200 
2 
CALL 
CO(ASCII(SHR(PPP(247).4»), 


201 
2 
CALL 
HEXOUT<PPP(246», 


202 
2 
CALL 
HEXOUT(PPP(245», 
203 
;: 
CALL 
TAGTESTiPPP(247», 
., 


204 
2 
CALL 
STRINGOUT(@BCSTRING), 


205 
2 
CALL 
HEXOUT< 1'1'1' 
(249) ), 


2Gb 
2 
CALL 
BEXOUT (1'1'1' 
(248) ), - 


207 
2 
CALL 
STRINGOUT(@IXSTRING), 


208 
2 
CALL 
HEXOUT(PPP(251», 
209 
2 
CALL 
HEXOUT (1'1'1' 
(250) ), 


210 
2 
CALL 
STRINGOUT(@CCSTRING), 


211 
2 
GALL 
HEXOUT(PPP(253)), 


212 
2 
CALL 
HEXOUT(PPP(252)), 


213 
;;; 
CALL 
STRINGOUT(@MCSTRING), 


;:,4 
2 
CALL 
HEXOUT<PPP(255»), 
215 
2 
CALL 
HEXOUT.(PPP(254»), 


~16 
-, 
END. 
1* RESTOflE CODE 
TO ORIGINAL 
LOCATION 
*1 
217 
DO 
I ~ 0 TO 3, 
218 
2 
BREAKOUT< I) 
SAVECODE <I), 
219 
2 
END. 


220 
GO 
TO MAIN' 


221 
END. 


MODULE 
INFORMATION 


CODE 
AREA 
SIZEt 
-~ 0619H 
151010 


CONSTANT 
AREA 
SIZE 
01EFH 
4950 


VARIABLE 
AREA 
SIZE 
0020H 
320 


MAXIMUM 
STACK 
SIZE 
0014H 
200 


427 
LINES 
RElID 


o PROGRAI1 ERROR (S) 


END 
OF 
PL/M-8b 
COI1PILATION 


ISIS-II 
8089 
ASSEMBLER 
X004 
ASSEMBLY 
OF MODULE 
APSO_BREAKPOINT_ROUTINE 


OB~ECT 
MODULE 
PLACED 
IN 
:FO:BRKASM.OB~ 
ASSEMBLER 
INVOKED 
BY ASM89.4 
BRKASM.SRC 


0000 
OOEF 
00F2 
00F5 
00F8 
OOFA 
OOFC 
OOFE 
0100 


2000 
039B 
EF 
2003 
239B 
F2 
2006 
439B 
F5 
2009 
6387 
F8 


200C 
A387 
FA 
200F 
C387 
FC 
2012 
E387 
FE 


2015 
2048 


2017 


NAME 
APSO_BREAKPOINT_ROUTINE 
BRKPNT 
SEGMENT 
i************************************** 
BASIC 
8089 
BREAKPOINT 
ROUTINE 
BY 
~OHN 
ATWOOD 
REV 
3 
8/13/79 
INTEL 
CORPORATION 
i************************************** 


THE 
FOLLOWING 
CODE 
IS CONTAINED 
IN THE 
PL/M-86 
CONTROL 
PROGRAM(BREAK. 
89) 
AND 
IS ASSEMBLED 
HERE 
TO 
ILLUSTRATE 
HOW 
THE 
ESCAPE 
SE~UENCE 
AND 
SAVE 
ROUTINE 
CODE 
WAS 
GENERATED. 
TO USE 
THE 
8089 
BREAK- 


POINT 
PROGRAM, 
THIS 
ASM89 
PROGRAM 
WOULD 
NOT 
BE 
NEEDED. 
SAVE_ADDR 
IS THE 
SAME 
AS SAVE$ADDR 
IN THE 


BREAK. 89 PROGRAM. 


REGS 
PBLOCK: 
GASAV: 
GBSAV: 
GCSAV: 
BCSAV: 
IXSAV: 
CCSAV: 
MCSAV: 
REQS 


STRUC 
DS 
DS 
DS 
DS 
DS 
DS 
DS 
DS 
ENDS 


;PARAMETER 
BLOCK 
;GA AREA 
;QB AREA 
;GC AREA 
;BC AREA 
;IX AREA 
;CC AREA 
;MC AREA 


ORG 
SAVE_ADDR 


MOVP 
[PPl.QASAV.QA 
MOVP 
[PPl.GBSAV.QB 
MOVP 
[PPl.GCSAV.GC 
MOV 
[PPl.BCSAV.BC 
MOV 
[PPl. IXSAV, IX 
MOV 
[PPl.CCSAV.CC 
MOV 
[PPl.MCSAV,MC 


;SAVE 
QA 
;SAVE 
QB 
;SAVE 
GC 
;SAVE 
BC 
;SAVE 
IX 
;SAVE 
CC 
;SAVE 
Me 


HL T 
;STOP 
THIS 
CHANNEL. 


;CLEAR 
BUSY 
FLAQ. 
i**************************************** 
BRKPNT 
ENDS 
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Our thanks to Jim Nadir, the author of this application note. Jim is a 
design engineer in the microprocessors' and peripherals ope'ration 
division, 
Please direct any technical questions you may have to 
your local Intel FAE (Field Application Engineer). 


increasing 
In popUiarny. 
Ine penormance 
Improve:- 
ments and cost reductions afforded by LSI technology 
have spurred on the design motivation of using multiple 
processors 
to 
meet 
system 
real·time 
performance 
requirements. The desire for improved system real-time 
response, system reliability and modularity has made 
multiprocessing 
techniques an increasingly attractive 
alternative to the system design engineer; techniques 
that are characterized as having more than one micro· 
processor share common resources, such as memory 
and 1/0, over a common multiple processor bus. 


This type of design concept allows the system designer 
to partition 
overall system functions 
into tasks that 
each of several processors can handle individually to 
increase system performance and throughput. But, how 
should a designer proceed to implement a multiproc- 
essing system? Should he design his own? If so, how 
are the microprocessors synchronized to avoid conten- 
tion problems? The designer could put them all in phase 
using one clock for all the microprocessors. This may 
work, until 
the physical 
dimensions 
of the system 
become large. When this occurs, the designer is faced 
with many problems, like clock skew (resulting in bus 
spec violations) and duty cycle variations. 


A better approach to implementing 
a multiprocessor 
system is not to have a common processor clock, but 
allow 
each processor to work asynchronously 
with 
respect to each other. The microprocessor requests to 
use the multiple processor bus could then be synchro- 
nized to a high frequency external clock which will per· 
mlt duty cycle and phase shift variations. This type of 
approach has the benefit of allowing modularity of hard- 
ware. When new system functions are desired, more 
processing 
power can be added without 
impacting 
existing processor task partitioning. 


One approach to implement this asynchronous process· 
Ing structure would be to haveall the bus requests enter 
a priority encoder which samples its Inputs as a func- 
tion of the higher frequency "bus clock". The inputs 
would arrive asynchronously to the priority encoder and 
would be resolved by the priority encoder structure as to 
which microprocessor would be granted the bus. An· 
other approach, that used by Intel, is rather than allow- 
Ing the requests to arrive asynchronously with respect 
to one another at the priority encoder, the bus requests 
are synchronized first to an external high frequency bus 
clock and then sent to the priority encoder to be re- 
solved. In this way, the resolving circuitry common to all 
microprocessors Is kept at a minimum. Overall system 
reliability Is improved in the sense that should a circuit 
which serves to synchronize the processor's request 
(which is now located on the same card as the micro- 
processor itself) fail, it is only necessary to remove that 
card from the system and the rest of the system will 
continue to function. Whereas in the other approach, 
should the synchronizing 
mechanism fail, the whole 


. 
.. 
- 
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mechanism to the processor permits processor control 
over that mechanism, thereby permitting system flexi- 
bility (as will be shown) which could not be reasonably 
obtained by any other approach. 


This synchronizing 
or arbitrating 
function 
was inte- 
grated into the 8289, a custom arbitration unit for the 
8086, 8088, and 8089 processors. This note basically 
describes the 8289 arbitration 
unit, illustrates 
its dif- 
ferent modes of operation and hardware connect in a 
multiprocessor system. Related and useful documents 
are: 8086 user's manual, 8289data sheet, Article Reprint 
-55: 
Design 
Motivations 
for 
Multiple 
Processor 
Microcomputer Systems (which discusses implement- 
ing a semaphore with the MULTIBUS™)and Application 
Note 28A, Intel MULTIBUS™interfacing. 


The 8289 Bus Arbiter operates in conjunction 
with the 
8288 Bus Controller to interface an 8086, 8088, or 8089 
processor to a multi-master system bus (the 8289 is 
used as a general bus arbitration unit). The processor is 
unaware of the arbiter's 
existence 
and issues com· 
mands as though it has exclusive use of the system bus. 
If the processor dpes not have the use of the multi· 
master system bus, the bus arbiter prevents the bus 
controller, the data transceivers and the address latches 
from accessing the system bus (i.e., all bus driver out· 
puts are forced into the high impedance state). Since 
the command was not issued, a transfer acknowledge 
(XACK)will not be returned and the processor will enter 
into wait states. Transfer acknowledges 
are signals 
returned from the addressed resource to indicate to the 
processor that the transfer is complete. This signal is 
typically used to control the ready inputs of the clock 
generator. The processor will remain in wait until the 
bus arbiter acquires the use of the multi·master system 
bus, whereupon the bus arbiter will allow the bus con· 
troller, the data transceivers and the address latches to 
access the system bus. Once the command has been 
issued and a data transfer has taken place, a transfer 
acknowledge (XACK) is returned to the processor. The 
processor then completes its transfer cycle. Thus, the 
arbiter serves to multiplex a processor (or bus master) 
onto a multi-master system bus and avoid contention 
problems between bus masters. 


Since there can be many bus masters on a multi-master 
system bus, some means of resolving priority between 
bus masters simultaneously requesting the bus must be 
provided. The 8289 Bus Arbiter 
provides for several 
resolving techniques. All the techniques are based on a 
priority concept that at a given time one bus master will 
have priority 
above all the rest. These techniques in- 
clude the parallel priority resolving techniques, serial 
priority resolving and rotating priority techniques. 


A parallel priority resolving technique has a separate 
bus request (BREQ) line for each arbiter on the multi- 
master bus (see Figure 1). Each BREQ line enters into a 
priority encoder which generates the binary address of 
the highest priority BREQ line which is active at the 
inputs. The output 
binary address is decoded by a 


decoder to select the corresponding BPRN(bus priority 
in) line to be returned to the highest priority requesting 
arbiter. The arbiter receiving priority (BPRN active low) 
then allows its associated bus master onto the multi- 
master system bus as soon as it becomes available (i.e., 
it is no longer busy). When one bus arbiter gains priority 
over another arbiter, it cannot immediately seize the 
bus, it must wait until the present bus occupant com- 


pletes its transfer cycle. Upon completing its transfer 
cycle, the present bus occupant recognizes that it no 
longer has priority and surrenders the bus, releasing 
BUSY. BUSY is an active low OR-ti\ld signal line which 
goes to every bus arbiter on the system bus. When 
BUSY goes high, the arbiter which presently has bus 
priority (BPRN active low) then seizes the bus and pulls 
BUSY low to keep other arbiters off 
the bus. (See 


waveform timing diagram, Figure 2.) Note that all multi- 
master system bus transactions are synchronized to the 
bus clock (BCLK). This allows for the parallel priority 
resolving 
circuitry 
or, 
any 
other 
priority 
resolving 
scheme employed, time to settle and make a correct 
decision. 
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BUS. 
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PRIORITY 
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A serial priority resolving technique eliminates the need 
for the priority encoder-decoder arrangement by daisy- 
chaining the bus arbiters together. This is accomplished 
by connecting 
the higher priority 
bus arbiter's 
BPRO 
(bus priority out) output to the BPRN of the next lower 
priority (see Figure 3). The highest priority bus arbiter 
would have its BPRN line grounded, signifying to the ar- 
biter that it always has highest priority when requesting 
the bus. 


THE 
NUMBER 
OF ARBITERS 
THAT 
MAY 
BE DAISY·CHAINED 
TOGETHER 
IN 
THE 
SERIAL 
PRIORITY 
RESOLVING 
TECH- 


NIQUE 
1$ A FUNCTION 
OF !C[j( AND 
THE 
PROPAGATION 
DELAY 
FROM 
ARBITER 
TO ARBITER. 
NORMAllY, 
AT 10 MHz 
ONLY 
3 ARBITERS 
MAY 
BE DAISY-CHAINED. 
SEe 
TEXT. 


A rotating priority resolving technique arrangement is 
similar to that of the parallel priority resolving technique 
except that priority is dynamically reassigned. The pri- 
ority encoder is replaced by a more complex circuit 
which rotates priority between requesting arbiters, thus 
guaranteeing 
each arbiter 
equal time 
on the multi- 
master system bus. 


There are advantages and disadvantages for each of the 
techniques 
described above. The rotating 
priority 
re- 
solving technique requires an extensive amount of logic 
to implement, while the serial technique can accommo- 
date only a limited number of bus arbiters before the 
daisy-chain propagation delay exceeds the multi-master 
system bus clock (BCl.,K).The parallel priority resolving 
technique is, in general, the best compromise. It allows 
for many arbiters to be present on the bus while not 
requiring much logic to implement. 


Whatever 
resolving 
technique 
is 
chosen, 
it 
is 
the 
highest priority bus arbiter requesting use of the multi- 
master system bus which obtains the bus. Exceptions 
do exist with the 8289 Bus Arbiter where a lower priority 
arbiter may take away the bus from a higher priority ar- 
biter without the need for any additional external logic. 
This is accomplished through the use of the CBRa pin, 
discussed in a later section. 


MULTI-MASTER SYSTEM BUS SURRENDER AND 
REQUEST 


The 8289 Bus Arbiter provides an intelligent interface to 
allow a processor or bus master of the 8086 family to ac- 
cess a multi-master system bus. The arbiter directs the 
processor onto the bus and allows 
both higher and 
lower priority 
bus masters to acquire the bus. Higher 
priority masters obtain the bus when the present bus 
master utilizing the bus completes its transfer cycle (in- 
cluding 
hold time). Lower priority bus masters obtain 
the 
bus when a higher 
priority 
bus 
master 
is 
not 
accessing the system bus and a lower priority arbiter 
has pulled 
CBRa 
low. This signifies 
to the arbiter 
presently holding the multi-processor bus that a lower 
priority arbiter would like to acquire the bus when it is 
not being used. A strapping option (ANYRaST) allows 
the multi-master system bus to be surrendered to any 
bus master requesting the bus, regardless of its priority. 
If there are no other bus masters requesting the bus, the 
arbiter maintains the bus as 10l)gas its associated bus 
master has not entered the HALT state. The 8289 Bus 
Arbiter will not voluntarily 
surrender the system bus and 
has to be forced off by another bus master. An excep- 
tion to this can be obtained by strapping CBRa low and 
ANYRaST 
high. In this 
configuration 
the 8289 
will 
release the bus after each transfer cycle. 


How the 8289 Bus Arbiter is configured determines the 
manner in which the arbiter requests and surrenders the 
system bus. If the arbiter is configured to operate with a 
processor which 
has access to both a multi-master 
system bus and a resident bus, the arbiter requests the 
use of the multi-master system bus only for system bus 
accesses (i.e., it is a function of the SYSB/RESB input 
pin). While the processor is accessing the resident bus, 
the arbiter permits a lower priority bus master to seize 
the system bus via CBR?l, since it is not being used. A 
processor configuration with both an I/O peripheral bus 
and a system bus behaves similarly. If the processor is 
accessing the peripheral bus, the arbiter permits the 
surrendering of the multi-master system bus to a lower 
priority 
bus master. To request the use of the multi- 


master system 
bus, the 
processor 
must perform 
a 
system memory access (as opposed to an I/O access). 


The arbiter decodes the processor status lines to deter- 
mine what type of access is being performed and be- 
haves correspondingly. 
For 
simpler 
system 
config- 
urations, such as a processor which accesses only a 
multi-master system bus, the arbiter requests the use of 
the system bus when it' detects 
the status 
lines in- 
itiating a transfer cycle. The decoding of these status 
lines can be referenced in the 8086, 8088 (non-I/O proc- 
essor) data sheets or the 8089 (I/O processor) data 
sheet. 


There is one condition common to all system configura- 
tions where the multi-master system bus is surrendered 
to a lower priority bus master requesting the bus by pull- 
ing CBRa low. This is the idle or inactive state (TI)which 
is unique to the 8086 and 8088 processor family. This TI 
state comes about due to the processor's 
ability 
to 
fetch instructions 
in advance and store them internally 
for quick access. The size of the internal queue was op- 
timized so that the processor would make the most ef- 


fective use of its resources and be slightly execution 
bound. Since the processor can fetch code faster than it 
can execute it, it will fill to capacity its internal storage 
queue. When this occurs, the processor will enter into 
idle or inactive states (TI) until the processor has ex- 
ecuted some of the code in the storage queue. Once this 
occurs, the processor will exit the TI state and again 
start code fetching. Between entering into and exiting 
from the TI state an indeterminate number of TI states 
can occur during which the bus arbiter permits the sur- 
rendering of the multi-master system bus to a lower 
priority 
bus 
master. 
As 
noted 
earlier 
and 
worth 


repeating here, once the 8289 Bus Arbiter acquires the 
use of the multi-master system it will not voluntarily sur- 
render the bus and has to be forced off by another bus 
master. This will be discussed in more detail later. 


Two other signals, LOCK and CRQLCK (Figure 4), lend 
to the flexibility 
of the 8289 Bus Arbiter within system 


configurations. LOCK is a signal generated by the proc- 
essor to prevent the bus arbiter from surrendering the 
multi·master system bus to any other bus master, either 
higher or lower priority. CRQLCK(common request lock) 
serves to prevent the bus arbiter from surrendering the 
bus to a lower priority bus master when conditions war- 
rant 
it. LOCK 
is 
used 
for 
implementing 
software 


semaphores for critical 
code sections and real time 


critical 
events 
(such 
as 
refreshing 
or 
hard 
disk 


transfers). 


8289 BUS ARBITER INTERFACING TO THE 8288 
BUS CONTROLLER 


Once the 8289 Bus Arbiter determines to either allow its 
associated processor onto the multi-master system bus 
or to surrender the bus, it must guarantee that com· 
mand setup and hold times are not violated. This is a 
two part problem. One, guaranteeing hold time and two, 
guaranteeing setup time. The 8288 Bus Controller per· 
forms the actual task of establishing setup time, while 
the 8289 Bus Arbiter establishes hold time (see Figure 


5). 


The 8289 Bus Arbiter communicates with the 8288 Bus 
Controller via the AEN line. When the arbiter allows its 
associated processor access to the multi-master sys· 
tem bus, it activates AEN. AEN immediately enables the 
address latches and data transceivers. The bus con- 
troller responds to AEN by bringing its command output 
buffers out of high impedance state but keeping all 
commands disqualified 
until command setup time is 


established. 
Once established, 
the appropriate 
com- 


mand is then issued. Am is brought to the false state 
after the command hold time has been established by 
the arbiter when surrendering the bus. 


02 
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LOCK 
TIMING 


THE ONLY CRITICAL 
LOCK TIMING 
IS THAT SHOWN ABOVE. LOCK MUST BE 
ACTIVATED 
NO SOONER THAN 20 ns INTO 01 AND NO LATER THAN 40 ns 


PRIOR TO THE END OF 02. LOCK INACTIVE 
HAS NO CRITICAL 
TIMING 
AND 


CAN BE ASYNCHRONOUS. 


CRQLCK 
HAS NO CRITICAL 
TIMING 
AND IS CONSIDERED 
AS AN 
ASYNCHRONOUS 
INPUT SIGNAL. 


COMMAND 
ACTIVE 


(8288) 


"ADDRESS 
(8282, 8283) 
CONTROLLED 
BY 
AEN FROM 8289 


"ADDRESSES 
ARE ACTIVATED 
IMMEDIATELY 
WHILE COMMAND 
IS DELAY 


TO ESTABLISH 
SETUP TIME REQUIREMENTS. 


"THE 
8289 ARBITER 
INTERNALLY 
TRACKS THE PROCESSOR 
CYCLE TO 
ESTABLISH 
THE PROPER AMOUNT 
OF HOLD TIME AFTER THE COMMAND 
HAS GONE 
INACTIVE. 


8289 
BUS ARBITER 
INTERNAL 
ARCHITECTURE 


A block diagram of the internal architecture of the 8289 
Bus Arbiter is shown in Figure 6. It is useful to under- 
stand this block diagram when discussing the different 
modes of the 8289 and their impact on processor bus 
operations; however, you may want to skip this section 
to "8086 family processor types and system configura- 
tions" 
and return to it afterwards, as this section ad· 


dresses the very involved reader. The front end state 
generator (FETG) and the back end state generator 
(BETG)allow the arbiter to track the processor cycle. An 
examination of an 8086 family processor state timings 
show that all command and control signals are issued in 
states T1 and T2 while being terminated in states T3 and 
T4, with an indeterminate number of wait states (Tw)oc- 
curring in between. Note further, that an indeterminate 
number of idle or inactive states can occur immediately 
proceeding and following a given transfer cycle. Since 
an indeterminate number of wait states can occur two 
state generators are required; one to generate co'ntrol 
signals (the FETG)and one to terminate control signals 
(the BETG). The FETG is triggered into operation when 
the processor activates the status lines. The FETG is 
reset and the BETG is triggered into operation by the 
status lines going to the passive condition. The BETG is 
reset when the status lines again go active. 


It is necessary for the 8289 Bus Arbiter to track the proc- 
essor in order that it is properly able to determine where 
and when to request or surrender the use of the multi- 
master system bus. In system configurations 
which ac· 


cess a resident bus, the use of the multi-master system 


bus is requested later in order to allow time for the 
SYSB/RESB input to become valid. For systems which 
access a peripheral bus, the arbiter issues a request for 
the system bus only for m.emorytransfer cycles which it 
decodes from the stall:ls lineS (and time must be al- 
lowed for the status lines to become valid and then de- 
coded). In a system which accesses only a multi-master 
system bus, a request is made as soon as the arbiter 
detects an active-going transition 
on the processor's 


status 
lines. Thus, when 
the 
processor 
initiates 
a 


transfer cycle, the FETG is triggered into operation and, 
depending upon what mode the arbiter is configured in, 
the STATUS & MODE DECODE circuitry 
initiates a re- 


quest for the system bus at the appropriate time. The re- 
quest enters the BREQ SET circuitry where it is then 
synchronized 
to the multi·master 
system 
bus clock 


(BCLK) by the PROCESSOR SYNCHRONIZATION cir- 
cuitry. * Once synchronized, the multi-master 
system 


bus interface circuitry issues a BREQ.When the priority 
resolving circuitry returns a BPRN (bus priority in), the 
PROCESSORSYNCHRONIZATION circuitry seizes the 
bus the next time it becomes available (i.e., BUSY goes 
high) by pUllin~USY 
low one BCLK after it goes high 


and enables AEN. (See waveform timing 
diagram in 


Figure 2). Once the arbiter acquires the use of the 
system bus and a data exchange has taken place (a 
transfer acknowledge, XACK, was returned to the proc- 
essor), the processor status lines go passive and the 


• Due ,to the asynchronous 
nature of processor 
trasosfer 
request to the 
multi-master 
system bus clock, it is necessary to synchronize 
the proc- 
essor's transfer request to BCLK. 


PROCESSOR 
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CIRCUITRY 
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BUS 
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CIRCUITRY 


BREQ 
RESET 
WINDOW 


master bus, i.e., the bus arbiter lost priority to a higher 
bus master or the processor has entered into TI states 
and CBRQ is pulled low, etc. If such is the case, the 
BREQ RESET DECODER initiates a bus surrender reo 
quest. The bus surrender request is synchronized by the 
MMS BUSSYNCHRONIZATIONCIRCUITRYto the proc- 
essor clock. The MMS BUS SYNCHRONIZATION CIR- 
CUITRY instructs the bus controller interface circuitry 
to make AEN go false and resets the BREQ SET cir- 
cuitry. Resetting the BREQ SET circuitry will cause its 
output to go false and be synchronized by the processor 
synchronization, 
eventually 
instructing 
the 
MULTI- 
MASTER SYSTEM BUS INTERFACE circuitry to reset 
BREQ. In the event that a lower priority arbiter has 
caused the arbiter to surrender the bus, it is necessary 
that BREQ be reset. Resetting BREQallows the priority 
resolving circuitry to generate BPRNto the next highest 
priority 
bus master requesting the bus. The BREQ 
RESETWINDOW circuitry provides a 'window' wherein 
the arbiter allows the multi-master system bus to be sur- 
rendered and serves as part of the MMS bus-processor 
synchronization circuitry. 


8086 FAMILY PROCESSORTYPESAND 
SYSTEMCONFIGURATIONS 


There are two types of processors in the 8086 family - 
an I/O processor (the 8089 lOP)and a non-I/O processor 
(the 8086 and 8088 CPUs).Consequently, there are two 
basic operating modes in the 8289 Bus Arbiter. One, the 
lOB (I/O peripheral bus) mode, permits the processor ac- 
cess to both an I/O peripheral bus and a multi-master 
system bus. The second, the RESB(residen\ bus) mode, 
permits the processor to communicate over both a resi- 
dent bus and a multi-master system bus. Eventhough it 
is intended for the arbiter to be configured in the lOB 
mode when interfacing to an I/O processor and for it to 
be in the RESBmode when interfacing to a non-I/O proc- 
essor, it is quite possible for the reverseto be true. That 
is, it is possible for a non-I/O processor to have access 
to an I/O peripheral bus or for an I/O processor to have 
access to a resident bus as well as access to a multi- 
master system bus. The lOB strapping 
option 
con- 
figures the 8289 Bus Arbiter into the lOB mode and 
RESB strapping option configures it into the resident 
bus mode. If both strapping options are strapped false, 
a third mode of operation is created, the single bus 
mode, in which the arbiter interfaces the processor to a 
multi-master system bus only. With both options strap- 
ped true, the arbiter interfaces the processor to a multi- 
master system bus, a resident bus and an I/O bus. 


To better understand the 8289 Bus Arbiter, each of the 
operating modes, along with their respective timings, 
are examined by means of examples. The simplest con- 
figuration, the Single Bus Configuration, (both lOB and 
RESB strapped inactive) will be considered first, fol- 
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two multi-master system buses. This particular con- 
figuration is briefly mentioned because, as will be seen, 
it is simply an extension of the resident bus configura· 
tion. When discussing the Single Bus Configuration, 
processor/arbiter, arbiter/system 
bus and internal ar· 


biter, considerations are made resulting in a table that il· 
lustrates overhead in requesting the system bus. As this 
applies to the other 8289configurations, only additional 
considerations will be given. A summary of when to use 
the different configurations is given at the end. 


8289 SINGLE BUS INTERFACE 


Figure 7 shows a block diagram of a bus master which 
has to interface only to a system bus - 
preferably the 


MULTIBUS - 
where there exists more than one bus 


master. In later configurations, it will be shown how the 
processor can be made to interface with more than one 
bus. Since the processor has only to interface with one 
bus, this configuration is called "single". 


Connecting the 8289 Bus Arbiter to the processor is as 
simple as it was to connect the 8288 Bus Controller. 
Namely, the three status 
lines, SO, S1, and S2 are 


directly connected from the processor to the arbiter. 
The clock line from the 8284Clock Generator is brought 
down and connected. (Note that both the 8288 Bus Con- 
troller and the 8289 Bus Arbiter are connected to the 
same clock, CLK and not the peripheral clock, PCLK as 
the 8086 processor.) From the arbiter, AEN is con- 
nected to the bus controller and to the clock generator. 
The lOB pin on the arbiter is strapped high and on the 
controller the lOB pin is strapped low. In addition, the 
RESB pin on the arbiter is strapped low, finishing the 
processor interface. 


Some flexibility 
exists with the MULTIBUS or multi- 
master system bus interface. The system designer must 
first decide upon the type of priority resolving scheme 
to be employed, whether it is to be the serial, parallel, or 
rotating 
priority 
scheme. A rotating 
priority 
scheme 


would be employed where the system designer would 
want to guarantee that every bus master on the bus 
would be given time on the bus. In the serial and parallel 
schemes, the possibility exists that the lowest assigned 
priority bus master may not acquire the bus for long 
periods of time. This occurs because priority is perma- 
nently assigned and if bus demand is high by the higher 
assigned priorities, then the lower priorities must wait. 
In most cases, this situation is acceptable because the 
highest priority is assigned to the bus master that can- 
not wait. Highest priority is usually assigned to DMA 
type devices where service requirements occur in real 
time. CPUs are assigned the lower priorities. For the 
purpose of this discussion, the parallel priority scheme 
will be used with brief reference to the serial priority 
scheme. 
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Figure 8 shows how a typical multi-processing system 
might be configured with the 8289 
in the Single Bus 


mode. In the system there are three bus masters, each 
having the assigned priority as indicated-priority 
1 


being the highest and priority 3 being the lowest. Prior- 
ity is established using the parallel priority scheme 
(ignore the dotted signal interconnect for the moment). 
Each bus arbiter monitors its associated processor and 
issues a bus request (BREa) whenever its processor 
wants the bus. A common clocking signal (BCLK) runs 
to each of the arbiters in the system. It is from the fail- 
ing edge of this clock that all bus requests are issued. 
Since all bus requests are made on the same clock 
edge, a valid priority can be established by the priority 
resolving circuitry by the next falling BCLK edge. Note 
that all multi-master system bus (MULTIBUS)input sig- 
nals are considered to be valid at the falling edge of 
BCLK. And that all multi-master system bus output 
signals are issued from the falling edge of BCLK. With 
the parallel resolving module, arbiters 2 and 3 would 
issue their respective BREas (Figure 9) on the falling 
~dge of BCLK 1, as shown. The outputs (BPRN 1, BPRN 
2, and BPRN 3) of the priority encoder-decoder arrange- 
ment change to reflect their new input conditions and 
need to be valid early enough in front of BCLK 2 to 
guarantee the arbiter's setup time requirements. Since 
arbiter 2 at the time is the highest priority arbiter re- 
.9.uestingthe bus, bus priority is given to arbiter 2 (BPRN 
2 goes low), and since the bus was not busy (BUSY is 
high) at the time priority was granted to arbiter 2, arbiter 
2 pulls BUSY inactive on BCLK 2, thereby seizing the 
bus and excluding all other arbiters access to the bus. 
Once the bus is seized, arbiter 2 activates its AEN. AEN 
going low directly enables the 8283 address latches and 


wakes up the 8288 
Bus Controller. The bus controller 


enables the 8287 transceivers, waits until the address to 
command setup time has been established, and then 
enables its command drivers onto the bus. 


If the serial priority resolving mode was used instead, 
much of the events that happened for the parallel prior- 
ity resolving mode would be the same except, of course, 
there would be no parallel priority resolving module. In- 
stead, the system would be connected as indicated in 
Figure 8 by the dotted signal lines connecting the BPRO 
of one arbiter to BPRNof the next lower priority arbiter. 


The BREa lines would be disconnected and the priority 
encoder-decoder arrangement removed. This arrange- 
ment is simpler than the parallel priority arrangement 
except that the daisy·chain propagation delay of the 
highest priority bus arbiter's BPROto the lowest priority 
bus arbiter's BPRN, including setup time requirement 
(BPRN to BLCK), cannot exceed the BCLK period. In 
short, this means there are only so many arbiters that 
can be daisy-chained for a given BCLK frequency. Of 
course, the lower the BCLK frequency, the more arbiters 
can be daisy-chained. The maximum BCLK frequency is 
specified at 10 MHz, which would allow for three 8289 
arbiters to be daisy-chained. In general, the number of 
arbiters that can be connected in the serial daisy-chain 
configuration 
can be determined from the following 


equation: 
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Returning 
to Figure 9, it can be seen that K BCLKs 
later, 


arbiter 
1 has decided 
to request 
the bus and its BREQ, 


BREQ 1, has gone low. Since arbiter 
1 is of higher 
priori- 
ty than arbiter 
2, which 
presently 
has the bus, bus priori- 
ty is reassigned 
by the 
priority 
module 
(or the 
daisy- 


chain approach 
in the serial priority) 
to arbiter 
1. BPRN 1 
goes 
low and BPRN 2 now goes 
high (BPRN 3 remains 
high, 
even 
though 
decoding 
can 
cause 
it 
to 
glitch 
momentarily). 
The loss of priority 
instructs 
arbiter 
2 that 
a higher 
priority 
arbiter 
wants 
the bus and that 
it is to 
release 
the bus as soon 
as its present 
transfer 
cycle 
is 
done. 
Since 
arbiter 
2 cannot 
immediately 
release 
the 
bus, arbiter 
1 must wait. In the particular 
case illustrated 
in Figure 9, arbiter 
2 releases 
the bus (allows 
BUSY to go 
high) on clock 
edge M, and on clock 
edge M + 1, arbiter 
1 
now seizes 
the bus, pulling 
BUSY low. Arbiter 
1 is the 
highest 
priority 
arbiter 
in the system 
and it now has the 
bus. Arbiters 
2 and 3 still 
want the bus (their 
BREQs are 
both 
low). 


How quickly 
arbiter 
1 can acquire 
the bus is dependent 
upon the configuration 
and strapping 
options 
of the ar- 


biter 
it is trying 
to acquire 
it from. 
For example, 
if the 
LOCK input to arbiter 
2 was active (low) at the time, then 
arbiter 
1, even though 
it was of higher 
priority, 
would 
not 
have acquired 
the 
bus 
until 
after 
LOCK 
was 
released 
(goes high). Effectively, 
LOCK locks 
the arbiter 
onto the 
bus 
once 
the 
bus 
has 
been 
acquired. 
LOCK 
will 
not 
force 
another 
arbiter 
to release 
the bus any sooner, 
it 


just 
prevents 
the bus from 
being 
given away no matter 
what 
the priority 
of the other 
arbiter. 
Another 
factor 
to 
be considered 
is where 
in the transfer 
cycle 
is the proc- 
essor 
when the arbiter 
is instructed 
to give up the bus. 


Obviously, 
if 
the 
cycle 
had 
just 
started, 
it 
will 
take 
longer 
for the bus to be released 
than 
if the cycle 
was 
just 
ending. 
Another 
factor 
to be included 
in this 
con- 
sideration 
is the phase 
relationship 
of the processor's 
clock 
(CLK) to the bus clock 
(BCLK). This relationship 
is 
examined 
in more detail 
later on. Table 
1 lists 
the time 


requirements for various arbiter actions such as bus ac- 
quisition 
and bus 
release (under lOCK 
and other 
circumstances) taking into account the phase relation- 
ships between ClK and BClK. 


Bus Request (BREQI) 
Mode 
Delay 
Deley 
(Mex) 
(Mln) 


Status-BREOI 
Single 
2 BCLKs 
1 BCLK 


Status-BREOI 
lOB 
2 BCLKs+ 
1 BCLK+ 


-1 
CLK' 
- 
'12 CLK' 


Status-BREOI 
RESB 
2 BCLKs+ 
1 BCLK+ 


-2 
CLKst 
-1'/2 
CLKst 


Status- 
BREOI 
IOB·RESB 
2 BCLKs+ 
1 BCLK+ 


-2 
CLKst 
1'12 
CLKst 


'Request 
originates 
off of <,1>2 
of T1 and BREOI occurs 
1 BCLK(min) 


to 2 BCLKs (max) thereafter. 
Depending 
upon where status occurs 
with respect to clock determines 
how long a time exists between 
status and 4>2of T1, and is anywhere 
from 
1/2 eLK (min) to 1 elK 
(max). 


tRequest 
originates 
off ofT2·<,I>l and BREOI occurs 
1 BCLK(min) 
to 
2 BCLKs (max) thereafter. 
The same reasoning as used in the lOB 
mode is valid here: 


Bus Releese (BREQf) 
Mode 
Delay 
Delay 
(Max) 
(Mln) 


Higher Priority (BPRN I) 
All 
2 CLKs+ 
1 CLK+ 
2 BCLKs 
1 BCLK 


Lower Priority 
(CBRO I) 
All 
2 CLKs+ 
1 CLK+ 
2 BCLKs 
1 BCLK 


Surrender 
occurs once the prqper surrender conditions 
exist. 


One signal which has been basically ignored to this 
point is CBRQ. CBRQ, like BUSY, is an open-collector 
signal from the arbiter which is tied to the CBRQ signals 
of the other arbiters and to a pull·up resistor (see Figure 
8). CBRQ is both an input and an output. As an output, 
CBRQ serves to instruct the arbiter presently on the bus 
that another arbiter wishes to acquire the bus. As an in- 
put, CBRQ serves to instruct the arbiter presently on the 
bus that another arbiter wants the bus. CBRQ is an input 
or output, dependent on whether the arbiter is on the 
bus or not (respectively), and is issued as a function of 
BREQ.Thus, a lower priority arbiter requesting the bus 
already controlled by a higher priority arbiter will pull 
CBRQ low, as well as BREQ. Even a higher priority ar- 
biter will pull CBRQ low until it acquires the bus. Note, 
however, that the higher priority arbiter will acquire the 
bus through the reassignment of priorities - 
it being 
given priority and the other arbiter presently on the bus 
losing it. In effect, CBRQ serves to notify the arbiter that 
an arbiter of lower priority wants the bus. 


If the arbiter presently on the bus is configured to react 
to CBRQ and the proper surrender conditions exist, the 
bus is released. When releasing the bus, the arbiter also 
turns off its BREQ (BREQ goes high) in order to allow 
priority to be established to the next lower arbiter reo 
questing the bus. Such is the case shown in Figure 9. 
Whereas it was assumed that the proper surrender con· 
ditions did not exist for arbiter 2 when it had the bus, it 
is assumed that the proper conditions do exist during 
the time that arbiter 1 has the bus. Arbiter 2 had to give 
up the bus because an arbiter of higher priority was re- 


questing it. Arbiter 1 surrenders the bus because the 
proper surrender conditions exist and a lower priority ar- 
biter requested the bus by pulling CBRQ low. This is an 
assumed condition 
which is not otherwise shown in 
Figure 9. This is not an unrealistic condition. Normally, 
a higher priority arbiter will acquire the bus through the 
reassignment of priorities, while lower priority arbiters 
acquire the bus through CBRQ. 


Digressing for a moment, the 8289 Bus Arbiter will not 
voluntarily 
surrender the bus (except when the proc- 
essor halts execution). As a result, it has to be forced off 
the bus. The 8289 Bus Arbiter does not generate a BREQ 
for each cycle. It generates a BREQ once and then 
hangs onto the bus. To do otherwise would require that 
BREQ be dropped (go high) after each transfer cycle so 
that if it did need to do another transfer cycle, another 
arbiter would automatically 
be assigned priority. This 
approach, however, entails certain overhead. Command 
to address setup and hold time must be prefixed and ap- 
pended to each transfer 
cycle. 
Each transfer 
cycle 
would be characterized by first acquiring the bus, then 
establishing 
the setup time requirements, finally per- 
forming the transfer cycle, establishing the hold time re- 
quirements, and then releasing the bus (see Figure 10). 
If another transfer cycle was to immediately follow and 
if the arbiter still had priority, then the whole above pro· 
cedure would be repeated. The end result would be 
wasted time as hold times following setup times (see 
Figure 10A).The approach taken by the 8289 Bus Arbiter 
of having to be forced off the bus, even when it is not 
using the bus (i.e., forced off by a lower priority arbiter), 
provides for greater bus efficiency. A lower priority ar· 
biter h-avingto force off another arbiter that is not using 
the bus but just hanging on to it, may not seem very effi· 
cient. In actuality it is a good trade-off. In many multi· 
master systems some bus masters occasionally 
de- 


mand the bus, while others demand the bus constantly. 
The bus master which constantly demands the bus may 
momentarily need not to access the bus. Why should 
that arbiter surrender the bus when chances are that the 
other bus masters which occasionally access the bus 
don't want it at the time? If it doesn't give up the bus, 
then it can momentarily cease access to the bus and 
then continue, without any performance penalty of hav- 
ing to reestablish control of the bus. The greater bus ef- 
ficiency that it affords is well worth the added complexi· 
ty (Figure 10B). 


Returning to Figure 9, the combination of the proper sur- 
render conditions existing and CBRQ being low, forced 
the higher priority arbiter, arbiter 1, off the bus. Arbiter 
2, being of next higher priority and wanting the bus, ac- 
quired the bus on clock edge N + 1. If arbiter 1 decides 
to re·access the bus, it would reacquire the bus through 
the reassignment of priorities. 
This is not the case 
shown in Figure 9. Arbiter 1 has decided that it does not 
need the bus and does not renew its BREQ. Arbiter 2, 
having acquired the bus through CBRQ, is now the 
highest priority arbiter requesting the bus. As can be 
seen it is not the only arbiter requesting the bus. Arbiter 
3 is still patiently waiting for the bus and CBRQ remaIns 
low. The same conditions that forced arbiter 1 off the 


bus for arbiter 2 now forces arbiter 2 off the bus for ar- 
biter 3. When the proper surrender conditions exist, ar- 
biter 2 releases its BREQ and surrenders the bus to ar- 
biter 3. Arbiter 3 acquires the bus on clock edge P+ 1 
and releases its CBRQ.Since no other arbiter wants the 
bus (i.e., there is no other arbiter holding CBRQ low), 
CBRQ goes high (inactive). This would have also been 
true when arbiter 2 acquired the bus and released its 
CBRQ if arbiter 3 didn't want the bus. 


In the Single interface, the arbiter monitors the proc- 
essor's status lines, which are activated ,whenever the 
processor performs a transfer cycle. The arbiter, on 
detecting the status lines going active, will 
issue a 
BREQ if the status is not the HALT status. If the proc- 
essor issues the HALT status, the arbiter will not re- 
quest the bus, and if it has the bus, will release it. 


This effectively concludes how arbiters interact to one 
another on the bus. Having examined the processor-to- 
arbiter interface, and arbiter-to-MUlTIBUS (arbiter-to- 
arbiter) interaction, one interface is left, the internal 
interface 
of 
processor-related 
signals 
to 
that 
of 
MUlTIBUS·related signals. 


An important point to remember is that the processor 
has its own clock (ClK) and the multi-master system 
bus has its own (BClK). These two clocks are usually 
out of phase and of different frequencies. Thus, the ar- 
biter must synchronize events occurring on one inter- 
face to events occurring on another interface. As a 
result of this back and forth synchronization, ambiguity 
can arise as to when events actually do take place. 


Very simply, the 8289 arbiter operation can be repre- 
sented as two events, requesting and surrendering. 
Figure 11 is a representation of the timing relationships 
involved. The request input is a function of the proc- 
essor's clock and the surrender input is a function of 
either the bus clock or the processor's clock. To request 


the bus, the processor activates its status lines which in 
turn enables the request input. Depending upon the 
phase relationship between the occurrence of status (re- 
quest active) and BClK, 
BREQ appears one to two 
BClKs later. As shown in Figure 12, the phase relation- 
ship between request and BClK is such that the BRQ1 
flip-flop 
mayor 
may not catch request on the first 
BClK.· 


If BRQ1flip-flop does'catch the request, then one ElCO< 
later, BREQ goes low and one BClK after that, ~ 
goes low (it is assumed that priority 
is immediately 
granted and that the bus is available). If BRQ1 flip-flop 
does not catch the request, then request is caught on 
the next BClK and BREQgoes low one BClK later, fol- 
lowed by BUSY which also goes low one BClK later. 
Note that BREQ and BUSY track, as BREQ is an input 
term for BUSY. During bus acquisition, 
the surrender 
flip-flop 
is false' (SURNDR Q= low) and AEN follows 
BUSY. 


Once the bus is acquired, the surrender circuitry 
is 
enabled so that when a valid surrender condition exists, 
the bus can be surrendered. The surrender circuitry syn- 
chronizes .the surrender request to the processor's 
clock and drives SURNDR low. Like the acquisition cir- 
cuitry, it takes from one to two processor clocks to gen- 
erate SURNDR and depends upon the phase relation- 
ship between the surrender request and the processor's 
clock. 


"The two bus request flip·flops. 
BRQ1 and BRQ2. are edge·triggered, 
high resolution 
flip-flops and serve to reduce the probability of walkout 
down to an acceptable 
level. Walkout 
occurs 
because 
BCLK is asyn- 
chronous with respect to request. If walkout does occur on BRQ1 flip- 
flop, the probability 
is high that the BRQ1 flip-flop 
will resolve 
itself 


prior to BRQ2 flip·flop 
being triggered. 
Even if BRQl 
flip·flop 
did not 
quite resolve itself, the probability 
of BRQ2 f1ip·flop walking out to an 
unacceptable 
point in time is itself low. 


aJ BUS UTILIZATION 
AS A RESULT OF HAVING 
TO REOllEST AND RelEASE 
THE 8US 
FOR EACH TRANSFER 
CYCLE. THIS PERMITS LOWER PRIORITY ARBITERS EASY 
ACCESS TO THE BUS SHOULD THE HIGHER 
PRIORITY ARBITER NO LONGER NEED 
THE BUS. HOWEVER, 
BUS EFFICIENCY 
IS POOR DUE TO THE ARBITER THRASING 
ON 
AND OFF OF THE BUS FOR EACH TRANSFER 
CYCLE. 


b) 8289 BUS UTILIZATION 
IS MORE EFFICIENT 
IN THAT THE ARBITER HAS ONLY TO 
ACQUIRE THE BUS ONCE. THE 8289 HANGS ONW..fHE.BUS 
UNTIL FORCED OFF. 
THIS APPROACH 
ADOS A LITTLE MORE COMPLEXITY 
TO THE SYSTEM INASMUCH 
AS 
SOME MEANS MUST BE PROVIDED 
FOR LOWER PRIORITY ARBITERS TO FORCE THE 
HIGHER PRIORITY ARBITER OFF OF THE BUS WHEN IT IS NOT USING IT. THE ADDEO 
COMPLEXITY 
IS WELL WORTH THE BUS EFFICIENCY 
AND SYSTEM FLEXIBJLlTY IT 
AFFORDS. 
THE 8289 ARBITER CAN BE CONFIGURED 
TO HAVE THE TRA.NSFER TIMING 
AS SHOWN 
IN (aj (IMITATING 
THE METHOD 
8218 AND 8219 USES. BUS ARBiTERS 
FOR 
8080 AND 8085 
RESPECTIVELY) 
BY STRAPPING 
ANYRQST 
HIGH AND CBREQ LOW. 


THIS 
CONCEPTUAL 
DIAGRAM 
IS PROVIDED 
FOR AIDING 
IN UNDERSTANDING 


CLOCK 
AND 
BUS 
CLOCK 
RELATED 
EVENTS. 
IT DOES 
NOT 
REPRESENT 
THE 
ACTUAL 
SCHEMATIC 
OF THE 
8289 
DEVICE. 
AND 
IS FOR 
CONCEPTUAL 
PURPOSES 
ONLY. 
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*WHEN 
THE 
~ 
OCCURS 
SIMULTANEOUSLY 
WITH 
BCLK, 
BCLK 
MAY 
OR 
MAY 
NOT 
CATCH 
THE 
REQUEST. 
IF IT DOES, 
THE 
WAVEFORMS 
FOllOW 


THOSE 
SHOWN 
DESIGNATED 
BY @ . IF NOT, 
THE 
REQUEST 
IS PICKED 
UP 
ON THE 
NEXT 
EDGE 
OF 'B"CIK 
AND 
THE 
WAVEFORMS 
FOLLOW 
THOSE 
SHOWN 
DESIGNATED 
BY ® . 


Having synchronized the surrender request to the proc- 
essor's clock to generate SURNDR, SURNDR is then 
synchronized to BClK to reset the BUSY and BRQ flip- 
flops. When BUSY-Qgoes low, the surrender circuitry is 
reset which in turn re-enables the request input. The tim- 
ing in Figure 13 shows the surrender request input 
going high on the falling edge of the clock. If the Sample 
flip-flop was able to catch the surrender request on the 
edge of clock 1, then SURNDR would be generated (go 
low) on clock edge 2. If not, SURNDR would be gener- 
ated on clock edge 3. SURNDRgoing low on clock edge 
2 will be, for ease of discussion, referred to as SURNDR 
a and SURNDR going low on clock edge 3 will be refer- 
red to as SURNDR b. As can be seen from Figure 13, 
SURNDR a just happens to go low on BClK 
edge 2. 
Since SURNDR is used to reset the BRQ flip-flops, 
which are clocked 
by the falling 
edge of BClK, 
the 
BRQ1 flip-flop 
mayor 
may not catch SURNDR a on 
BClK edge 2. If it does, then BRQ and BUSY go high on 
BClK 
edge 3 which, for convenience, will 
be called 
BREQ a or BUSY a. If not, theh BREQ and BUSY will go 
high on BClK edge 4, which will be referred to as BREQ 
b or BUSY b, respectively. 
SURNDR b occurs early 
enough to assure that BUSY and BREQ are reset on 
BClK edge 5, which will be referred to as BUSY b1 and 


BREQb1. Depending upon when BUSYgoes high, deter- 
mines when the surrender circuitry 
is reset and how 
soon the next BREQcan be generated. BUSYa1 causes 
SURNDRc to occur where shown and SURNDRc in turn 
would allow the earliest bus request to occur at BREQ 
c1. At the other extreme, BUSY b1 allows the earliest 
bus request to occur at BREQ e1. 


Table 1 summarizes the maximum and minimum delays 
for bus request, once the pr.operrequest and surrender 
conditions exist. Table 2 lists the proper surrender con- 
ditions. 


Mode 


Single 
HALT state,loss 
of BPRN, T1·CBREQ 


HALT slate. 
loss ot BPRN, T1·CBREQ, 


I/O Command·CBRQ 


HALT slate,loss 
ot BPRN, T1·CBREQ, 


(SYSB/RESB; 
O)·CBRQ 


HALT state, 
loss of BPRN, TI·CBREQ, 
(SYSB/RESB; 
O)·CBREQ. 


I/O Command.CBRa 


·f 
'2/ bf 


___________ 
·_l~r_-_-_-_.~.z~:~~~b~l 
-- ..\-L-Cl-_-.•1-d-,--_-_-_--_-_--\'L_ 


BREO/ 
(EARLIEST 
THAT 
BREO 
COULD 
GO ACTIVE 
AFTER 
BUS 
RELEASE) 


lOB INTERFACE 


Now that the processor-arbiter, arbiter-system bus and 
internal arbiter timings 
have been discussed, it is ap- 
propriate to consider the other interfaces that the 8289 
Bus Arbiter provides. 


In the lOB mode, the processor communicates and con- 
trols a host of peripherals over the peripheral bus. When 
the I/O processor needs to communicate with system 
memory, it is done so over the system memory bus. Fig- 
ure 14 shows a possible 
I/O processor 
system con- 


figuration, 
utilizing 
the 
8089 I/O 
processor 
in 
its 


REMOTE mode. Resident memory exists on the periph- 
eral bus in order that canned I/O routines and buffer 
storage can be provided. Resident memory is treated as 
an I/O peripheral. When a peripheral device needs ser- 
vicing, the I/O processor accesses resident memory for 
the proper I/O driver routine and services the device, 
transmitting 
or storing peripheral data in buffer storage 


area of resident memory. The resident memory's buffer 
storage area could then be emptied or replenished from 
system memory via the system bus. Using the lOB inter- 
face allows an I/O processor the capability of executing 
from local memory (on the peripheral bus) concurrently 
with the host processor. 


Timing in this mode is no different from timing in the 
SINGLE BUS mode. The only difference lies in the re- 
quest and surrender conditions. The arbiter extends the 
single bus mode conditions to qualify when the system 
bus is requested and adds on additional surrender con- 
ditions. The system bus is only requested during sys- 
tem bus commands (the arbiter decodes the processor's 
status 
lines) and, in addition 
to the other surrender 


terms, the arbiter permits surrender to occur during I/O 
bus (or local bus) commands, when the I/O processor is 
using its own local bus. 


Like the arbiter, the bus controller 
must also be in- 


formed of the mode it is operating in. In the lOB mode, 
the 8288 bus controller 
issues I/O bus commands in- 


dependently of the state of AEN from the arbiter. It is 
assumed that all I/O bus commands are intended for the 
I/O bus and hence there is a separate I/O command bus 
from the controller. 
All I/O bus commands 
are sent 


directly to the I/O bus and are not influenced by AEN. 
System bus commands are assumed as going to the 
system bus. Since system bus commands are directed 
to the system bus, they must still be influenced by AEN 
and the arbitration mechanism provided by the 8289. 


As an example, suppose the processor issues an I/O bus 
command. 
The 
8288 Bus Controller 
generates 
the 


necessary control signal to latch the I/O address and 
configure the transceivers in the correct direction. In the 
lOB mode, the multiplexed MCE/PDEN pin of the 8288 
becomes PDEN (peripheral data enable) and serves to 
enable the I/O bus's data transceivers during I/O bus 
commands. DEN similarily serves to enable the system 
bus's 
data transceivers 
during 
memory 
commands. 


PDEN and DEN are mutually exclusive, so it is not possi- 
ble for both sets of transceivers 
to be on, thereby 


avoiding contention between the two sets. Since the I/O 
bus commands are generated independently of AEN In 
the lOB mode, the I/O bus has no delay effects due to 
the arbiter. During this time in which the processor is 
accessing memory the arbiter, if it already has the bus, 
will permit it to be surrendered to either a higher or 
lower priority independently of where the processor is in 


MUlTI·MASTER 
SYSTEM 
COMMAND 
'" 


MUlTl. 
•• ASTER 
SYSTEM 
ADDRESS 


XCVII 
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SYSTEM 
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its transfer cycle (i.e., independent of the machine 
state).' If the arbiter does not already have the bus, it 
will make no effort to acquire the bus. 


" the processor issues a memory command instead, the 
same set of events take place, except that 1)the system 
bus's data transceivers are enabled instead of the 
peripherals bus's data transceivers, and 2) when the 
command is issued depends upon the state of the ar· 
biter. In both cases of 1/0 bus commands and system 
bus commands, the address generated for that com· 
mand is latched into both sets of address latches, the 
system bus's address latches, and the peripherals bus's 
address latches. For each command (regardless of com· 
mand type), an address is put out on the 1/0 bus and on 
the system bus if the arbiter has the bus at that particu· 
lar time. However, the bus controller only issues a com· 
mand to one of the buses and hence, no ill effects are 
suffered by addressing both buses. 


" the arbiter already has the system bus when a system 
bus command is issued, no delays due to the arbiter will 
be noticed by the processor. " the arbiter doesn't have 
the bus and must acquire it, then the processor will be 
delayed (via the system bus command being delayed by 
the bus controller through AEN from the arbiter) until 
the arbiter has acquired the bus. The arbiter will then 
permit the bus controller to issue the command and the 
transfer cycle continues. 


The non·I/O 
processors in the 8086family can communi- 
cate with both a resident bus and a multi-master system 
bus. Two bus controllers would be needed in such a con- 
figuration as shown in Figure 15. In such a system con- 
figuration 
the 
processor would 
have to access to 


memory and peripherals of both buses. Address map- 
ping techniques can be applied to select which bus is to 
be accessed. The SYSB/RESB(system buslresident bus) 
input on the arbiter serves to instruct the arbiter as to 
whether or not the system bus is to be accessed. It also 
enables or disables commands from one of the bus con- 
trollers. 


In such a system configuration, it is possible to issue 
both memory and 1/0 commands to either bus and as a 
result, two bus controllers are needed,one for each bus. 
Since the controllers haveto issue both memory and 1/0 
commands to their respective buses, the lOB options on 
the controllers are strapped off (lOB is low). The ar- 
biter, too, has to be informed of the system configura- 
tion in order to respond appropriately to system inputs 
and has its RESBoption strapped on (RESBis high).The 
arbiter's lOB option is strapped inactive (lOB is high). 
Strapping 
the 
arbiter 
into 
the 
resident 
bus 
mode 


enables the arbiter to respond to the state of the 
SYSB/RESBinput. Depending upon the state of this in- 
put, the arbiter either requests and acquires the system 
bus or permits the surrendering of that bus. 


• Under other circumstances, 
bus surrendering 
would only be permitted 
during the period from where address to command 
hold time has been 


established 
just prior to where the next command 
would be issued. 


In the system shown in Figure 15, memory mapping 
techniques are applied on the resident bus side of the 
system rather than on the multiprocessor 
or system 
bus side. As mentioned earlier in the lOB interface, both 
sets of address latches (the resident bus's address 
latches and the system bus's address latches) are 
latched with the same address; in this case, by their 
respective bus controllers.' 
The system bus's address 


latches, however, mayor may not be enabled depending 
upon the state of the arbiter. The resident bus's address 
latches are always enabled, hence the address mapping 
technique is applied to the resident bus. 


Address mapping techniques can range in complexity 
from a single bit of the address bus (usually the most 
significant bit of the address), to a decoder, to a PROM. 
The more elaborate mapping technique, such as PROM, 
provides segment mapping, system flexibility, and easy 
mapping modifications (simply make a new PROM). 


In actual operation, both bus controllers respond to the 
processor's status lines and both will simultaneously 
issue an address latch strobe (ALE) to their respective 
address latches. Both bus controllers will issue com- 
mand and control signals unless inhibited. The purpose 
of the address mapping circuitry is to inhibit one of the 
bus controllers 
before contention 
or erroneous com· 


mands can occur. The transceivers are enabled off the 
same clock edge the commands are issued, namely </>1 
of T2 (Figure 16). The address is strobed into the ad· 
dress latches by ALE. ALE is activated as soon as the 
processor issues status, and is terminated on </>2of of 
T1. From when ALE is issued, plus the propagation 
delay of the address latches, determines where the ad- 
dress is valid. The time from which the address is valid 
to where control and commands are issued determines 
how much settling time is available for the address map· 
ping circuitry. The mapping circuitry 
must inhibit (via 


CEN) one of the bus controllers prior to where controls 
and commands are issued. Part of the settling 
time 
(see Figure 16)is consumed as a setup time requirement 
to the bus controllers. As it turns out, CEN (command 
enable) can be disqualified as late as on the falling edge 
of clock (the leading edge of </>1of T2)without fear of the 
bus controller 
issuing any commands or transceiver 
control signals. In systems (8 MHz) where less time is 
available for the address mapping circuitry, the address 
latches can be bypassed, hooking the mapping circuitry 
straight onto the processor's multiplexed addressldata 
bus (the local bus) and using ALE to strobe the mapping 
circuitry. This would avoid the propagation delay time of 
the transceivers. Besides needing to inhibit one of the 
bus controllers, the arbiter needs to be informed of the 
address mapping circuitry's decision. Depending upon 
that decision, the arbiter acquires or permits the release 
of the system bus. 


• A simpler 
system 
with an 8086 or 8088 can exist, 
jf it is desirable 
to 
only have PROM, ROM, or a read only peripheral 
interface 
on the resi- 


dent bus. The 8086 and 8088 additionally 
generate a read signal in con- 
junction 
with the 8288 control 
signals. 
By using this read signal and 


memory 
mapping, 
the 8086 or 8088 could operate 
from local program 


store without 
having the contention 
of using the system 
bus. 
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The 
arbiter 
is 
informed 
of 
this 
decision 
via 
its 


SYSB/RESB input. 
If the memory mapping 
circuitry 


selects the resident bus, then SYSB/RESB input to the 
arbiter and CEN input of the system bus controller are 
brought low; and the CEN Input of the resident bus con- 
troller 
is brought 
high. The commands 
and control 


signals of the resident bus are now enabled and those of 
the system bus are disabled. In addition, with the arbiter 
being informed that the transfer cycle is occurring on 
the resident bus, the system bus is permitted to be sur- 
rendered. Glitching is permitted on the SYSB/RESB in· 
put of the arbiter up until <1>1 
of T2. Thereafter, only clean 


transitions 
can occur on the input.- So, if mapping cir- 


cuitry can settle prior to <1>1 
of T2, there is no need to be 


concerned 
over glitching. 
If the mapping circuitry 
is 


unable to settle prior to this time, then the designer 
must guarantee a clean transition on the SYSB/RESB in- 
put. 


INTERFACE TO TWO MULTI-MASTER BUSES 


The interface of an 8086 family processor to two multi- 
system buses is simply an extension of the resident bus 
interface. The only difference is that now two arbiters 
are needed, one .for each multi·master bus, and the ad- 
dress mapping circuitry must acquire its input straight 
off the processor's 
multiplexed 
address/data bus (the 


local bus), using ALE as an address strobe input. Figure 
17 depicts how such a system might be configured. 


Figure 17 illustrates the use of the 8289 in a system en· 
vironment in three of its four modes. The host 8086 CPU 
(priority 
3) is using the 8289 in its single bus multi- 
master mode, while an 8089 I/O processor is using the 
8289 in its lOB mode. A work station based on an 8088 
processor uses the 8289 in it system/resident bus mode. 
This diagram represents a hypothetical system wherein 
there can exist more than one work station (only one 
shown). Each work station shares system resources and 
I/O. The lowest priority processor (8086)would provide 
supervisory 
functions 
and system control, 
Le., allow 


operator intervention into the system resources. A work 
station would call in assemblers and compilers or ap· 
plication 
programs 
as 
needed. When 
compiled 
or 


assembled, the results are transferred to the I/O station 
for output, thus freeing up a work station for another 
user. 


'In 
certain 
memory 
mapping 
techniques, 
the CENs 
of the bus control· 


lers are controlled 
differently 
from the SYSBfRESB 
Input of the arbiter. 


In short, 
CEN 
Is brought 
low 
automatically 
to both 
bus 
controllers, 
thereby 
disabling 
their 
command 
and control 
outputs. 
This 
permits 
a 
longer 
settling 
time 
for the memory 
mapping 
circuitry, 
since 
both con· 
trollers 
are disabled. 
When 
the 
mapping 
circuitry 
settles, 
sometime 
after t;1 of T2, one of the bus controllers 
and its associated 
bus arbiter 
(If one exists) 
Is enabled. 
After 
<1>1of T2, the arbiter 
can only 
permit 
clean 
transitions 
on the SYSB/RESB 
Input 
line. 


If one work station is used, the serial priority resolving 
technique could be used between the 8289 Bus Arbiters 
(shown in dotted lines). If more than one work station is 
desired, it would be necessary to either slow down the 
system bus clock to accommodate 
the additional 
ar· 


biters, or resort to the parallel resolving technique (as 
shown). 


Single Bus Multi·Master Interface 


This mode is the simplest and is sufficient 
for systems 


where a multiprocessing 
environment 
exists and the 


system bus bandwidth is sufficient 
to handle the peak 
concurrent requirements of a multi-master environment. 
This solution 
can provide an inexpensive solution 
for 


multi·masters 
to access an expensive 
I/O device. If, 


however, the system bus bandwidth 
is exceeded, the 


lOB or system/resident modes should be considered. 


The lOB mode is ideal when the bus can be separated in· 
to an I/O bus and memory or system bus. This mode is 
commonly 
used with 
the 8089 I/O processor 
in its 


REMOTE configuration 
to separate the I/O space from 


memory space. With the 8089, all instructions 
operate 


on either system or I/O address space. 64K bytes of I/O 
space can be accessed by the processors in the 8086 
family. 


The remaining processors in the 8086 family are con- 
strained to using only I/O instructions 
when referencing 


I/O space. If this is a limitation, 
and it is desirable to 
remove some of the processor functions 
to its private 
resources, the resident bus mode should be considered. 


Resident Bus Mode 


The resident bus mode allows for maximum flexibility 
for a CPU device, giving it both access to its own local 
resources with full instruction 
set capability, 
and the 


system resources. The CPU can work from its own local 
resources without 
contention 
on' the system bus. By 


using a PROM for memory mapping, memory space can 
be easily altered in this mode. This mode requires the 
use of a second 8288 bus controller chip. 


The 8289 brings a new dimension to microcomputer ar· 
chitecture by allowing the advanced 8/16-bit microproc· 
essors to play easily in a mUlti·master, multiprocessing 
environment. With the flexible modes of the 8289,a user 
can define one of several bus architectures to meet his 
cost/performance 
needs. Modularity, improved system 


reliability 
and increased performance are just a few of 


the benefits that designing a multiprocessing 
system 


provides. 
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INTRODUCTION 


The Intel 8259A is a Programmable Interrupt Controller 
(PIC) designed 
for use in real-time interrupt 
driven 
microcomputer 
systems. The 8259A manages eight 
levels of interrupts and has built-in features for expan· 
sion up to 64 levels with additional 8259A's. Its versatile 
design allows it to be used within 
MCS·80, MCS-85, 
MCS·86, and MCS-88 microcomputer 
systems. Being 
fully programmabie, the 8259A provides a wide variety of 
modes and commands to tailor 8259A interrupt process· 
ing for the specific needs of the user. These modes and 
commands control a number of interrupt oriented func- 
tions such as interrupt priority selection and masking of 
interrupts. The 8259A programming may be dynamically 
changed by the software at any time, thus allowing com- 
plete interrupt control throughout program execution. 


The 8259A is an enhanced, fully compatible revision of 
its predecessor, the 8259.This means the 8259Acan use 
all hardware and software originally designed for the 
8259 without any changes. Furthermore, it provides ad· 
ditional 
modes that increase its flexibility 
in MCS·80 
and MCS-85systems and allow it to work in MCS-86and 
MCS-88 systems. These modes are: 


• MCS·86/88Mode 
• Automatic End of Interrupt Mode 
• Level Triggered Mode 
• Special Fully Nested Mode 
• Buffered Mode 


Each of these are covered in depth further in this appli- 
cation note. 


This application note was written to explain completely 
how to use the 8259A within MCS-80,MCS-85,MCS·86, 
and MCS-88 microcomputer systems. It is divided into 
five sections. The first section, "Concepts", 
explains 


the concepts of interrupts and presents an overview of 
how the 8259A works with each microcomputer system 
mentioned 
above. The second 
section, 
"Functional 
Block Diagram", describes the internal functions of (he 
8259A in block diagram form and provides a detailed 
functional description of each device pin. "Operation of 
the 8259A", the third section, explains in depth the 
operation and use of each of the 8259A modes and com- 
mands. For clarity of explanation, this section doesn't 
make reference to the actual programming of the 8259A. 
Instead, all programming is covered in the fourth sec- 
tion, "Programming the 8259A". This section explains 
how to program the 8259A with the modes and com- 
mands mentioned in the previous section. These two 
sections are referenced in Appendix A. The fifth and 
final section "Application 
Examples", shows the 8259A 


in three typical 
applications. 
These applications 
are 


fully explained with reference to both hardware and soft- 
ware. 


The reader should note that some of the terminology 
used 
throughout 
this 
application 
note 
may differ 


slightly from existing data sheets. This is done to better 
clarify and explain the operation and pro'gramming of 
the 8259A. 


1. CONCEPTS 


In microcomputer systems there is usually a need for 
the processor to communicate with various Input/Out- 


put (I/O) devices such as keyboards, displays, sensors, 
and other peripherals. From the system viewpoint, the 
processor should spend as little time as possible servic- 
ing the peripherals since the time required for these I/O 
chores directly affects the amount of time available for 
other tasks. In other words, the system 
should 
be 


designed so that I/O servicing has little or no effect on 
the 
totai 
system 
throughput. 
There are two 
basic 


methods of handling the I/O chores in a ;>ystem:status 
polling and interrupt servicing. 


The status poll method of I/O servicing essentially in· 
volves having th~ processor "ask" each peripheral if it 
needs servicing by testing the peripheral's status line. If 
the peripheral requires service, the processor branches 
to the appropriate service routine; if not, the processor 
continues 
with the main program. Clearly, there are 


several problems in implementing such an approach. 
First, how often a peripheral is polled is an important 
constraint. 
Some idea of the "frequency-of-service" 


required by each peripheral must be known and any soft- 
ware written for the system must accommodate this 
time dependence by "scheduling" 
when a device is 


polled. Second, there will obviously be times when a 
device is polled that is not ready for service, wasting the 
processor time that it took to do the poll. And other 
times, a ready device would have to wait until the proc- 
essor "makes its rounds" before it could be serviced, 
slowing down the peripheral. 


Other problems arise when certain peripherals are more 
important than others. The only way to implement the 
"priority" 
of devices is to poll the high priority devices 


more frequently than lower priority ones. It may even be 
necessary to poll the high priority devices while in a low 
priority device service routine. It is easy to see that the 
polled approach can be inefficient 
both time-wise and 


software-wise. Overall, the polled method of I/O servic- 
ing can havea detrimental effect on system throughput, 
thus limiting 
the tasks that can be performed by the 


processor. 


A more desirable approach in most systems would allow 
the processor to be executing its main program and only 
stop to service the I/O when told to do so by the I/O 
itself. This is called the interrupt service method. In 
effect, the device would asynchronously signal the proc- 
essor when it required service. The processor would 
finish 
its current instruction 
and then vector to the 


service routine for the device requesting service. Once 
the service routine is complete, the processor would 
resume exactly where it left off. Using the interrupt ser- 
vice method, no processor time is spent testing devices, 
scheduling 
is not needed, and priority 
schemes are 


readily implemented. It is easy to see that, using the in- 
terrupt service approach, system throughput would in- 
crease, allowing 
more tasks 
to be handled by the 


processor. 


However, to implement the interrupt 
service method 


between processor and peripherals, additional hardware 
is usually required. This is because, after interrupting 
the processor, the device must supply information for 
vectoring program execution. Depending on the proc- 
essor used, this can be accomplished by the device tak- 
ing control of the data bus and "jamming" 
an instruc- 


tion(s) onto it. The instruction(s) then vectors the pro- 


gram to the proper service routine. This of course re- 
quires additional 
control 
logic for each interrupt 
re- 


questing device. Yet the implementation so far is only in 
the most basic form. What if certain peripherals are to 
be of higher priority than others? What if certain inter· 
rupts must be disabled while others are to be enabled? 
The possible variations go on, but they all add up to one 
theme; to provide greater flexibility 
using the interrupt 


service method, hardware requirements increase. 


So, we're caught in the middle. The status poll method 
is a less desirable way of servicing 
1/0 
in terms of 


throughput, but its hardware requirements are minimal. 
On the other hand, the interrupt service method is most 
desirable in terms of flexibility 
and throughput, 
but 


additional hardware is required. 


The perfect situation would be to havethe fleXibility and 
throughput of the interrupt method in an implementa- 
tion with minimal hardware requirements. The 8259A 
Programmable Interrupt Controller (PIC) makes this all 
possible. 


The 8259A Programmable Interrupt Controller (PIC)was 
designed to function as an overall manager of an inter- 
rupt driven system. No additional hardware is required. 
The 8259A alone can handle eight prioritized interrupt 
levels, controlling the complete interface between pe- 
ripherals and processor. Additional 
8259A's can be 


"cascaded" to increase the number of interrupt levels 
processed. A wide variety of modes and commands for 
programming the 8259A give it enough fleXibility for 
almost any interrupt 
controlled 
structure. 
Thus, the 


8259A is the feasible answer to handling 1/0 servicing in 
microcomputer systems. 


Now, before explaining exactly how to use the 8259A, 
let's go over interrupt structures of the MCS-80,MCS-85, 
MCS-86, and MCS-88 systems, and how they interact 
with the 8259A. Figure 1 shows a block diagram of the 
8259A interfacing with a standard system bus. This may 
prove useful as reference throughout the rest of the 
"Concepts" 
section. 


I 


INTERRUPT 
ReaUESTS 


1.1 MCS·80™_8259A 
OVERVIEW 


In an MCS-80-8259A 
interrupt 
configuration, 
as in 
Figure 2, a device may cause an interrupt by pUlling one 
of the 8259A's interrupt request pins (IRO-IR7)high. If 
the 8259A accepts the interrupt request (this depends 
on its programmed condition), the 8259A's INT (inter- 
rupt) pin will go high, driving the 8080A's INT pin high. 


The 8080A can receive an interrupt request any time, 
since its INT input is asynchronous. The 8080A, how- 
ever, doesn't always have to acknowledge an interrupt 
request immediately. 
It can accept or disregard re- 


quests under software control using the EI(Enable Inter- 
rupt) or 01 (Disable Interrupt) instructions. 
These in- 


structions either set or reset an internal interrupt enable 
flip-flop. The output of this f1ip·flop controls the state of 
the INTE (Interrupt Enabled) pin. Upon reset, the 8080A 
interrupts are disabled, making INTE low. 


At the end of each instruction cycle, the 8080A exam- 
ines the state of its INT pin. If an interrupt request is 
present and interrupts are enabled, the 8080A enters an 
interrupt machine cycle. During the interrupt machine 
cycle the 8080A resets the internal interrupt enable flip- 
flop, disabling further interrupts until an EI instruction 
is executed. Unlike normal machine cycles, the interrupt 
machine cycle doesn't increment the program counter. 
This ensures that the 8080A can return to the pre- 
interrupt program location after the interrupt is com- 
pleted. 
The 8080A then 
issues 
an INTA (Interrupt 
Acknowledge) pulse via the 8228System Controller Bus 
Driver.This INTA pulse signals the 8259Athat the 8080A 
is honoring the request and is ready to process the inter- 
rupt. 


The 8259Acan now vector program execution to the cor· 
responding service routine. This is done during a se- 
quence of the three INTA pulses from the 8080A via the 
8228. Upon receiving the first 
INTA pulse the 8259A 
places the opcode for a CALL instruction on the data 
bus. This causes the contents of the program counter to 
be pushed onto the stack. In addition, the CALL instruc- 
tion causes two more INTA pulses to be issued, allow- 
ing the 8259A to place onto the data bus the starting 
address of the corresponding 
service routine. This 
address is called the interrupt-vector address. The lower 
8 bits (LSB)of the interrupt-vector address are released 
during the second INTA pulse and the upper 8 bits 
(MSB) during the third INTA pulse. Once this sequence 
is completed, program execution then vectors to the 
service routine at the interrupt·vector address. 


If the same registers are used by both the main program 
and the interrupt service routine, their contents should 
be saved when entering the service routine. This in· 
cludes the Program Status Word (PSW)which consists 
of the accumulator and flags. The best way to do this is 
to "PUSH" each register used onto the stack. The ser- 
vice routine can then "POP" each register off the stack 
in the reverse order when it is completed. This prevents 
any ambiguous operation when returning to the main 
program. 


Once 
the 
service 
routine 
is 
completed, 
the 
main 
program may be re-entered by using a normal RET 
(Return) instruction. This will "POP" the original con· 


tents of the program counter back off the stack to 
resume program execution where it left off. Note, that 
because interrupts 
are disabled during the interrupt 
acknowledge 
sequence, the EI instruction 
must be 
executed either during the service routine or the main 
program before further interrupts can be processed. 


For additional information on the 8080A interrupt struc- 
ture and operation, refer to the MCS·80User's Manual. 


1.2 MCS·8S™_82S9A 
OVERVIEW 


An MCS-85-8259A configuration processes interrupts 
in much the same format as an MCS-80-8259A config- 


uration. When an interrupt occurs, a sequence of three 
INTA pulses causes the 8259A to release onto the data 
bus a CALL instruction and an interrupt-vector address 
for the corresponding service routine. Other events that 
occur during the 8080A interrupt machine cycie, such as 
disabling interrupts and not incrementing the program 
counter, also occur in the 8085A interrupt acknowledge 
machine cycle. Additionally, the instructions for saving 
registers, enabling or disabling of interrupts, and return- 
ing from service routines are literally the same. 


The 8085A, however, has a different interrupt hardware 
scheme as shown in Figure 3. For one, the 8085A sup- 
plies its own INTA output pin rather than using an addi- 
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tional chip, as the 8080A uses the 8228 System Con- 
troller Bus Driver. Another hardware difference 
is the 
8085A has five hardware interrupt pins: INTR, RST 7.5, 
RST6.5,RST5.5,and TRAP.The INTR(Interrupt Request) 
pin is the equivalent to the 8080A's INT pin. The RST 
(Restart) pins and TRAP pin are all restart interrupts 
which vector program execution to an individual dedi- 
cated address when asserted. The important 
factor 
associating these interrupts is their relative priority, as 
shown below: 
TRAP 
Highest Priority 
RST 7.5 
RST 6.5 
RST 5.5 
INTR 
Lowest Priority 


The INTR pin has lowest priority among the other 8085A 
hardware interrupts. Thus, precautions to prevent inter- 
rupting 8259A service routines may be necessary. This, 
of course, depends on how the 8085A interrupts 
are 
being used in a particular application. Such precautions 
can be implemented, however, by masking the RSTpins 
using the SIM instruction. The TRAP pin on the other 
hand is non-maskable; all interrupt pins but TRAP can 
be controlled by the EI (Enable Interrupt) and 01 (Disable 
Interrupt) instructions. 


For a complete description of the 8085A interrupt struc- 
ture, refer to the MCS-85 User's Manual. 


1.3 MCS·86/88 
TM_8259A 
OVERVIEW 


Operation of an MCS-86/88-8259A 
configuration 
has 
basic similarities 
of the MCS-80/85-8259A 
configura- 


tions. That is, a device can cause an interrupt by pulling 
one of the 8259A's interrupt request pins (IRO-tR7)high. 
If the 8259A honors the request, its INT pin will go high, 
driving the 8086/8088's 
INTR pin high. Like the 8080A 
and 8085A, the INTR pin of the 8086/8088 
is asynchro- 
nous, thus it can receive an interrupt 
any time. The 
8086/8088 
can also accept or disregard 
requests on 
INTR under software control using the STI (Set Interrupt) 
or Cli (Clear Interrupt) instructions. These instructions 
set 
or 
clear 
the 
interrupt-enabled 
flag 
IF. 
Upon 
8086/8088 
reset the IF flag is cleared, disabling external 
interrupts on INTR. Beside the INTR pin, the 8086/8088 
provides an NMI (Non-Maskable Interrupt) pin. The NMI 
functions similar to the 8085A's TRAP; it can't be dis- 
abled or masked. NMI has higher priority than INTR. 


Figure 4 shows an MCS-86 MAX Mode system interfac- 
ing with an 8259A on the local bus. This MCS-86-8259A 
configuration 
is also representative of an MCS-88- 
8259A configuration except for the data bus which is 16 
bits for 8086 and 8 bits for 8088. In the MCS-86 system 
the 8259A must be on the lower 8 bits of the data bus. 
Note that the 8259A could also be interfaced on the 
system bus. 


Although there are some basic similarities, 
the actual 
processing of interrupts with an 8086/8088 
is different 
than an 8080A or 8085A. When an interrupt request is 
present and interrupts are enabled, the 8086/8088 enters 
its interrupt acknowledge machine cycle. The interrupt 
acknowledge machine cycle pushes the flag registers 
onto the stack (as in a PUSHF instruction). It then clears 
the IF flag which disables interrupts. The contents of 
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both the code segment and the instruction pointer are 
then also pushed onto the stack. Thus, the stack retains 
the pre-interrupt flag status and pre·interrupt program 
location 
which are used to return from the service 


routine. The 808618088then issues the first of two INTA 
pulses which signal the 8259A that the 808618088has 
honored its interrupt request. If the 808618088is used in 
its "MIN Mode" the INTA signal is available from the 
808618088on its INTA pin. If the 808618088is used in the 
"MAX Mode" the INTA signal is available via the 8288 
Bus Controller 
INTA pin. Additionally, 
in the "MAX 
Mode" the 808618088LOCK pin goes low during the in- 
terrupt acknowledge sequence. The LOCK signal can be 
used to indicate to other system bus masters not to gain 
control of the system bus during the interrupt acknowl- 
edge sequence. A "HOLD" 
request won't be honored 


while LOCK is low. 


The 8259A is now ready to vector program execution to 
the corresponding service routine. This is done during 
the sequence of the two INTA pulses issued by the 80861 
8088. Unlike operation with the 8080A or 8085A, the 
8259A doesn't place a CALL instruction and the starting 
address of the service routine on the data bus. Instead, 
the first INTA pulse is used only to signal the 8259A of 
the honored request. The second INTA pulse causes the 
8259A to place a single interrupt-vector byte onto the 
data bus. Not used as a direct address, this interrupt- 
vector byte pertains to one of 256 interrupt "types" sup· 
ported by the 808618088memory. Program execution is 
vectored to the corresponding 
service routine by the 


contents of a specified interrupt type. 


All 256 interrupt types are located in absolute memory 
locations 0 through 3FFH which make up the 80861 
8088's interrupt-vector table. Each type in the interrupt- 
vector table requires 4 bytes of memory and stores a 
code segment address and an instruction 
pointer ad- 


dress. Figure 5 shows a block diagram of the interrupt- 
vector 
table. 
Locations 
0 through 
3FFH should 
be 


reserved for the interrupt-vector 
table alone. Further- 


more, memory locations 00 through 7FH (types 0-31) are 
reserved for use by Intel Corporation for Intel hardware 
and software products. To maintain compatibility 
with 


present 
and future 
Intel 
prOducts, these 
locations 


should not be used. 
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When the 808618088receives an interrupt-vector 
byte 


from the 8259A, it multiplies its value by four to acquire 
the address of the interrupt type. For example, if the 
interrupt·vector byte specifies type 128 (80H), the vec- 
tored address in 808618088memory is 4 x 80H, which 
equals 200H. Program execution is then vectored to the 
service routine whose address is specified by the code 
segment and instruction pointer values within type 128 
located at 200H. To show how this is done, let's assume 
interrupt type 128 is to vector data to 808618088memory 
location 2FF5FH. Figure 6 shows two possible ways to 
set values of the code segment and instruction pointer 
for vectoring to location 2FF5FH. Address generation 
by the code segment and instruction 
pointer 
is ac- 
complished 
by an offset (they overlap). Of the total 
20-bit address capability, the code segment can desig- 
nate the upper 16 bits, the instruction 
pointer 
can 


designate the lower 16 bits. 
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When entering an interrupt service routine, those regis- 
ters that are mutually used between the main program 
and service routine should be saved. The best way to do 
this is to "PUSH" each register used onto the stack im- 
mediately. The service routine can then "POP" each 
register off the stack in the same order when it is com- 
pleted. 


Once the service routine is completed the main program 
may be re-entered by using a IRET (Interrupt Return) in- 
struction. The IRETinstruction will pop the pre-interrupt 
instruction 
pointer, code segment and flags off the 


stack. Thus the main program will resume where it was 
interrupted 
with 
the same flag status 
regardless of 


changes in the service routine. Note especially that this 
includes the state of the IF flag, thus interrupts are re- 
enabled automatically when returning from the service 
routine. 


Beside external interrupt generation from the INTR pin, 
the 808618088is also able to invoke interrupts by soft- 
ware. Three interrupt instructions are provided: INT, INT 
(Type3),and INTO. INT is a two byte instruction, the sec- 
ond byte selects the interrupt type. INT (Type 3) is a one 
byte instruction which selects interrupt Type 3. INTO is 
a conditional 
one 
byte 
interrupt 
instruction 
which 


selects interrupt Type 4 if the OF flag (trap on overflow) 
is set. All the software interrupts vector program execu- 
tion as the hardware interrupts do. 


For further information on 8086/8088interrupt operation 
and internal interrupt 
structure 
refer to the MCS-86 


User's Manual and the 8086 System Design application 
note. 


2. 8259A FUNCTIONAL BLOCK DIAGRAM 


A block diagram of the 8259A is shown in Figure 7. As 
can be seen from this figure, the 8259Aconsists of eight 
major blocks: the Interrupt Request Register (IRR),the 
In-Service Register (ISR), the Interrupt Mask Register 
(IMR), the Priority Resolver (PR), the cascade buffer/ 
comparator, the data bus buffer, and logic blocks for 
control and read/write. We'll first go over the blocks 
directly related to interrupt handling, the IRR, ISR, IMR, 
PR, and the control 
logic. The remaining functional 


blocks are then discussed. 


2.1 INTERRUPT REGISTERS AND CONTROL LOGIC 


Basically, interrupt requests are handled by three "cas- 
caded" registers: the Interrupt Request Register (IRR)is 
use to store all the interrupt levels requesting service; 
the In-Service Register (ISR)stores all the levels which 
are being serviced; and the Interrupt 
Mask Register 


(IMR)stores the bits of the interrupt lines to be masked. 
The Priority Resolver (PRl looks at the IRR,ISRand IMR, 
and determines whether an INT should be issued by the 
the control logic to the processor. 


Figure 8 shows conceptually how the Interrupt Request 
(IR) input handles an interrupt request and how the 
various interrupt 
registers interact. The figure repre- 


sents one of eight "daisy-chained" priority cells, one for 
each IR input. 


The best way to explain the operation of the priority cell 
is to go through the sequence of internal events that 
happen when an interrupt 
request occurs. However, 


first, notice that the input circuitry of the priority cell 
allows for both level sensitive and edge sensitive IR in- 
puts. Deciding which method to use is dependent on the 
particular application 
and will be discussed in more 


detail later. 


When the IR input is in an inactive state (LOW),the edge 
sense latch 
is set. If edge sensitive 
triggering 
is 


selected, the "Q" output of the edge sense latch will 
arm the input gate to the request latch. This input gate 
will be disarmed after the IR input goes active (HIGH) 
and the interrupt request has been acknowledged. This 
disables the input from generating any further inter- 
rupts until it has returned low to re-arm the edge sense 
latch. If level sensitive triggering is selected, the "Q" 
output of the edge sense latch is rendered useless. This 
means the level of the IR input is in complete control of 
interrupt generation; the input won't be disarmed once 
acknowledged. 


When an interrupt occurs on the IR input, it propagates 
through the request latch and to the PR (assuming the 
input isn't masked). The PR looks at the incoming re- 
quests and the currently in-service interrupts to ascer- 
tain whether an interrupt should be issued to the proc- 
essor. Let's assume that the request is the only one in- 
coming and no requests are presently in service. The PR 
then causes the control logic to pull the INT line to the 
processor high. 
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When the processor honors the INT pulse, it sends a se- 
quence of INTA pulses to the 8259A (three for 8080A/ 
8085A, two for 8086/8088). During this sequence the 
state of the request latch is frozen (note the INTA-freeze 
request timing diagram). Priority is again resolved by the 
PR to determine the appropriate interrupt vectoring 
which is conveyed to the processor via the data bus. 


Immediately after the interrupt acknowledge sequence, 
the PR sets the corresponding bit in the ISR which 
simultaneously clears the edge sense latch. if edge sen- 
sitive triggering is used, clearing the edge sense latch 
also 
disarms 
the 
request 
latch. 
This 
inhibits 
the 


possibility 
of a still active IR input from propagating 


through the priority cell. The IR input must return to an 
inactive state, setting 
the edge sense latch, before 


another interrupt request can be recognized. If level sen- 
sitive triggering 
is used, however, clearing the edge 


sense latch has no affect on the request latch. The state 
of the request latch is entirely dependent upon the IR in- 
put level. Another interrupt will be generated immedi- 
ately if the IR level is left active after its ISRbit has been 
reset. An ISRbit gets reset with an End-of-Interrupt (EOI) 
command 
issued 
in 
the 
service 
routine. 
End-of- 


interrupts will be covered in more detail later. 


Data Bus Buffer 


This three-state, bidirectional 8-bit buffer is used to in- 
terface the 8259A to the processor system data bus (via 


DBO-DB7). Control 
words, 
status 
information, 
and 
interrupt-vector data are transferred through the data 
bus buffer. 


ReadIWrite Control logic 


The function of this block is to control the programming 
of the 8259A by accepting OUTput commands from the 
processor. It also controls the releasing of status onto 
the data bus by accepting INput commands from the 
processor. The initialization 
and operation command 


word registers which store the various control formats 
are located in this block. The RD, WR, AO, and OS 
pins are used to control access to this block by the 
processor. 


Cascade Buffer/Comparator 


As mentioned earlier, multiple 8259A's can be combined 
to expand the number of interrupt levels. A master-slave 
relationS!lip of cascaded 8259A's is used for the expan- 
sion. The SP/ENand the CASO-2pins are used for oper- 
ation of this block. The cascading of 8259A's is covered 
in depth in the "Operation of the 8259A" section of this 
application note. 


Vcc 
GND 
+ 5V supply 


Ground 


Chip Select: 
A low on this pin en- 
ables RDand WR communication be- 
tween the CPU and the 8259A. INTA 
functions are independent of CS. 


Write: A low on this pin when CS is 
low enables the 8259A to accept 
command words from the CPU. 


Read: A low on this pin when CS is 
low enables the 8259A to release 
status onto the data bus for the CPU. 


07-00 
4-11 
1/0 Bidirectional 
Data 
Bus: 
Control, 


status and interrupt-vector 
informa- 
tion is transferred via this bus. 


CASO- 12,13, 1/0 Cascade Lines: The CAS lines form a 
CAS2 
15 
private 8259A bus to control a multi- 
ple 8259A structure. These pins are 
outputs for a master 8259A and in- 
puts for a slave 8259A. 


SP/EN 
16 
110Slave Program/Enable 
Buffer: This is 
a dual function pin. When in the buf- 
fered mode it can be used as an out- 
put 
to 
control 
buffer 
transceivers 
(EN).When not in the buffered mode 
it is used as an input to designate a 
master (SP= 1)or slave (SP= 0). 


INT 
17 
0 
Interrupt: 
This pin goes high when- 
ever a valid interrupt request is as- 
serted. It is used to interrupt 
the 
CPU, thus 
it 
is connected 
to the 
CPU's interrupt pin. 


IRO- 
18-25 
I 
Interrupt Requests: Asynchronous in- 
IR7 
puts. An interrupt 
request can be 
generated by raising an IR input (low 
to high) and holding it high until it is 
acknowledged (edge triggered mode), 
or just by a high level on an IR input 
(level triggered mode). 


INTA 
26 
Interrupt 
Acknowledge: 
This pin is 
used to enable 8259A interrupt-vector 
data onto the data bus. This is done 
by a sequence of interrupt acknowl- 
edge pulses issued by the CPU. 


AO 
27 
AOAddress Line: This pin acts in con- 
junction 
with the CS, WR, and RD 
pins. It is used by the 8259A to de- 
cipher 
between 
various 
command 
words the CPU writes and status the 
CPU wishes to read. It is typically 
connected to the CPU AO address 
line (A1 for 8086/8088). 


3. OPERATION 
OF THE 
8259A 


Interrupt operation of the 8259A falls under five main 
categories: vectoring, priorities, triggering, status, and 
cascading. Each of these categories use various modes 
and commands. This section will explain the operation 
of these modes and commands. For clarity of explana- 
tion, however, the actual programming of the 8259A isn't 


covered in this section but in "Programming the 8259A". 
Appendix A is provided as a cross reference between 
these two sections. 


3.1 
INTERRUPT 
VECTORING 


Each IR input of the 8259A has an individual interrupt- 
vector address in memory associated with it. Designa- 
tion of each address depends upon the initial program- 
ming of the 8259A. As stated earlier, the interrupt 
sequence and addressing of an MCS-80 and MCS-85 
system differs 
from that of an MCS-86 and MCS-88 
system. Thus, the 8259A must be initially programmed 
in either a MCS-80/85 
or MCS-86/88 
mode of operation to 
insure the correct interrupt vectoring. 


MCS-80185™ 
Mode 


When programmed in the MCS-80/85 
mode, the 8259A 
should only be used within 
an 8080A or an 8085A 
system. In this mode the 8080A/8085Awill handle inter- 
rupts in the format described in the "MCS-80-8259A 
or 
MCS-85-8259A Overviews." 


Upon interrupt 
request in the 
MCS-80/85 
mode, the 
8259A will output to the data bus the opcode for a CALL 
instruction and the address of the desired routine. This 
is in response to a sequence of three INTA pulses 
issued by the 8080A/8085A after the 8259A has raised 
INT high. 


The first INTA pulse to the 8259A enables the CALL 
opcode "CDH" onto the data bus. It also resolves IR pri- 
orities 
and effects 
operation 
in the cascade mode, 


which 
will 
be covered later. Contents 
of 
the first 
interrupt-vector byte are shown in Figure 9A. 


During the second and third INTA pulses, the 8259A 
conveys a 16-bit interrupt-vector address to the 8080AI 
8085A.The interrupt-vector addresses for all eight levels 
are selected when initially 
programming 
the 8259A. 
However, only one address is needed for programming. 
Interrupt-vector addresses of IRO-IR7 are automatically 
set at equally spaced intervals based on the one pro- 
grammed address. Address intervals are user definable 
to 4 or 8 bytes apart. If the service routine for a device is 
short it may be possible to fit the entire routine within 
an 8-byte interval. Usually, though, the service routines 
require more than 8 bytes. So, a 4-byte interval is used to 
store a Jump (JMP) instruction which directs the 8080Al 
8085A to the appropriate routine. The 8-byte interval 
maintains 
compatibility 
with 
current 
8080A/8085A 
Restart (RST) instruction software, while the 4-byte in· 
terval is best for a compact jump table. If the 4-byte in- 
terval is selected, then the 8259A will automatically 
insert 
bits 
AO-A4. This 
leaves A5-A15 
to 
be pro- 


grammed by the user. If the 8-byte interval is selected, 
the 8259A will automatically 
insert bits AO-A5. This 
leaves only A6-A15 to be programmed by the user. 


The LSB of the interrupt-vector address is placed on the 
data bus during the second INTA pulse. Figure 9B 
shows the contents of the second interrupt-vector byte 
for both 4 and 8-byte intervals. 


Th~ MSB of the interrupt-vector address is placed on the 
data bus during the third INTA pulse. Contents of the 
third interrupt-vector byte is shown in Figure 9C. 
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C. THIRDINTERRUPTVECTORBYTE,MCS80/85 MODE 


MCS-88/881M Mode 


When programmed in the MCS-86/88 mode, the 8259A 
should 
only 
be used within 
an MCS-86 or MCS·88 
system. In this mode, the 8086/8088 will handle inter· 
rupts in the format described earlier in the "8259A- 
8086/8088 Overview". 


Upon interrupt in the MCS-86/88 mode, the 8259A will 
output a single interrupt·vector 
byte to the data bus. 


This is in response to only two INTA pulses issued by 
the 8086/8088after the 8259A has raised INT high. 


The first INTA pulse is used only for set·up purposes in- 
ternal to the 8259A. As in the MCS·80/85mode, this set· 
up Includes priority resolution and cascade mode oper- 
ations which will be covered later. Unlike the MCS·80/85 
mode, no CALL opcode is placed on the data bus. 


The second INTA pulse is used to enable the single 
interrupt·vector 
byte onto the data bus. The 8086/8088 
uses this interrupt-vector byte to select one of 256 inter· 
rupt "types" 
in 8086/8088memory. Interrupt type selec· 
tion for all eight IR levels is made when initially 
pro- 
gramming the 8259A. However, reference to only one in- 
terrupt type is needed for programming. The upper 5 bits 
of the interrupt vector byte are user definable. The lower 
3 bits are automatically 
inserted by the 8259A depend· 
Ing upon the IR level. 


Contents of the interrupt·vector byte for 8086/8088type 
selection is put on the data bus during the second INTA 
pulse and is shown in Figure 10. 
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3.2 INTERRUPT PRIORITIES 


A variety of modes and commands are available for con· 
trolling interrupt priorities of the 8259A. All of them are 
programmable, that is, they may be changed dynamic· 
ally under software control. With these modes and com· 
mands, many possibilities 
are conceivable, giving the 
user enough versatility 
for almost any interrupt 
con· 
trolled application. 


Fully Nested Mode 


The fully nested mode of operation is a general purpose 
priority mode. This mode supports a multilevel·interrupt 
structure in which priority order of all eight IR inputs are 
arranged from highest to lowest. 


Unless otherwise programmed, the fUlly nested mode is 
entered by default upon initialization. At this time, IROis 
assigned the highest priority through IR7 the lowest. 
The fUlly nested mode, however, is not confined to this 
IR structure alone. Once past initialization, 
other IR in· 


puts can be assigned highest priority also, keeping the 
multilevel-interrupt 
structure of the fully nested mode. 


Figure 11A-C shows some variations of the priority 
structures in the fully nested mode. 
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Further explanation of the fully nested mode, in this 
section, is linked with information of general 8259A in- 
terrupt operations. This is done to ease explanation to 
the user in both areas. 


In general, when an interrupt 
is acknowledged, 
the 
highest priority request is determined from the IRR (in- 
terrupt Request Register). The interrupt vector is then 
placed on the data bus. In addition, the corresponding 
bit in the ISR(In·Service Register) is set to designate the 
routine in service. This ISR bit remains set until an EOI 
(End-Of-Interrupt) command 
is issued to the 8259A. 


EOI's will be explained in greater detail shortly. 


In the fully nested mode, while an ISR bit is set, all fur- 
ther requests of the same or lower priority are inhibited 
from generating an interrupt to the microprocessor. 
A 
higher priority request, though, can generate an inter- 
rupt, thus vectoring program execution to its service 
routine. Interrupts are only acknowledged, however, if 
the microprocessor has previously executed an "Enable 
Interrupts" 
instruction. 
This is because the interrupt 
request pin on the microprocessor gets disabled auto- 
matically after acknowledgement of any interrupt. The 
assembly language instructions 
used to enable inter· 


rupts are "EI" for 8080Al8085A and "STI" for 8086/8088. 
Interrupts can be disabled by using the instruction "01" 
for 8080Al 8085A and "CLI" 
for 8086/8088. When a 
routine is completed a "return" instruction 
is executed, 


"RET" for 8080A/8085A and "IRET" for 8086/8088. 


Figure 
12 illustrates 
the correct 
usage of interrupt 
related instructions 
and the interaction 
of interrupt 
levels in the fully nested mode. 


Assuming the IR priority assignment for the example in 
Figure 12 is IRQthe highest through IR7 the lowest. the 
sequence is as follows. During the main program, IR3 
makes a request. Since interrupts 
are enabled, the 
microprocessor is vectored to the IR3 service routine. 
During the IR3 routine, IR1 asserts a request. Since IR1 
has higher priority than IR3, an interrupt is generated. 
However, it is not acknowledged because the micro- 
processor disabled interrupts in response to the IR3 in- 
terrupt. The IR1 interrupt is not acknowledged until the 
"Enable Interrupts" 
instruction 
is executed. Thus the 
IR3 routine has a "protected" 
section of code over 
which no interrupts (except non-maskable) are allowed. 
The IR1 routine has no such "protected" 
section since 
an "Enable Interrupts" instruction is the first one in its 
service routine. Note that in this example the IR1 re- 
quest must stay high until it is acknowledged. This is 
covered in more depth in the "Interrupt 
Triggering" 
section. 


What is happening to the ISRregister? While in the main 
program, no lSR bits are set since there aren't any inter- 
rupts in service. When the IR3 interrupt 
is acknowl- 
edged, the ISR3 bit is set. When the IR1 inte~rupt is 
acknowledged, both the ISR1 and the ISR3 bits are set, 
indicating that neither routine is complete. At this time, 
only IRQcould generate an interrupt since it is the only 
input with a higher priority than those previously in ser- 
vice. To terminate the IR1 routine, the routine must 
inform the 8259A that it is complete by resetting its ISR 
bit. It does this 
by executing 
an EOI command. A 
"return" 
instruction 
then transfers execution back to 


the IR3 routine. This allows IRQ-IR2to interrupt the IR3 
routine again, since ISR3 is the highest ISR bit set. No 
further interrupts occur in the example so the EOI com- 
mand resets ISR3 and the "return" 
instruction causes 
the main program to resume at its pre-interrupt location, 
ending the example. 


A single 8259A is essentially always in the fully nested 
mode unless certain programming conditions disturb it. 
The following 
programming conditions 
can cause the 
8259A to go out of the high to low priority structure of 
the fully nested mode. 


o The automatic EOI mode 


o The special mask mode 


• A slave with a master not in the special fully nested 
mode 


These modes will 
be covered in more detail 
later, 


however, they are mentioned now so the user can be 
aware of them. As long as these program conditions 
aren't inacted, the fully nested mode remains undis- 
turbed. 


End of Interrupt 


Upon completion 
of an interrupt 
service routine the 


8259A needs to be notified so its ISR can be updated. 
This is done to keep track of which interrupt levels are in 
the process of being serviced and their relative priori- 
ties. Three different End-Of-Interrupt (EOI) formats are 
available for the user. These are: the non-specific EOI 
command, the specific 
EOI command, and the auto- 
matic EOI Mode. Selection of which EOIto use is depen- 
dent upon the interrupt operations the user wishes to 
perform. 


Non-Specific 
EOI Command 


A non-specific EOI command sent from the microproc- 
essor lets the 8259A know when a service routine has 
been completed, without specification of its exact inter- 
rupt level. The 8259Aautomatically determines the inter- 
rupt level and resets the correct bit in the ISR. 


To tpokeadvantage of the non-specific EOI the 8259A 
musi be in a mode of operation in which it can predeter- 
mine in-service routine levels. For this reason the non- 
specific EOI command should only be used when the 
most recent level acknowledged and serviced is always 
the highest priority level. When the 8259A receives a 
non-specific EOI command, it simply resets the highest 
priority ISR bit, thus confirming to the 8259A that the 
highest priority 
routine of the routines in service is 
finished. 


The main advantage of using the non-specific EOI com- 
mand is that IR level specification 
isn't necessary as in 
the 
"Specific 
EOI 
Command", 
covered 
shortly. 


However, special consideration should be taken when 
deciding to use the non-specific EOI. Here are two pro- 
gram conditions in which it is best not used: 


• Using the set priority command within an interrupt 
service routi ne. 


o Using a special mask mode. 


These conditions are covered in more detail in their own 
sections, but are listed here for the users reference. 


Specific EO' Command 


A specific EOI command sent from the microprocessor 
lets the 8259A know when a service routine of a particu- 
lar interrupt level is completed. Unlike a non-specific 
EOI command, which automatically resets the highest 
priority ISR bit, a specific EOI command specifies an 
exact ISR bit to be reset. One of the eight IR levels of the 
8259A can be specified in the command. 


The reason the specific EOI command is "needed, is to 
reset the ISR bit of a completed service routine when- 
ever the 8259A isn't able to automatically determine it. 
An example of this type of situation 
might be if the 
priorities of the interrupt levels were changed during an 
interrupt -routine ("Specific 
Rotation"). In this case, if 
any other routines were in service at the same time, a 
non-specific EOI might reset the wrong ISRbit. Thus the 
specific EOIcommand is the best bet in this case, or for 
that matter, any time in which confusion of interrupt 
priorities may exist. The specific EOI command can be 
used in all conditions 
of 8259A operation, including 
those that prohibit non-specific EOI command usage. 


Automatic EO' Mode 


When programmed in the automatic 
EOI mode, the 
microprocessor no longer needs to issue a command to 
notify the 8259A it has completed an interrupt routine. 
The 8259A accomplishes 
this 
by performing 
a non- 
specific EOIautomatically at the trailing edge of the last 
INTA 
pulse 
(third 
pulse 
in 
MCS-80/85, second 
in 
MCS-86). 


The obvious adv,antageof the automatic EOI mode over 
the other EOI command is no command has to be 
issued. In general, this 
simplifies 
programming and 
lowers code requirements within interrupt routines. 


However, special consideration should be taken when 
deciding to use the automatic 
EOI mode because it 
disturbs the fully nested mode. In the automatic EOI 
mode the ISR bit of a routine in service is reset right 
after it's acknowledged, thus leaving no designation in 
the ISRthat a sevice routine is being executed. If any in- 
terrupt request occurs during this time (and interrupts 
are enabled) it will get serviced regardless of its priority, 
low or high. The problem of "over nesting" 
may also 
happen in this situation. "Over nesting" is when an IR 
input keeps interrupting its own routine, resulting in un- 
necessary stack pushes which could fill the stack in a 
worst case condition. This is not usually a desired form 
of operation! 


So what good is the automatic EOI mode with problems 
like those just covered? Well, again, like the other EOls, 
selection is dependent upon the application. 
If inter· 
rupts are controlled at a predetermined rate, so as not to 
cause the problems mentioned above, the automatic 
EOI mode works perfect just the way it is. However, if in- 
terrupts happen sporadically at an indeterminate rate, 
the automatic EOI mode should only be used under the 
following guideline: 


• When using the automatic EOI mode with an inde· 
terminate interrupt rate, the microprocessor should 
keep its interrupt 
request input disabled during 
execution of service routines. 


By doing this, higher priority interrupt levels will be ser· 
viced only after the completion of a routine in service. 
This guideline restores the fully 
nested structure 
in 
regards to the IRR;however, a routine in-service can't be 
interrupted. 


Automatic Rotation - 
Equal Priority 


Automatic rotation of priorities serves in applications 
where the interrupting 
devices are of equal priority, 
such as communications channels. The concept is that 
once a peripheral is serviced, all other equal priority 
peripherals should be given a chance to be serviced 
before the original peripheral is serviced again. This is 
accomplished by automatically 
assigning a peripheral 
the lowest priority after being serviced thUS, in worst 
case, the device would 
have to wait 
until 
all other 
devices are serviced before being serviced again. 


There are two methods of accomplishing 
automatic 
rotation. 
One is used in conjunction 
with 
the non- 


specific 
EOI, "rotate on non·specific 
EOI command". 


The other is used with the automatic EOI mode, "rotate 
in automatic EOI mode". 


Rotate on Non·Specific EO' Command 


When the 
rotate 
on non-specific 
EOI command 
is 
issued, the highest ISR bit is reset as in a normal non· 
specific EOI command. After it's reset though, the cor· 
responding IR level is assigned lowest priority. Other IR 
priorities 
rotate to conform to the fUlly nested mode 
based on the newly assigned low priority 


Figures 13A and B show how the rotate on non-specific 
EOI command effects 
the interrupt 
priorities. 
Let's 
assume the IR priorities 
were assigned with IRO the 
highest and IR? the lowest, as in 13A. IR6 and IR4 are 
already in service but neither is completed. Being the 
higher priority 
routine, IR4 is necessarily the routine 
being executed. During the IR4 routine a rotate on non- 
specific EOI command is executed. When this happens, 
bit 4 in the ISR is reset. IR4 then becomes the lowest 
priority and IR5 becomes the highest as in 13B. 
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Rotate in Automatic EO' Mode 


The rotate in automatic EOI mode works much like the 
rotate on non·specific EOI command. The main differ· 
ence is that priority rotation is done automatically after 


the last INTA pulse of an interrupt 
request. To enter or 


exit this mode a rotate·in·automatic·EOI 
set command 
and rotate·in-automatic·EOI 
clear command is provided. 
After that, no commands are needed as with the normal 
automatic 
EOI mode. However, it must be remembered, 


when using any form of the automatic 
EOI mode, spe- 
cial consideration 
should be taken. Thus, the guideline 


for the automatic 
EOI mode also stands for the rotate in 


automatic 
EOI mode. 


Specific 
Rotation 
- 
Specific 
Priority 


Specific 
rotation 
gives the user versatile capabilities 
in 


interrupt 
controlled 
operations. 
It serves in those ap- 
plications 
in which a specific 
device's interrupt 
priority 


must 
be altered. 
As opposed 
to automatic 
rotation 


which automatically 
sets priorities, 
specific 
rotation 
is 


completely 
user controlled. 
That is, the user selects 


which 
interrupt 
level is to receive 
lowest 
or highest 


priority. 
This can be done during the main program or 


within 
interrupt 
routines. 
Two specific 
rotation 
com- 


mands are available to the user, the "set 
priority 
com- 
mand" and the "rotate 
on specific 
EOI command." 


Set Priority Command 


The set priority 
command 
allows 
the programmer 
to 


assign an IR level the lowest priority. All other interrupt 
levels will conform 
to the fully nested mode based on 


the newly assigned 
low priority. 


An example of how the set priority 
command works is 


shown in Figures 14A and 146. These figures show the 
status of the ISR and the relative priorities 
of the inter· 
rupt levels before and after the set priority 
command. 


Two interrupt 
routines 
are shown 
to be in service 
in 


Figure 
14A. Since 
IR2 is the 
highest 
priority, 
it 
is 


necessarily 
the routine being executed. 
During the IR2 


routine, priorities 
are altered so that IR5 is the highest. 


This is done simply by issuing the set priority command 
to the 8259A. In this case, the command specifies 
IR4 as 


being the lowest priority. 
The result of this set priority 


command is shown in Figure 146. Even though IR7 now 
has higher priority 
than IR2, it won't be acknowledged 


until the IR2 routine is finished (via EOI). This is because 
priorities 
are only resolved upon an interrupt 
request or 


an interrupt 
acknowledge 
sequence. If a higher priority 


request occurs during the IR2 routine, then priorities 
are 


resolved and the highest will be acknowledged. 
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When completing 
a service 
routine 
in which 
the set 


priority 
command 
is used, the correct 
EOI must 
be 


issued. 
The non·specific 
EOI command 
shouldn't 
be 


used in the same routine 
as a set priority 
command. 


This is because the non·specific 
EOI command 
resets 


the highest 
ISR bit, which, when using the set priority 


command, 
is not always the most recent routine in ser· 


vice. The automatic 
EOI mode, on the other hand, can be 


used with the set priority 
command. This is because it 


automatically 
performs 
a non-specifiC 
EOI before the 


set priority 
command 
can be issued. The specific 
EOI 


command is the best bet in most cases when using the 
set priority 
command within a routine. 6y resetting 
the 


specific 
ISR bit of a routine being completed, 
confusion 


is eliminated. 


Rotate on Specific 
EOI Command 


The rotate on specific 
EOI command 
is literally 
a com· 


bination 
of the set priority 
command 
and the specific 


EOI command. 
Like the set priority 
command, 
a speci- 


fied IR level is assigned lowest priority. Like the specific 
EOI command, a specified 
level will be reset in the ISR. 


Thus the rotate on specific 
EOI command accomplishes 


both tasks in only one command. 


If it is not necessary to change IR priorities 
prior to the 


end of an interrupt 
routine, then this command is advan· 


tageous. 
For an EOI command 
must be executed 
any- 


way (unless in the automatic 
EOI mode), so why not do 


both at the same time? 


Interrupt 
Masking 


Disabling 
or enabling 
interrupts 
can be done by other 


means than just controlling 
the microprocessor's 
inter- 


rupt request pin. The 8259A has an IMR (Interrupt 
Mask 


Register) which enhances interrupt 
control capabilities. 


Rather than all interrupts 
being disabled 
or enabled at 


the same time, the IMR allows 
individual 
IR masking. 


The IMR is an 8-bit register, bits 0-7 directly correspond 
to IRO-IR7. Any IR input can be masked by writing to the 
IMR and setting the appropriate 
bit. Likewise, any IR in· 


put can be enabled by clearing the correct 
IMR bit. 


There are various uses for masking off individual 
IR in- 


puts. One example is when a portion of a main routine 
wishes 
only 
to be interrupted 
by specific 
interrupts. 


Another might be disabling 
higher priority 
interrupts 
for 


a portion 
of a lower priority 
service routi ne. The possi- 


bilities 
are many. 


When an interrupt occurs while its IMR bit is set, it isn't 
necessarily 
forgotten. 
For, as stated 
earlier, 
the IMR 


acts only on the output of the IRR. Even with an IR input 
masked it is still 
possible 
to set the IRR. ThUS, when 


resetting an IMR, if its IRR bit is set it will then generate 
an interrupt. 
This 
is prOViding, of course, 
that 
other 


priority 
factors 
are taken into consideration 
and the IR 


request 
remains 
active. 
If the IR request 
is removed 


before the IMR is reset, no interrupt 
will 
be acknowl- 


edged. 


Special Mask Mode 


In various cases, it may be desirable to enable interrupts 
of a lower priority than the routine in service. Or, in other 
words, 
allow 
lower priority 
devices 
to generate 
inter- 


rupts. However, in the fully nested mode, all IR levels of 


priority below the routine in service are inhibited. So 
what can be done to enable them? 


Well, one method could be using an EOI command 
before the actual completion of a routine in service. But 
beware, doing this may cause an "over nesting" prob· 
lem, similar to in the automatic EOI mode. In addition, 
resetting an ISR bit is irreversible by software control, 
so lower priority IR levels could only be later disabled by 
setting the IMR. 


A much better solution is the special mask mode. Work- 
ing in conjunction with the IMR, the special mask mode 
enables interrupts from all levels except the level in ser- 
vice. This is done by masking the level that is in service 
and then issuing the special mask mode command. 
Once the special mask mode is set, it remains in effect 
until reset. 


Figure 15 shows how to enable lower priority interrupts 
by using the Special Mask Mode (SMM). Assume that 
IROhas highest priority when the main program is inter- 
rupted by IR4. In the IR4 service routine an enable inter- 
rupt instruction 
is executed. This only allows higher 


priority interrupt requests to interrupt IR4 in the normal 
fully nested mode. Further in the IR4 routine, bit 4 of the 
IMR is masked and the special mask mode is entered. 
Priority operation is no longer in the 'fully nested mode. 
All interrupt levels are enabled except for IR4. To leave 
the special mask mode, the sequence is executed in 
reverse. 


I 


MASK.lR4 
I 


seT $MM 
I 
I 


RESET 
SMM 
I 


Precautions must be taken when exiting an interrupt 
service routine which has used the special mask mode. 
A non·specific EOI command can't be used when in the 
special mask mode. This is because a non-specific 
won't ciear an ISR bit of an interrupt which is masked 
when in the special mask mode. In fact, the bit will ap- 
pear invisible. 
If the special mask mode is cleared 


before an EOI command is issued a non-specific EOI 
command can be used. This could be the case in the ex- 
ample shown in Figure 1'5;but, to avoid any confusion 
it's best to use the specific 
EOI whenever using the 


special mask mode. 


It mlJst be remembered that the special mask mode ap- 
plies to all masked levels when set. Take, for instance, 
IR1 interrupting IR4 in the previous example. If this hap- 
pened while in the special mask mode, and the IR1 
routine maSked itself, all interrupts would be enabled 
except IR1 and IR4 which are masked. 


3.3 
INTERRUPT 
TRIGGERING 


There are two classical ways of sensing an active inter- 
rupt request: a level sensitive input or an edge sensitive 
,input. The 8259A gives the user the capability for either 
method with the edge triggered mode anp the level trig- 
gered mode. Selection of one of these interrupt trigger- 
ing methods is done during the programmed initializa- 
tion of the 8259A.' 
. 


.Level Triggered 
Mode 


When in the level triggered mode the 8259A will recog- 
nize any active (high) level on an IR input as an interrupt 
request. If the IR input remains active after an EOI com- 
mand has been issued (resetting its ISR bitl, another in- 
terrupt will be generated. This is providing of course, the 
processor INT pin is enabled. Unless repetitious inter- 
rupt generation is desired, the IR input,must be brought 
to an jnactive state before an EOI command is issued in 
its service routine. However, it must not go inactive so 
soon that it disobeys the necessary timing 
require- 


ments shown in Figure 16. Note that theJequest on the 
IR input must remain until after the fallif)g edge of the 
first INTA pulse. If on any IR input/the 
reql\est goes 


inactive before the first 
INTA pulse, the, 8259A will 


respond as if IR7 was active. In any dllsign in which 
there's a possibility of this happening, the, IR7 default 
feature can be used as a safeguard. This can be accom- 
plished by using the IR7 routine as a "clean-up routine" 
which might recheck the 8259A status'or merely return 
program ex'ecution to its pre-interrupt location. 


Depending upon the particular design and application, 
the level triggered mode has a number of uses. For one, 
it provides for repetitious interrupt generation. This is 
useful in cases when a service routine needs to be con- 
tinually executed until the interrupt request goes inac- 
tive. Another possible advantage of the level triggere'd 
mode is it allows for "wire-OR'ed" interrupt requests. 
That is, a number of interrupt requests using the same 
IR input. This can't be done in the edge triggered mode, 
for if a device makes an interrupt request while the IR in- 
put is high (from another request), its transition will be 
"shadowed". Thus the 8259A won't recognize further in- 
terrupt requests because its IF! ir1Put is already high. 
Note ,that when a "wire-OR'ed" scheme is used,lhe ac- 


tual requesting device has to be determined by the soft- 
ware in the service routine. 


Caution should be taken when using the automatic EOI 
mode and the level triggered mode together. Since in 
the automatic EOI mode an EOI is automatically per· 
formed at the end of the interrupt acknowledge se· 
quence, if the processor enables interrupts while an IR 
input is still high, an interrupt will occur immediately. To 
avoid this situation interrupts should be kept disabled 
until the end of the service routine or until the IR input 
returns low. 


Edge Triggered 
Mode 


When in the edge triggered mode, the 8259A will only 
recognize interrupts if generated by an inactive (low) to 
active (high) transition on an IR input. The edge trig· 
gered mode incorporates an edge lockout method of 
operation. This means that after the rising edge of an 
interrupt request and the acknowledgement of the re- 
quest, the positive level of the IR input won't generate 
further interrupts on this level. The user needn't worry 
about quickly removing the request after acknowledge- 
ment in fear of generating further interrupts as might be 
the case in the level triggered mode. Before another in- 
terrupt can be generated the IR input must return to the 
inactive state. 


Referring back to Figure 16,the timing requirements for 
interrupt triggering is shown. Like the level triggered 
mode, in the edge triggered mode the request on the IR 
input must remain active until after the falling edge of 
the first INTA pulse for that particular interrupt. Unlike 
the level triggered mode, though, after the interrupt 
request is acknowledged its IRRlatch is disarmed. Only 
after the IR input goes inactive will the IRR latch again 
become armed, making it ready to receive another inter- 
rupt request (in the level triggered mode, the IRRlatch is 
always armed). Because of the way the edge triggered 
mode functions, it is best to use a positive level with a 
negative pulse to trigger the IR requests. With this type 
of input, the trailing edge of the pulse causes the inter- 
rupt and the maintained positive level meets the neces· 
sary timing requirements (remaining high until after the 
interrupt acknowledge occurs). Note that the IR7default 


feature mentioned in the "level triggered mode" section 
also works for the edge triggered mode. 


Depending upon the particular design and application, 
the edge triggered mode has various uses. Because of 
its edge lockout operation, it is best used in those 
applications where repetitious interrupt generation isn't 
desired. It is also very useful in systems where the inter- 
rupt request is a pulse (this should be in the form of a 
negative pulse to the 8259A). Another possible advan- 
tage is that it can be used with the automatic EOI mode 
without the cautions in the level triggered mode. Over- 
all, in most cases, the edge triggered mode simplifies 
operation for the user, since the duration of the interrupt 
request at a positive level is not usually a factor. 


3.4 INTERRUPTSTATUS 


By means of software control, the user can interrogate 
the status of the 8259A.This allows the reading of the 
internal interrupt registers, which may prove useful for 
interrupt control during service routines. It also pro- 
vides for a modified status poll method of device moni- 
toring, by using the poll command. This makes the 
status of the internal IR inputs available to the user via 
software control. The poll command offers an alterna- 
tive to the interrupt vector method, especially for those 
cases when more than 64 interrupts are needed. 


Reading Interrupt Registers 


The contents of each B·bit interrupt register, IRR, ISR, 
and IMR, can be read to update the user's program on 
the present status of the 8259A.This can be a versatile 
tool in the decision making process of a service routine, 
giving the user more control over interrupt operations. 
Before delving into the actual process of reading the 
registers, let's briefly review their general descriptions: 


IRR (Interrupt 
Specifies all interrupt levels re- 


Request Register) 
questing service. 


ISR (In'Service 
Specifies 
all 
interrupt 
levels 


Register) 
which are being serviced. 


IMR (Interrupt 
Specifies all interrupt levels that 


Mask Register) 
are masked. 


To read the contents of the IRR or ISR, the user must 


.st issue the appropriak read register cvmmand (read 
IRR or read ISR) to the 8259A. Then by applying a RD 
pulse to the 8259A (an INput instruction), the contents 
of the desired register can be acquired. There is no need 
to issue a read register command every time the IRR or 
ISR is to be read. Once a read register command is 
received by the 8259A, it "remembers" which register 
has been selected. Thus, all that is necessary to read 
the contents of the same register more than once is the 
RD pulse and the correct addressing (AO= 0, explained 
in "Programming the 8259A"). Upon initialization, 
the 


selection of registers defaults to the IRR.Some caution 
should be taken when using the read register command 
in a system that supports several levels of interrupts. If 
the higher priority routine causes an interrupt between 
the read register command and the actual input of the 
register contents, there's no guarantee that the same 
register will be selected when it returns. Thus it is best 
in such cases to disable interrupts during the operation. 


Reading the contents of the IMR is different than read- 
ing the IRR or ISR. A read register command is not 
necessary when reading the IMR. This is because the 
IMR can be addressed directly for both reading and 
writing. Thus all that the 8259A requires for reading the 
IMR is a RD pulse and the correct addressing (AO= 1, 
explained in "Programming the 8259A"). 


Poll Command 


As mentioned towards the beginning of this application 
note, there are two methods of servicing peripherals: 
status polling and interrupt servicing. For most applica- 
tions 
the interrupt 
service method is best. This is 
because it requires the least amount of CPU time, thus 
increasing system throughput. However, for certain ap· 
plications, the status poll method may be desirable. 


For this reason, the 8259A supports polling operations 
with the poll command. As opposed to the conventional 
method of polling, the poll command offers improved 
device servicing and increased throughput. Rather than 
having the processor poll each peripheral in order to 
find the actual device requiring service, the processor 
polls the 8259A.This allows the use of all the previously 
mentioned priority modes and commands. Additionally, 
both polled and interrupt methods can be used within 
the same program. 


To use the poll command the processor must first have 
its interrupt request pin disabled. Once the poll com· 
mand is issued, the 8259A will treat the next (CS quali· 
fied) RDpulse issued to it (an INput instruction) as an in- 
terrupt acknowledge. It will then set the appropriate bit 
in the ISR,if there was an interrupt request, and enable a 
special 
word onto the data bus. This word shows 


whether an interrupt 
request has occurred and the 
highest 
priority 
level requesting 
service. Figure 
17 
shows the contents of the "poll word" which is read by 
the processor. Bits WO-W2 convey the binary code of 
the highest priority level requesting service. Bit I desig· 
nates whether or not an interrupt request is present. If 
an interrupt request is present, bit I will equal 1. If there 
isn't an interrupt request at all, bit I will equal 0 and bits 
WO-W2 will be set to ones. Service to the requesting 
device is achieved by software decoding the poll word 
and branching to the appropriate service routine. Each 


time the 8259A is to be polled, the poll command must 
be written before reading the poll word. 


The poll command is useful in various situations. For in· 
stance, it's a good alternative when memory is very 
limited, because an interrupt·vector table isn't needed. 
Another use for the poll command is when more than 64 
interrupt levels are needed (64 is the limit when cascad· 
ing 8259's). The only limit of interrupts using the poll 
command is the number of 8259's that can be addressed 
in a particular system. Still another application of the 
poll command might be when the INT or INTA signals 
are not available. This might be the case in a large 
system where a processor on one card needs to use an 
8259A on a different card. In this instance, the poll com· 
mand is the only way to monitor the interrupt devices 
and still 
take advantage of the 8259A's prioritizing 


features. For those cases when the 8259A is using the 
poll command only and not the interrupt method, each 
8259A must receive an initialization sequence (interrupt 
vector). This must be done even though the interrupt 
vector features of the 8259A are not used. In this case, 
the 
interrupt 
vector 
specified 
in 
the 
initialization 


sequence could be a "fake". 
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3.5 INTERRUPTCASCADING 


As mentioned earlier, more than one 8259A can be used 
to expand the priority interrupt scheme to up to 64 levels 
without additional hardware. This method for expanded 
interrupt capability 
is called "cascading". 
The 8259A 


supports cascading operations with the cascade mode. 
Additionally, the special fUlly nested mode and the buf· 
fered mode are available for increased flexibility 
when 
cascading 8259A's in certain applications. 


Cascade Mode 


When programmed in the cascade mode, basic opera· 
tion consists of one 8259A acting as a master to the 
others which are serving as slaves. Figure 18 shows a 
system containing a master and two slaves, providing a 
total of 22 interrupt levels. 


A specific 
hardware set·up is required to establish 


operation in the cascade mode. With Figure 18 as a ref· 
erence, note that the master is designated by a high on 
the SP/EN pin, while the SP/EN pins of the slaves are 
grounded (this can also be done by software, see buf· 
fered mode). Additionally, the INT output pin of each 
slave is connected to an IR input pin of the master. The 
CASO-2 pins for all 8259A's are paralleled. These pins 
act as outputs when the 8259A is a master and as inputs 
for the slaves. Serving as a private 8259A bus, they con- 
trol which slave has control of the system bus for inter· 
rupt vectoring operation with the processor. All other 
pins are connected as in normal operation (each 8259A 
receives an INTA pulse). 
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Besides hardwareset-up requirements, all 8259A's must 
be software programmed to work in the cascade mode, 
Programming the cascade mode is done during the in- 
itialization of each 8259A,The 8259Athat is selected as 
master must receive specification during its initializa- 
tion as to which of its IR inputs are connected to a 
slave's INT pin, Each slave 8259A, on the other hand, 
must be designated during its initialization with an 10(0 
through 7)corresponding to which of the master's IR in- 
puts its INT pin is connected to, This is all necessary so 
the CASO-2 pins of the masters will be able to address 
each individual slave, Note that as in normal operation, 
each 8259A must also be initialized to give its IR inputs 
a unique interrupt vector. More detail on the necessary 
programming of the cascade mode is explained in "Pro- 
gramming the 8259A", 
' 


Now, with background information on both hardware 
and software for the cascade mode, let's go over the 
sequence of events that occur during a valid interrupt 
request from a slave. Suppose a slave IR input has 
received an interrupt request. Assuming this request is 
higher priority than other requests and in·service levels 
on the slave, the slave's INT pin is driven high. This 
signals the master of the request by causing an inter- 
rupt request on a designated IRpin of the master. Again, 
assuming that this request to the master is higher priori- 
ty than other master requests and in-service levels 
(possibly from 'other slaves), the master's INT pin is 
pulled high, interrupting the processor. 


The interrupt acknowledge sequence appears to the 
processor the same as the non-cascading interrupt 
acknowledge sequence; however, it's different among 
the 8259A's. The first INTA pulse is used by all the 
8259A's for internal set-up purposes and, if in the 
8080/8085 
mode, the master will place the CALL opcode 
on the data bus. The first IN'M pulse also signals the 
master to place the requesting slave's 10 code on the 
CAS lines. This turns control over to the slave for the 
rest of the interrupt acknowledge sequence, placing the 


appropriate pre-programmed interrupt 
vector on the 
data bus, completing the interrupt request. 
' 


During the interrupt acknowledge sequence, the cor- 
responding ISR bit of both the master and the slave get 
set. This means two EOI commands must be issued (if 
not in the automatic EOI mode),one for the master and 
one for the slave. 


Special consideration 
should be taken when mixed 
interrupt requests are assigned to a master 8259A;that 
is, when some of the master's IR inputs are used for 
slave interrupt requests and,some are used for individ- 
ual interrupt requests. In this type of structure, the 
master's IRa must not be used for a slave. This is 
because when an IR input that isn't initialized as a slave 
receives an interrupt request, the CASO-2lines won't be 
activated, thus staying in the default condition address- 
ing for IRa (slave IRa). If a slave is connected to the 
master's 
IRa when a nor-slave interrupt 
occurs on 
another master IR input, erroneous conditions 
may 
result. Thus IRashould be the last choice when assign- 
ing slaves to IR inputs. 


Special Fully Nested Mode 


Depending on the application, changes in the nested 
structure of the cascade mode may be desired. This is 
because the nested structure of, a slave 8259A differs 
from that of the normal fully nested mode. In the cas- 
cade mode, if a slave receives a higher priority interrupt 
request than one which is in service (through the same 
slave), it won't be recognized by the master. This is 
because the master's ISRbit is set, ignoring all requests 
of equal or lower priority. Thus, in this case, the higher 
priority slave inte'rrupt won't be serviced until after the 
master's ISR bit is reset by an EOI command, This is 
most likely after the completion of the lower priority 
routine. 


If the user wishes to have a truly fully nested structure 
within a slave 8259A, the special fully nested mode 
should be used. The special fully nested mode is pro- 


grammed in the master only. This is done during the 
master's 
initialization. 
In this mode the master will 


ignore only those interrupt requests of lower priority 
than the set ISR bit and will respond to all requests of 
equal or higher priority. Thus if a slave receives a higher 
priority request than one in service, it will be recognized. 
To insure proper interrupt operation when using the 
special fUlly nested mode, the software must determine 
if any other slave interrupts are still in service before 
issuing an EOI command to the master. This is done by 
reselling the appropriate slave ISR bit with an EOI and 
then reading its ISR. If the ISR contains all zeros, there 
aren't any other interrupts from the slave in service and 
an EOI command can be sent to the master. If the ISR 
isn't all zeros, an EOIcommand shouldn't be sent to the 
master. Clearing the master's ISR bit with an EOI com- 
mand while there are still 
slave interrupts in service 


would allow lower priority interrupts to be recogliized at 
the master. An example of this process is shown in the 
second application in the "Applications Examples" sec- 
tion. 


Buffered Mode 


The buffered mode is useful in large systems where buf- 
fering is required on the data bus. Although not limited 
to only 8259A cascading, it's most pertinent in this use. 
In the buffered mode, whenever the 8259A's data bus 
output is enabled, its SP/EN pin will go low. This signal 
can be used to enable data transfer through a buffer 
transceiver in the required direction. 


Figure 19 shows a conceptual diagram of three 8259A's 
in cascade, each slave is controlling an individual 8286 
8-bit bidirectional bus driver by means of the buffered 
mode. Note the pull-up on the SP/EN. It is used to 
enable da'tatransfer to the 8259A for its initial program- 
ming. When data transfer is to go from the 8259Ato the 
processor, SP/EN will go low; otherwise, it will be high. 


A question should arise, however, from the fact that the 
SP/EN pin is used to designate a master from a slave; 


how can it be used for both master·slave selection and 
buffer control? The answer to this is the provision for 
software programmable master-slave selection when in 
the buffer mode. The buffered mode is selected during 
each 8259A's initialization. At the same time, the user 
can assign each individual 8259A as a master or slave 
(see "Programming the 8259A"). 


4. PROGRAMMING THE 8259A 


Programming the 8259A is accomplished by using two 
types 
of 
command 
words: 
Initialization 
Command 


Words 
(ICWs) and 
Operational 
Command 
Words 
(OCWs).All the modes and commands explained in the 
previous section, "Operation 
of the 8259A", are pro· 


grammable using the ICWs and OCWs (see Appendix A 
for cross reference). The ICWs are issued from the proc- 
essor in a sequential format and are used to set-up the 
8259A in an initial state of operation. The OCWs are 
issued as needed to vary and control 8259A operation. 


Both ICWs and OCWs are sent by the processor to the 
8259A via the data bus (8259A CS= 0, WR= 0). The 
8259A distinguishes 
between the different 
ICWs and 


OCWs by the state of its AOpin (controlled by processor 
addressing), the sequence they're issued in (ICWs only), 
and some dedicated bits among the ICWs and OCWs. 
Those bits which are dedicated are indicated so by fixed 
values (0 or 1) in the corresponding ICW or OCW pro- 
gramming formats which are covered shortly. 
Note, 


when 
issuing 
either 
ICWs or OCWs, the 
interrupt 


request pin of the processor should be disabled. 


4.1 INITIALIZATION COMMAND WORDS (ICWs) 


Before normal operation can begin, each 8259A in a 
system must be initialized by a sequence of two to four 
programming 
bytes called 
ICWs (Initialization 
Com· 


mand Words). The ICWs are used to set-up the neces- 
sary conditions and modes for proper 8259A operation. 


Figure 20 shows the initialization 
flow of the 8259A. 
Both ICW1 and ICW2 must be issued for any form of 
8259A operation. However, ICW3 and ICW4 are used 
only if designated so in ICW1. Determining the neces- 
sity and use of each ICW is covered shortly in individual 
groupings. Note that, once intialized, if any program- 
ming changes within the ICWs are to be made, the entire 
ICW sequence must be reprogrammed, not just an indi- 
viduallCW. 


Certain internal set-up conditions 
occur automatically 
within the 8259A after the first ICW has been issued. 
These are: 


A. Sequencer logic is set to accept the remainng ICWs 
as designated in ICW1. 


B. The ISR(In-Service Register) and IMR (Interrupt Mask 
Register) are both cleared. 


C. The special mask mode is reset. 


D. The rotate in automatic EOI mode flip-flop is cleared. 


E. The IRR (Interrupt Request Register) is selected for 
the read register command. 


F. If the IC4 bit equals 0 in ICW1, all functions in ICW4 
are cleared; 8080/8085 
mode is selected by default. 


G. The fully nested mode is entered with an initial prior- 
ity assignment of IROhighest through IR7 lowest. 


H. The edge sense latch of each IR priority 
cell is 
cleared, thus requiring a low to high transition 
tp 
generate an interrupt (edge triggered mode effected 
only). 


The ICW programming 
format, Figure 21, shows bit 
designation and a short definition of each ICW.With the 
ICW format as reference, the functions of each ICW will 
now be explained individually. 
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ICW1 and ICW2 


Issuing ICW1 and ICW2 is the minimum amount of pro- 
gramming needed for any type of 8259A operation. The 
majority of bits within these two ICWs are used to desig- 
nate the interrupt vector starting address. The remain- 
ing bits serve various purposes. Description oflhe ICW1 
and ICW2 bits is as follows: 


IC4: 
The IC4 bit is used to designate to the 8259A 
whether or not ICW4 will be issued. If any of 
the ICW4 operations are to be used, ICW4 
must equal 1. If they aren't used, then ICW4 
needn't be issued and IC4 can equal O. Note 
that if IC4= 0, the 8259Awill assume operation 
in the MCS-80/85mode. 


SNGL: 
The SNGL bit is used to designate whether or 
not the 8259A is to be used alone or in the cas- 
cade mode. If the cascade mode is desired, 
SNGL must equal O. In doing this, the 8259A 
will accept ICW3for further cascade mode pro- 
gramming. If the 8259A is to be used as the 
single 8259A within a system, the SNGL bit 
must equal 1; ICW3 won't be accepted. 


ADI: 
The ADI bit is used to specify the address in- 
terval for the MCS-80/85mode. If a 4-byte ad- 
dress interval is to be used, ADI must equal 1. 
For an 8-byte address interval, ADI must equal 
O.The state of ADI is ignored when the 8259A 
is in the MCS-86/88mode. 


LTIM: 
The LTIM bit is used to select between the two 
IR input triggering modes. If LTIM= 1,the level 
triggered mode is selected. If LTIM = 0, the 
edge triggered mode is selected. 


A5-A15: The A5-A15 bits are used to select the inter- 


rupt vector address when in the MCS-80/85 
mode. There are two programming formats 
that can be used to do this. Which one is im- 
plemented depends upon the selected address 
interval (ADI). If ADI is set for the 4-byte inter- 
val, then the 8259A will automatically 
insert 
AO-A4 (AO, A1=0 
and A2, A3, A4=IRO-7). 
Thus A5-A15 must be user selected by pro- 
gramming the A5-A 15 bits with the desired ad- 
dress. If ADI is set for the 8-byte interval, then 
AO-A5 are automatically 
inserted 
(AO, A1, 


A2 = a and A3, A4, A5= IRO-7). This leaves 
A6-A 15 to be selected by programming the 
A6-A15 bits with the desired address. The 
state of bit 5 is ignored in the latter format. 


T3-T7: 
The T3-T7 bits are used to select the interrupt 
type when the MCS-86/88mode is used. The 
programming of T3-T7 selects the upper 5 
bits. The lower 3 bits are automatically 
in- 


serted, corresponding to the IR level causing 
the interrupt. The state of bits A5-A10 will be 
ignored when in the MCS-86/88mode. Estab- 
lishing the actual memory address of the inter- 
rupt is shown in Figure 22. 
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ICW3 


The 8259A will only accept ICW3 if programmed in the 
cascade mode (ICW1, SNGL= 0). ICW3 is used for 
specific 
programming within 
the cascade mode. Bit 


definition 
of ICW3 differs depending on whether the 


8259A is a master or a slave. Definition of the ICW3 bits 
is as follows: 


SO-7 
(Master): 


If the 8259A is a master (either when the 
SP/EN pin is tied high or in the buffered 
mode when MIS= 1 in ICW4), ICW3 bit defi- 
nition is SO-7,corresponding to "slave 0-7". 
These bits are used to establish which IR in- 
puts have slaves connected to them. A 1 
designates a slave, a a no slave. For exam- 
ple, if a slave was connected to IR3, the S3 
bit should be set to a 1. (SO)should be last 
choice for slave designation. 


If the 8259Ais a slave(either when the SP/EN 
pin is low or in the buffered mode when 
MIS= a in ICW4),ICW3 bit definition is used 
to establish its individual identity. The 10 
code of a particular slave must correspond 
to the number of the masters IR input it is 
connected to. For example, ;t a slave was 
connected to IR6 of the master, the slaves 
100-2 bits should be set to IDa= 0, ID1= 1, 
andID2=1. 


IDO-ID2 
(Slave): 


ICW4 


The 8259A will only accept ICW4 if it was selected in 
ICW1 (bit IC4= 1). Various modes are offered by using 
ICW4. Bit definition of ICW4 is as follows: 


I'PM: 
The I'PM bit allows for selection of either the 
MCS-80/85or MCS-86/88mode. If set as a 1 the 
MCS-86/88 mode 
is 
selected, 
if 
a 0, the 


MCS-80/85mode is selected. 
. 


AEOI: 
The AEOI bit is used to select the automatic 
end 
of 
interrupt 
mode. 
If 
AEOI = 1, 
the 


automatic end of interrupt mode is selected. If 
AEOI= 0, it isn't selected; thus an EOI com- 
mand must be used during a service routine_ 


MIS: 
The MIS bit is used in conjunction with the buf- 
fered mode. If in the buffered 
mode, MIS 


defines whether the 8259A is a master or a 
slave. When MIS is set to a 1, the 8259A 
operates as the master; when MIS is 0, it 
operates as a slave. If not programmed in the 
buffered mode, the state of the MIS bit is 
ignored. 


The BUF bit is used to designate operation in 
the buffered mode, thus controlling the use of 
the SP/EN pin. If BUF is set to a 1, the buffered 
mode is programmed and SP/EN is used as a 
transceiver enable output. If BUF is 0, the buf- 
fered mode isn't programmed and SP/EN is 
used for master/slave selection. Note if ICW4 
isn't programmed, SP/EN is used for master/ 
slave selection. 


SFNM: 
The SFNM bit designates 
selection 
of the 
special fully nested mode which is used in 
conjunction 
with the cascade mode. Only the 
master should be programmed in the special 
fully nested mode to assure a truly fully nested 
structure among the slave IR inputs. If SFNM 
is set to a 1, the special fully nested mode is 
selected; if SFNM is 0, it is not selected. 


4.2 OPERATIONAL COMMAND WORD (OCWs) 


Once initialized by the ICWs, the 8259A will most likely 
be operating in the fully nested mode. At this point, 
operation can be further controlled or modified by the 
use ~f 
OCWs (Operation 
Command Words). Three 
OCWs are available for programming various modes and 
commands. Unlike the ICWs, the OCWs needn't be in 
any type of sequential order. Rather, they are issued by 
the processor as needed within a program. 


Figure 23, the OCW programming format, shows the bit 
designation and short definition of each OCW. With the 
OCW format as reference, the functions of each OCW 
will be explained individually. 


OCW1 


OCW1 is used solely for 8259A masking operations. It 
provides a direct link to the IMR (Interrupt Mask Regis- 
ter). The processor can write to or read from the IMR via 
OCW1. The OCW1 bit definition is as fqllows: 


MO-M7: The MO-M7 bits are used to control the mask- 


ing of IR inputs. If an M bit is set to a 1, it will 
mask the corresponding IR input. A 0 clears 
the mask, thus enabling the IR input. These 
bits convey the same meaning when being 
read by the processor for status update. 


OCW2 


OCW2 is used for end of interrupt, automatic rotation, 
and specific rotation operations. Associated commands 
and modes of these operations (with the exception of 
AEOI initialization), are selected using the bits of OCW2 
in a combined fashion. 
Selection of a command or 
mode should be made with the corresponding table for 
OCW2 in the OCW programming 
format (Figure 20), 
rather than on a bit by bit basis. However, for com- 
pleteness of explanation, bit definition of OCW2 is as 
follows: 


LO-L2: 
The LO-L2 bits are used to designate an inter- 
rupt level (0-7) to be acted upon for the opera- 
tion selected by the EOI, SL, and R bits of 
OCW2. The level designated 
will 
either be 
used to reset a specific 
ISR bit or to set a 
specific priority. The LO-L2 bits are enabled or 
disabled by the SL bit. 
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The EOI bit is used for all end of interrupt com- 
mands (not automatic end of interrupt mode). 
If set to a 1, a form of an end of interrupt com- 
mand will be executed depending on the state 
of the SL and R bits. If EOI is 0, an end of inter- 
rupt command won't be executed. 


The SL bit is used to select a specific level for 
a given operation. If SL is set to a 1, the LO-L2 
bits are enabled. The operation selected by the 
EOI and 
R bits 
will 
be executed 
on 
the 
specified interrupt level. If SL is 0, the LO-L2 
bits are disabled. 


The R bit is used to control all 8259A rotation 
operations. If the R bit is set to a 1, a form of 
priority rotation will be executed depending on 
the state of SL and EOI bits. If R is 0, rotation 
won't be executed. 


OCW3 


OCW3 is used to issue various modes and commands to 
the 8259A. There are two main categories of operation 
associated with OCW3: interrupt status and interrupt 
masking. Bit definition of OCW3 is as follows: 


RIS: 
The RIS bit is used to select the ISR or IRR for 
the read register command. If RIS is set to 1, 
ISR is selected. If RIS is 0, IRR is selected. The 
state of the RIS is only honored if the RR bit is 
a 1. 


RR: 
The RR bit is used to execute the read register 
command. If RR is set to a 1, the read register 
command is issued and the state of RIS deter· 
mines the register to be read. If RR is 0: the 
read register command isn't issued. 


P: 
The P bit is used to issue the poll command. If 
P is set to a 1, the poll command is issued. If it 
is 0, the poll command isn't issued. The poll 
command will override a read register com- 
mand if set simuitaneously. - 


SMM: 
The SMM bit is used to set ·the special mask 
mode. If SMM is set to a 1, the special mask 
mode is selected. If it is 0, it is not selected. 
The state of the SMM bit is only honored if it is 
enabled by the ESMM bit. 


ESMM: 
The ESMM bit is used to enable or disable the 
effect of the SMM bit. If ESMM is set to a 1, 
SMM is enabled. If ESMM is 0, SMM is dis- , 
u 


abled. This bit is useful to prevent interference 
of mode and command selections in OCW3. 


5. APPLICATION EXAMPLES 


In t~is section, the 8259A is shown in three different ap- 
plication examples. The first is an actual design imple· 
mentation supporting an 8080Amicroprocessor 
system, 


"Power Fail/Auto Starf with Battery Back-Up RAM". The 
second is a conceptual example of inc'orporating more 
than 64 interrupt levels in an 8080A or 8085A system, 
"78 Level Interrupt System". The third application is a 
conceptual design using an 8086 system, "Timer Con- 
trolled 
Interrupts". 
Although 
specific 
microprocessor 
systems are used in each example, these applications 
can be applied to either MCS-80, MCS-85, MCS-86, or 
MCS-88systems, providing the necessary hardware and 
software changes are made. Overall, these applications 
should serve as a useful guide, illustrating 
the various 
procedures in using the 8259A. 


5.1 POWER FAIL/AUTO·START WITH BATTERY 
BACK·UP RAM 
. I 


The first application 
illustrates 
the 8259A used in an 
8080A system, supporting a battery back-up scheme for 
the RAM (Random Access Memory) in a microcomputer 
system. Such a scheme is important in numerical and 
process control applications. The entire microcomputer 
system 
could 
be supported 
by a 
battery 
back-up 
scheme, however, due to the large amount of current 
usually required and the fact that most machinery is not 
supported by an auxiliary power source, only the state 
of calculations and variables usually need to be saved. 
In the event of a loss of power, if these items are not 
already stored in RAM, they can be transferred there and 
saved using a simple battery back-up system. 


The vehicle 
used in this 
application 
is the 
Intel<!> 


SBC-80/20Single Board Computer. An 8259A is used in 
the SBC-80/20along with control lines helpful in imple- 
menting 
the 
power-down and automatic 
restart 
se- 


quence used in a battery back-up system. The SBC-80/20 
also contains user-selectable jumpers which allow the 
on-board RAM to be powered by a supply separate from 
the suppiy used for the non·RAM components. Also, the 
output of an undedicated latch is available to be con- 
nected to the IR inputs of the 8259A (the latch is cleared 
via an output port). In addition, an undedicated, buffered 
input line is provided, along with an input to the RAM 
decoder that will protect memory when asserted. 


The additional 
circuitry 
to 
be described 
was 
con- 


structed o.nan SBC-905 prototyping board. An SBC-635 
power supply was used to power the non-RAM section 
of the SBC-80/20while an external DC supply was used 
to simulate the back-up battery supplying power to the 
RAM. The SBC-635 was used since it provides an open 
collector 
ACLO output 
which 
indicates 
that the AC 
input line voltage is below 103/206VAC (RMS). 


The following is an example of a power-down and restart 
sequence that introduces the various power fail signals. 


1. An AC power failure occurs and the ACLO goes high 
(ACLO is pulled up by the battery supply). This indi- 
cates that DC power will be reliable for at most 7.5 
ms. The power fail circutry generates a Power Fail In- 
terrupt (PFI) signal. This signal sets the 'PFj latch, 
which is connected to the IRa input of the 8259A, and 
sets the Power Fail Sense (PFS) latch. The state of 
this latch will indicate to the processor, upon reset, 
whether it is coming up from a power failu(e (warm 
start) or if it is coming up initially (cold start). 
, 
I 
-.'-4. 


2. The processor is interrupted by the 8259A when the 
PFI latch is set. This pushes the pre-power-down pro- 
gram counter, onto the stack and calls the service 
routine for the IRa input. The IRa service routine 
saves the processor status and any other needed 
variables. 
The routine 
should 
end with 
a HALT 
instruction 
to minimize bus transitions. 


3. After a predetermined length of time (5 ms in this ex- 
ample) the power fail circuitry 
generates a Memory 
Protect (MPRO) signal. All processing for the power 
failure (including the interrupt response delays) must 
be completed within this 5 ms window. The MPRO 
signal ensures that spurious transitions 
on the sys- 
tem control bus caused by power going down do not 
alter the contents of the RAM. 


4. DC power goes down. 


5. AC power returns. The power-on reset circuitry on the 
SBC-80/20generates a system RESET. 


6. The processor reads the state of the PFS line to 
determifle 
the ,appropriate start-up sequence. The 
PFS latch is cleared, the MPRO signal is removed, 
and the PFI latch driving IRa is cleared by the Power 
Fail Sense Reset (PFSR)signal. The system then con- 
tinues from the pre-power-d-ownlocation for a warm 
start by restoring the processor status and popping 
the pre-power-down program counter off the stack. 


\'----- 


Figure 25 shows the block diagram for the system. 
Notice that the RAM, the RAM decoder, and the power- 
down circuitry are powered by the battery supply. 


The schematic of the power-down circuitry 
and the 


SBC-80/20interface is shown in Figure 26.The design is 
very straightforward and uses CMOS logic to minimize 
the battery current requirements. The cold start switch 
is necessary to ensure that during a cold start, the PFS 
line is indicating "cold start" sense (PFShigh). Thus, for 


a cold start, the cold start switch is depressed during 
power on. After that, no further action is needed. Notice 
that the PFI signal sets the on-board PFI latch. The out- 
put of this latch drives the 8259A IROinput. This latch is 
cleared during the restart routine by executing an OUT- 
put o4H instruction. The state of the PFS line may be 
read on the least significant data bus line (oBO)by exe- 
cuting an INput o4H instruction. An 8255 port (8255#1, 
port C, bit 0) is used to control the PFSR line. 
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The fully nested mode for the 8259A is used in its initial 
state to ensure the IROalways has the highest priority. 
The remaining IR inputs can be used for any other pur- 
pose in the system. The only constraint is that the ser- 
vice routines must enable interrupts as early as possi- 
ble. Obviously, this is to ensure that the power-down in- 
terrupt does not have to wait for service. If a rotating 
priority 
scheme is desired, another 8259A could be 


added as a slave and be programmed to operate in a 
rotating mode. The master would remain in the initial 
state of the fully nested mode so that the IROstill reo 
mains the highest priority input. 


The software to support the power-down circuitry 
is 


shown in Figure 27. The flow for each label will be 
discussed. 


After any system reset, the processor starts execution 
at location OOOOH(START).The PFS status is read and 
execution is transferred to CSTART if PFS indicates a 
cold start (i.e., someone is depressing the cold start 
switch) or WSTART if a warm start is indicated (PFS 
LOW). CSTART is the start of the user's program. The 
Stack Pointers (SP) and device initialization 
were in- 
cluded just to remind the reader that these must occur. 
The first EI instruction must appear after the 8259A has 
received its initialization 
sequence. The 8259A (and 
other devices) are initialized in the INIT subroutine. 


When a power failure occurs, execution is vectored by 
the 8259A to REGSAV by way of the jump table at 
JSTART.The pre-power-down program counter is placed 
on the stack. REGSAV saves the processor registers 
and flags in the usual manner by pushing them onto the 
stack. Other items, such as output port status, program- 


mabie peripheral states, etc., are pushed onto the stack 
at this time. The Stack Pointer (SP)could be pushed on- 
to the stack by way of the register pair HL but the top of 
the stack can exist anywhere in memory and there is no 
way then of knowing where that is when in the power-up 
routine. Thus, the SP is saved at a dedicated location in 
RAM. It isn't really necessary to send an EOI command 
to the 8259A in REGSAV since power will be removed 
from the 8259A, but one is included for completeness. 
The final instruction 
before actually losing power is a 


HALT. This minimizes somewhat spurious transitions 
on the various busses and lets the processor 
die 


gracefully. 


On reset, when a warm start is detected, execution is 
transferred to WSTART. WSTART activates PFSR by 
way of the 8255 (all outputs go low then the 8255 is ini- 
tialized). In the power-down circuitry, PFSR clears the 
PFS latch and removes the MPRO signal which then 
allows access to the RAM. WSTARTalso clears the PFI 
latch which arms the 8259A IROinput. Then the 8259A is 
re-initialized along with any other devices_ The SP is 
retrieved from RAM and the processor registers and 
flags are restored by popping them off the stack. Inter- 
rupts are then enabled. Now the power-down program 
counter is on top of the stack, so executing a RETurn in- 
struction transfers the processor to exactly where it left 
off before the power failure. 


Aside from illustrating the usefulness of the 8259A(and 
the SBC-80/20) in implementing 
a power failure pro- 
tected microcomputer system, this application should 
also point out a way of preserving the processor status 
when using interrupts. 
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5.2 78 LEVEL INTERRUPT SYSTEM 


The second application illustrates an interrupt structure 
with greater than 64 levels for an 8080A or 8085A sys- 
tem. In the cascade mode, the 8259A supports up to 64 
levels with direct vectoring to the service routine. Ex- 
tending the structure to greater than 64 levels requires 
polling, using the poll command. A 78 level interrupt 
structure is used as an illustration; 
however, the prin- 
ciples apply to systems with up to 512 levels. 


To implement the 78 level structure, 3 tiers of 8259A's 
are used. Nine 8259A's are cascaded in the master-slave 
scheme, giving 
64 levels at tier 
2. Two additional 


8259A's are connected, by way of the INT outputs, to 
two of the 64 inputs. The 16 inputs at tier 3, combined 
with the 62 remaining tier 2 inputs, give 78 total levels. 
The fully nested structure is preserved over all levels, 
although direct vectoring is supplied for only the tier 2 
inputs. Software is required to vector any tier 3 re- 
quests. Figure 28 shows the tiered structure used in this 
example. Notice that the tier 3 8259A's are connected to 
the bottom 'Ievel slave (SA7).The master-slaves are inter- 
connected as shown in "Interrupt Cascading", while the 
tier 3 8259A's are connected as "masters"; 
that is, the 


SP/EN pins are pulled high and the CAS pins are left un- 
connected. Since these 8259A's are only going to be 
used with the poll command, no INTA is required, there- 
fore the INTA pins are pulled high. 
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SAO<l 


SA. 
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INTA 


$P07 


INTA 
SA10 
SP 
INTA 
S800 


S •• 


MASTER 


IR1 
M1 
'NT 
SA 17 
INT 
5807 


SA70 
Sf 
INTA 
S810 


SA1 


INT 
INTA 
S., 


SA76 


IR1 
M1 
INT 
SA77 
INT 
SB17 


t. 


The concept 
used to implement the 78 levels is to 
directly vector to all tier 2 input service routines. If a tier 
2 input contains a tier 3 8259A, the service routine for 
that input will poll the tier 3 8259Aand branch to the tier 
3 input service routine based on the poll word read after 
the poll command. Figure 29 shows how the jump table 
is organized assuming a starting location of 1000Hand 
contiguous tables for all the tier 2 8259A's. Note that 
"SA35" denotes the IR5 input of the slave connected to 
the master IR3 input. Also note that for the normal tier 2 
inputs, the jump table vectors the processor directly to 
the service routine for that input, while for the tier 2 in· 
puts with 8259A's connected to their IR inputs, the proc- 
essor is vectored to a service routine (i.e., S60) which 
will poll to determine the actual tier 3 input requesting 
service. The polling routine utilizes the jump table start- 
ing at 1200Hto vector the processor to the correct tier 3 
service routi ne. 


Each 8259A must receive an initialization 
sequence 
regardless of the mode. Since the tier 1 and 2 8259A's 
are in cascade and the special fully nested mode is used 
(covered shortly), all 
ICWs are required. The tier 3 
8259A's don't require ICW3 or ICW4 since only polling 
will be used on them and they are connected as masters 
not in the cascade mode. The initialization sequence for 
each tier is shown in Figure 30. Notice that the master is 
initialized with a "dummy" jump table starting at OOH 
since all vectoring 
is done by the slaves. The tier 3 
devices also receive "dummy" tables since only polling 
is used on tier 3. 


As explained in "Interrupt 
Cascading", to preserve a 
truly fUlly nested mode within a slave, the master 8259A 
should be programmed in the special fUlly nested mode. 
This allows the master to acknowledge all interrupts at 
and above the level in service disregarding only those of 
lower priority. The special fully nested mode is pro· 
grammed in the master only, so it only affects the im- 
mediate slaves (tier 2 not tier 3). To implement a fully 
nested structure 
among tier 3 slaves some special 
housekeeping software is required in all the tier-2-with- 
tier·3·slave routines. The software should simply save 
the state of the tier 2 IMR, mask all the lower tier 2 inter· 
rupts, then issue a specific EOI, resetting the ISR of the 
tier 2 interrupt level. On completion of the routine the 
IMR is restored. 


Figure 31 shows an example flow and program for any 
tier 2 service routine without a tier 3 8259A. Figure 32 
shows an example flow and program for any tier 2 ser- 
vice routine with a tier 3 8259A. Notice the reading of the 
ISR in both examples; this is done to determine whether 
or not to issue an EOI command to the master (refer to 
the section on "Special Fully Nested Mode" for further 
details). 


LOCATION 
8259 
CODE 
COMMENTS 


lOaD H 
SAO 
JMP 
SADD 
; SAOO SERVICE 
ROUTINE 


101C H 
JMP 
SA07 
; SAD7 
SERVICE 
ROUTINE 


1020 H 
SA' 
JMP 
SA1Q 
: SA 10 SERVICE 
ROUTINE 


l03C 
H 
JMP 
SAl7 
; SA 17 SERVICE 
ROUTINE 


; SA20-SAG7 
SERVICE 
ROUTINES 


10EO H 
SA7 
JMP 
SAlO 
; SA70 
SERVICE 
ROUTINE 


lDF8 
H 
JMP 
SBO 
; sea POLL 
ROUTINE 


10FC H 
JMP 
SB' 
; S8l 
POLL ROUTINE 


1200 H 
SBO 
JMP 
S800 
; S800 
SERVICE 
ROUTINE 


121C H 
JMP 
S807 
; S807 
SERVICE 
ROUTINE 


1220 H 
SBl 
JMP 
5810 
; 5810 
SERVICE 
ROUTINE 


123C H 
JMP 
SB17 
; SB17 
SERVICE 
ROUTINE 


Figure 29. Jump Table Organization 


MINT: 
MYI 
OUT 
MV! 
OUT 
MV! 
OUT 
MV! 
OUT 


A,15H 
MPTA 
A,OOH 
MPTB 
A,OFFH 
MPTB 
A,laH 
MPTB 


; ICWI,LTM=O,ADI=l,S=O.IC4=1 
; MASTER 
PORT AD = 0 


; ICW2, DUMMY 
ADDRESS 


; MASTER 
PORT AO",,1 


; ICW3, S7-S0 = 1 
; MASTER 
PORT AO= 1 


; ICW4, SFNM = 1 
; MASTER 
PORT AO= 1 


SAXINT: 
MVI 
OUT 
MV! 
OUT 
MV! 
OUT 
MV! 
OUT 


A,If 
SAXPTA 
A,10H 
SAXPTB 
AOXH 
SAXPTB 
A10H 
SAXPTB 


; SEE KEY FOR ICW1, lTM =0, 
ADI = 1, s=o, 
IC4= 
1 
: SA"X" 
PORT AO= 0 


: ICW2, ADDRESS 
MSB 


; SA"X" 
PORT Ao = 1 
: ICW3, SA 10 
: SA"X" 
PORT AO= 1 


; ICW4, SFNM = 1 
; SA"X"PORTAO=1 


SBXINT 
MVI 
OUT 
MV' 
OUT 


A,16H 
SBXPTA 
A,DOH 
SBXPTB 


; ICW1.lTM=O,ADI=1,S=1,IC4=O 
; SB"X" 
PORT AO= 0 


: ICW2, DUMMY 
ADDRESS 


: SB"X" 
PORT AO:z 1 


SA INITIALIZATION 
KEY 


SA"X" 
a (ICW1) 
JUMP TABLE START (H) 


0 
15 
1000 
1 
35 
1020 
2 
55 
1040 
3 
75 
1060 
4 
95 
1080 
5 
B5 
10AO 
5 
D5 
10CO 
7 
F5 
10EO 


SA 1(: 
PUSH 
D 


PUSH 
B 


PUSH 
H 
PUSH 
PSW 
EI 


: 
SAVE 
DE 


: SAVE 
BC 


: SAVE 
Hl 


: SAVE 
A, 
FLAGS 


; 
ENABLE 
INTERRUPTS 


01 
MVI 
OUT 


MUI 


OUT 
IN 
ANI 


JZN 
MVI 


OUT 


sAI(RsR: 
POP 


POP 


POP 
POP 
EI 


RET 


20, 
sAI(PTA 


A.OB,. 


sAI(PTA 
sAI(PTA 


OFFH 
sAI(RsR 


A,OBH 
MAsPTA 


PSW 
H 
B 
o 


: 
DISABLE 
INTERRUPTS 


; 
OCW2, 
NON·sPECIFIC 
EOI 


: 
sA"I(" 
PORT 
AO = 0 


: 
OCW3, 
READ 
REGISTER, 
IsR 


; 
sA"I(" 
PORT 
AO '" 0 


; 
sA"I(" 
PORT 
AO = 0, sA"I(" 
IsR 


; 
TEST 
FOR 
ZERO 


; 
IF NOT 
ZERO, 
RESTORE 
STATUS 


; 
OCW2, 
NON·sPECIFIC 
EOI 


; 
MASTER 
PORT 
AO = 0 


; 
RESTORE 
A, 
FLAGS 


: 
RESTORE 
Hl 


; 
RESTORE 
BC 


: 
RESTORE 
DE 


: 
ENABLE 
INTERRUPTS 


: 
RETURN 


.: 
sB"I(" 
ROUTINE 
- SERVICE 
ROUTINE 
FOR 
TIER 
2 


: 
INTERRUPTS 
WITH 
TIER 
3 8259As 


sBI(: 
PUSH 
D 
: 
SAVE 
DE 


PUSH 
B 
: 
SAVE 
BC 
PUSH 
H 
: SAVE 
Hl 
PUSH 
PSW 
; SAVE 
A, 
flAGS 


IN 
sAXPTB: 
READ 
sA"X" 
IMR 
MOV 
D,A 
; SAVE 


MVI 
A,XXH 
; 
MASK 
sA"X" 
lOWER 
IR 


OUT 
sAXPTB 
; s.o."X" 
PORT 
AO t; 1 


MVI 
A,6XH 
; OCW2 
SPECIFIC 
EOI 
SA"X" 


OUT 
sAXPTA 
: 
sA"X" 
PORT 
AO •• 1 


lXI 
H,1200H: 
JUMP 
TABLE 
START 
MVI 
B,DOH 
: 
CLEAR 
B 


MVI 
A,OCH 
: OCW3, 
POll 
COMMAND 


OUT 
sBXPTA 
: 
sB"X" 
PORT 
AO •• O 


IN 
sBXPTA: 
GET 
POll 
WORD 


ANI 
07H 
: 
LIMIT 
TO 3 BITS 


ADO 
A 
: GET 
TABLE 
OFFSET 


ADO 
A 


MOV 
C,A 
DAD 
B 
EI 


sBXRET 
01 
MVI 
OUT 
MVI 
OUT 


IN 
ANI 


JNZ 
MVI 
OUT 
sBXRsR: 
MOV 


OUT 
POP 


POP 
POP 


POP 
EI 
RET 


A,20H 
SBXPTA 


A,OBH 
SAXPTA 


sBXPTA 


OFFH 
SBXRSR 


A,20H 
MAsPTA 
A,D 
SAXPTB 


PSW 
H 


B 
o 


; 
DISABLE 
INTERRUPTS 


; OCW2, 
NON 
SPECIFIC 
EOI 


; 
sA"X" 
PORT 
AO",O 


~ ~;'~~' 
:~~ 
:oE2~STER 
IsR 


: 
sA"X" 
PORT 
AO '" 0, ISR 


: 
TEST 
FOR 
ZERO 


: 
IF,. 
0 RESTORE 
IMR 


: OCW2, 
NON·SPECIFIC 
EOI 


; 
MASTER 
PORT 
AO •• 0 


; 
RESTORE 
sA"X" 
IMR 


; 
sA"X" 
PORT 
AO= 
1 


: 
RESTORE 
A, FLAGS 
: 
RESTORE 
Hl 


; 
RESTORE 
BC 


; 
RESTORE 
BC 


: 
RESTORE 
DE 


; 
RETURN 


In a large number of controller 
type microprocessor 
designs, certain timing 
requirements must be imple· 
mented 
throughout 
program 
execution. 
Such 
time 
dependent applications 
include control of keyboards, 
displays, CRTs, printers, and various facets of industrial 
control. These examples, however, are just a few of 
many designs which require device servicing at specifiC 
rates or generation of time delays. Trying to maintain 
these timing requirements by processor control alone 
can be costly in throughput and software complexity. 
So, what can be done to alleviate this problem? The 
answer, use the 8259A Programmable Interrupt Con· 
troller and external timing to interrupt the processor for 
time dependent device servicing. 


This application example uses the 8259A for timer con· 
trolled interrupts in an 8086 system. External timing is 
done by two 8253 Programmable Interval Timers. Figure 
33 shows a block diagram of the timer controlled inter· 
rupt circuitry which was built on the breadboard area of 
an SDK·86 (system design kit). Besides the 8259A and 
the 8253's, the necessary 1/0 decoding is also shown. 
The timer controlled interrupt circuitry interfaces with 
the SDK·86which serves as the vehicle of operation for 
this design. 


A short overview of how this application operates is as 
follows. The 8253's are programmed to generate inter· 
rupt requests at specific rates to a number of the 8259A 
IR inputs. The 8259A processes these requests by inter· 
rupting the 8086 and vectoring program execution to the 
appropriate 
service 
routine. 
In 
this 
example, 
the 
routines use the SDK·86 display panel to display the 
number of the interrupt 
lev'el being serviced. These 
routines are merely for demonstration purposes to show 
the necessary procedures to establish the user's own 
routines in a timer controlled interrupt scheme. 


Let's go over the operation starting with the actual inter· 
rupt timing generation which is done by two 8253 Pro· 
grammable Interval Timers (8253#1 and 8253 #2). Each 
8253 provides three individual 16·bit counters (counters 


essor. ~acn counter nas a ClOCK Input (\';L"'), 
gate Input 
(GATE),and an output (OUT).The output signal is based 
on divisions of the clock input signal. Just how or when 
the output occurs is determined by one of the 8253's six 
programmable modes, a programmable 16·bit count, 
and the state of the gate input. 


Figure 34 shows the 8253 timing configuration 
used for 
generating interrupts to the 8259A. The SDK·86's PCLK 
(peripheral clock) signal provides a 400 ns period clock 
to CLKOof 8253#1. Counter 0 is used in mode 3 (square 
wave rate generator), and acts as a prescaler'to provide 
the clock inputs of the other counters with a 10 ms 
period square wave. This 10 ms clock period made it 
easy to calculate exact timings for the other counters. 
Counter 2 of the 8253 #1 is used in mode 2 (rate gener· 
ator), it is programmed to output a 10 ms pulse for every 
200 pulses it receives (every 2 see). The output 
of 
counter 2 causes an interrupt on IR1 of the 8259A. All 
the 8253 #2 counters are used in mode 5 (hardware trig· 
gered strobe) in which the gate input initiates counter 
operations. In this case the output of 8253 #1 counter 2 
controls the gate of each 8253 #2 counter. When one of 
the 8253 #2 counters receive the 8253 #1 counter 2 out· 
put pulse on its gate, it will output a pulse (10 ms in 
duration) 
after a certain 
preprogrammed number of 
clock pulses haveoccurred. The programmed number of 
clock pulses for the 8253 #2 counters is as follows: 50 
pulses (0.5 see) for counter 0, 100 pulses (1 see) for 
counter 1, and 150 pulses (1.5 sec) for counter 2. The 
outputs of these counters cause interrupt requests on 
IR2 through IR4 of the 8259A. Counter 1 of 8253 #1 is 
used in mode 0 (interrupt on terminal count). Unlike the 
other 
modes ,used 
which 
initialize 
operation 
auto- 
matically or b~ gate triggering, mode 0 allows software 
controlled counter initialization. When counter 1 of 8253 
#1 is set during program execution, it will count 25 
clocks (250ms) and then pull its output high, causing an 
interrupt request on IROof the 8259A. Figure 35 shows 
the timing generated by the 8253's which cause inter- 
rupt request on the 8259A IR inputs. 
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There are basically two methods of timing generation 
that can be used in a timer controlled interrupt struc- 
ture: dependent timing and independent timing. Depen- 
dent timing uses a single timing occurrence as a refer- 
ence to base other timing occurrences on. On the other 
hand, independent timing has no mutual reference be- 
tween occurrences. Industrial controller type applica- 
tions are more apt to use dependent timing, whereas in- 
dependent timing is prone to individual device control. 


Although this application uses primarily dependent tim- 
ing, independent timing 
is also incorporated as an 


example. The use of dependent timing can be seen back 
in Figure 34, where timing for IR2 through IR4 uses the 
IR1pulse as reference. Eachone of the 8253#2 counters 
will generate an interrupt request a specific amount of 
times after the IR1 interrupt request occurs. When using 
the dependent method, as in this case, the IR2 through 
IR4 requests must occur before the next IR1 request. 
Independent timing is used to control the IROinterrupt 
request. Note that its timing isn't controlled by any of 
the other IR requests. In this timer controlled interrupt 
configuration the dependent timing is initially set to be 
self running and the independent timing is software 
initialized. However, both methods can work either way 
by using the various 8253 modes to generate the same 
interrupt timing. 


The 8259A processes the interrupts generated by the 
8253's according to how it is programmed. In this appli- 
cation it is programmed to operate in the edge triggered 
mode, MCS-86/88mode, and automatic EOI mode. In the 
edge triggered mode an interrupt request on an 8259A 


IR input becomes active on the ri~ing edge. With this in 
mind, Figure 35 shows that IROwill generate an inter- 
rupt every half second and IR1 through IR4 will each 
generate.an interrupt every 2 seconds spaced apart at 
half 
second 
intervals. 
Interrupt 
vectoring 
in 
the 


MCS-86/88mode is programmed so IRO,when activated, 
will select interrupt type 72. This means IR1 will select 
interrupt 
type 73, IR2 interrupt 
type 74, and so on 


through IR4.Since IR5 through IR7 aren't used, they are 
masked off. This prevents the possibility 
of any acci- 


dental 
interrupts 
and rids the necessity 
to tie the 


unused IR inputs to a steady level. Figure 36 shows the 
8259A IR levels (IRO-IR4)with their corresponding inter- 
rupt type in the 8086 interrupt-vector table. Type 77 in 
the table is selected by a software "I NT" instruction 
during program execution. Each type is programmed 
with 
the 
necessary code segment 
and instruction 
pointer values for vectoring to the appropriate service 
routine. Since the 8259A is programmed in the auto- 
matic EOI Mode, it doesn't require an EOI command to 
designate the completion of the service routine. 
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As mentioned earlier, the interrupt service routines in 
this application 
are used merely to demonstrate 
the 
timer controlled 
interrupt scheme, not to implement a 
particular design. Thus a service routine simply displays 
the number of its interrupting 
level on the SDK-86 dis- 
play panel. The display panel is controlled by the 8279 
Keyboard and Display Controller. 
It is initialized 
to 
display "Ir" in its two left-most digits during the entire 
display sequence. When an interrupt from IR1 through 
IR4 occurs the corresponding routine will display its IR 
number via the 8279. During each IR1 ttHough IR4 serv- 
ice routine a software "INT77" instruction 
is executed. 


This instruction vectors program execution to the serv- 
ice routine designated by type 77, which sets the 8253 
counter controllin'g IROso it will cause an interrupt in 
250 ms. When the IRO interrupt occurs its routine will 
turn off the digit 
displayed 
by the IR1 through 
IR4 
routines. Thus each fR level (IR1-IR4) will be displayed 
for 250 ms followed by a 250 ms off time caused by IRO. 
Figure 37 shows the entire display sequence of the 
timer controlled interrupt application. 
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Figure 37. SDK Dlspley 
Sequence 
for Timer Controlled 
Interrupts 
Program (Each Display Block Shown is 250 msec 
In Duretlon) 


Now that we've covered the operation, let's move on to 
the program flow and structure of the timer controlled 
interrupt program. The program flow is made up of an 
initialization 
section and six interrupt service routines. 


The initialization 
program flow is shown in Figure 38. It 
starts by initializing some of the 8086's registers for pro- 
gram operation; this includes the extra segment, data 
segment, stack segment, and stack pointer. Next, by 
using the extra segement as reference, interrupt types 
72 through 77 are set to vector interrupts to the appro- 
priate routines. This is done by moving the code seg- 
ment and instruction 
pointer values of each service 
routine into the corresponding type location. The 8253 
counters are then programmed with the proper mode 
and count to provide the interrupt 
timing 
mentioned 
earlier. All counters with the exception of the 8253 #1, 
counter 1 are fully initialized at this point and will start 
counting. Counter 1 of 8253 #1 starts counting when its 
counter is loaded during the "INTR77" service routine, 
which will be covered shortly. Next, the 8259A is issued 
ICW1, ICW2, ICW4, and OCW1. The ICWs program the 


8259A for the edge triggered 
mode, automatic 
EOI 
mode, and the proper interrupt vectoring (IRO,type 72). 
OCW1 is used to 
mask off 
the 
unused 
IR inputs 
(IR5-IR7). The 8279 is then set to'display "IR" on its two 
left-most digits. After that the 8086 enables interrupts 
and a "dummy" main program is executed to wait for in- 
terrupt requests. 


There are six different interrupt service routines used in 
the program. Five of these routines, "INTR72" through 
"INTR76", are vectored to via the 8259A. Figure 39A·C 
shows the program flow for all six service routines. Note 
that "INTR73" through "INTR76" (IR1-IR4) basically'use 
the same flow. These four similar routines display the 
number of its interrupting IR level on the SDK·86display 
panel. The "INTR77" routine is vectored to by software 
during each of the previously mentioned routines and 
sets up interrupt timing to cause the "INTR72" (IRO) 
routine to be executed. The "INTR72" routine turns off 
the number on the SDK-86 display panel. 


To best explain how these service routines work, let's 
assume an interrupt occurred on IR1 of the 8259A. The 
associated service routine for IR1 is "INTR73". Entering 
"INTR73", the first thing done is saving the pre-interrupt 
program status. This isn't really necessary in this pro- 
gram since a "dummy" main program is being executed; 
however, it is done as an example to show the operation. 
Rather than having code for saving the registers in each 
separate routine, a mutual call routine, "SAVE", is used. 
This routine will save the register status by pushing it 
on the stack. The next portion of "INTR73" will display 
the number of its IR level, "1", in the first digit of the 
SDK-86 display panel. After that, a software INT instruc- 
tion is executed to vector program execution 
to the 


"INTR77" service routine. The "'NTR77" service routine 
simply sets the 8253 #1 counter 1 to cause an interrupt 
on IROin 250 ms and then returns to "INTR73". Once 
back in "'NTR73", the pre-interrupt status is restored by 
a call routine, 
"RESTORE". It does the opposite 
of 


"SAVE", returning the register status by popping it off 
the stack. The "INTR73" 
routine then returns to the 


"dummy" 
main program. The flow for the "INTR74" 


through "INTR76" routines are the same except for the 
digit location and the IR level displayed. 


After 250 ms have elapsed, counter 1 of 8253 #1 makes 
an interrupt request on IROof the 8259A. This causes 
the "INTR72" service routine to be executed. Since this 
routine interrupts 
the main program, it also uses the 
"SAVE" routine to save pre-interrupt program status. It 
then turns off the digit displaying the IR level. In the 
case of the "INTR73" routine, the "1" is blanked out. 
The pre-interrupt 
status 
is then 
restored 
using 
the 
"RESTORE" routine and program execution returns to 
the "dummy" main program. 


The complete program for the timer controlled 
inter· 


rupts application is shown in Appendix B. The program 
was executed in SDK-86 RAM starting at location 0500H 
(code segment = 0050, instruction 
pointer= 0). 


CONCLUSION 


This application note has explained the 8259A in detail 
and gives three applications illustrating the use of some 
of the numerous programmable features available. It 
should 
be evident 
from these discussions 
that 
the 


8259A is an extremely flexible and easily programmable 
member of the Intel~ MCS·80, MCS·85, MCS-86, and 
MCS·88 families. 


This table is provided merely for reference information 
between the "Operation 
of the 8259A" and "Programming 
the 
8259A" sections 
of this application 
note. It shouldn't 
be used as a programming 
reference guide (see "Programming 


the 8259A"). 


Operational 
Description 


MCS-80185™ Mode 


Address Interval for MCS·80/85 
Mode 


Interrupt Vector Address for MCS-80/85 
Mode 


MCS-86/88 Mode 


Interrupt 
Vector Byte for MCS-86/88 Mode 


Fully Nested Mode 


Non-Specific 
EOI Command 


Specific 
EOI Command 


Command 
Words 


ICW1,ICW4' 


ICW1 


ICW1,ICW2 


ICW1.ICW4 


ICW2 


OCW-Default 


OCW2 


OCW2 


IC4,I'PM' 


ADI 


A5-A15 


IC4.I'PM 


T3-T7 


Automatic 
EOI Mode 


Rotate On Non-Specific 
EOI Command 


Rotate In Automatic 
EOI Mode 


Set Priority Command 


Rotate on Specific 
EOI Command 


Interrupt 
Mask Register 


Special Mask Mode 


Level Triggered 
Mode 


Edge Triggered 
Mode 


Read Register Command, IRR 


Read Register Command, ISR 


Read IMR 


Poll Command 


Cascade Mode 


ICW1,ICW4 


OCW2 


OCW2 


OCW2 


OCW2 


OCW1 


OCW3 


ICW1 


ICW1 


OCW3 


OCW3 


OCW1 


OCW3 


ICW1,ICW3 


EOI 


SEOI, EOI, 
LO-L2 


IC4, AEOI 


EOI 


R. SEOI, EOI 


LO-L2 


R, SEOI. EOI 


MO-M7 


ESMM-SMM 


LTIM 


LTIM 


ERIS, RIS 


ERIS, RIS 


MO-M7 


P 


SNGL, SO-7, 
100-2 


IC4, SFNM 


IC4, BUF, 
MIS 


Special Fully Nested Mode 


Buffered 
Mode 


ICW1,ICW4 


ICW1.ICW4 


ISIS-II 
i'!CS-3(; HSSCHBLER ~'1 ~ kSSEI1Bl.'; OF HOC-AU rCI59A 
OBJECT MOOULEPLACED IN 
1'1 TCI59fl 
OSJ 
ASSEMBlER.INVOKEDB'r'F1ti5t136 
F1 TCl59il5RC 


9120 
0128 04l:l1 
0122''1'1? 
8124 1891 
0126 ?"?? 
0128 3001 
012k ???? 
012e 4801 
012E)??'1 


81313 6001 
~132???~ 
!:lB47801 
0136"')') 


סס oo ???'1 


01192 
"Y??') 
0004 
?? 


llOO9 B8808l.l 


81193 SEC8 
90lfj 
B87099 


81198 BEDS 
9l.l9A 1jl)7888 
eeeo BE00 
000F 8(;88ll9 


ORG 
12eH 
DloJ 
INT~?2 
; T'r'f'E 72 INSTRUCTION POINTER 


DW 
? 
,T't'P£ 
72 CODE SEGMENT 


OW 
INTfl73 
; noPE 73 INSTRUCTIIJN POINTER 


OW 


.) 
; TWE 73 CODE SI:Gt1£Nl 


OW 
INTR74 
. T\'PE 74 INSTRUCTION POINT~R 


DW 
') 
; T'r'PE ;.'4 (;Ol)E SEGMENT 


l!W 
INTR75 
; T'r'rE 75 INSTRUCTION POINTER 


DW 
.) 
; noPE ;'5 CODE 5!:GMEN'! 


OW 
INHa6 
; TYPE ;'6 
INSTRUCTION POINTER 


DW 
'1 
.•T'r'PE 76 WDE SEGl'lENT 


OW 
INTR77 
; TIIPE 77 INSTRUCTION f'(JINn.R 


OW 
? 
, T't'PE 77 CODE SEGMENl 


10 
11 
12 
13 
14 
15 
16 
1.' 
lS 
19 
28 
21 
22 
23 
EXTRA ENDS 


24 • 
.• 


TP72IP 
TP72CS 
lP73IP 
TP7JC5 
TP?4If' 


Tf'74CS 
lPl5IP 
TP75CS 
TP76IP 
TP76CS 
TP77IP 
TP77CS 


25 
llATA SEGMENTDECLARATIONS 


26 
27 
DATA 
SEGl'lENT 


28 
29 
STACJ<l 
DW 
'1 
.•IIARIABlE TO SAllE CfU 
AODI<tSS 
J8 
AXTEI1P 
OW 
? 
; l/ARIRBLE TO SflIIE AX !\'£GISTEI< 
31 
DIGIT 
DB 
? 
; lIARIABLE 1I) SAllE SELECT~D 0 lfj IT 


32~~ DAm 
ErIDS 
.>.> 


34 
J5 
CODE SEGl'IENTDECLARATION 


36 
37 
CODE 
SEGl'tENT 


38 
39 
ASSIJIlE 
ES :EXTRA,DS:MTA, Cs: CODE 
48 
41 
INITIflUZE 
REGISTERS 
42 
43 
START: 
1'101/ 
Al(,0fI 
; OORA SCGl1ENTAT 0H 
44 
1«)11 
ES.AX 
45 
If.)I1 
Al(,70H 
; DATA SEGIlENT AT 798H 
46 
I'lOl/ 
D!;.AX 
47 
I«lI/ 
AX,i'8H 
; 5TIU 
SEG1'ENl AT 78eH 
48 
I«lI/ 
55. A''( 


49 
I'lOl/ 
SP.89H 
; 5TACI< POINTER AT ll8H (STflCK=8geH) 


A-168 


AP-59 


MCS-B6 ASSEMBLER 
TCI59A 


LOG 
OBJ 
LINE 
SOURCE 


50 


51 
LOAD INTERRUf'T VECTOR malE 


52 


0012 B80481 
~i3 
T'rPE.5. 
I'fIN 
AX, OfFSET 
(INTRm 
. LOA!) T'''Pl 
?2 


0015 
2CA32001 
54 
110V 
IP72IP, 
AX 


0019 
268C0£2201 
55 
I1lN 
IP72CS, CS 


00lE 
881801 
~ 
I"OV 
AX. OFFSET (JNTRm 
.•LOAD TYPE 73 


0021 
26A32401 
57 
HI)',' 
TP73IP, AX 


0025 
26BC0E2601 
58 
11O'J 
TF73CS, (;5 


002A 883001 
59 
MOY 
AX, OfFSET 
WlTR74) 
; LOAD TYPE 74 
0021) 26A3281:l1 
69 
I10V 
TP74IP, AX 


0031 
268C9E2A91 
61 
MOV 
TF'74CS, CS 


0036 
884801 
62 
110'{ 
A)(, OFFSET (INTR75) 
.•LOAO TYPE 75 


0039 
26A32C!l1 
63 
MOil 
TP75IP, ftX 


0030 
268C9E2E01 
6.4 
MOil 
Tf'i'5CS ..CS 


0042 
~.6001 
65 
MOil 
AX,OFFSD 
(INTR76l 
'LOAD T\lPE 76 


8945 
26A33001 
66 
MIJIr' 
TP76IP, AX 


8949 
268C!lE3281 
67 
MIJ\I 
TP76CS, CS 


004E !J87881 
68 
1'1011 
AX, OfFSET 
(JNTR77J 
,LOAD WPE 71 


0051 
261133401 
69 
MO',' 
TP77IP. AX 
0055 
268C0E36l.l1 
79 
HOV 
IP77CS, CS 


71 
.,,, 
8253 
INITIALIZATION 
'< 
73 


005A BAllEFF 
74 
SET531: 
I'IIJ,' 
OX,eFF9EH 
; a-;:>53Ii CONTROL WORI) 


005D 1:1036 
75 
I'IOV 
AL,36H 
, C~INTER 9, 
MODE3, 
BINARY 


005F EE 
76 
OUT 
OX.'AI.. 


0060 
Be71 
77 
/11)\1 
AL 71H 
'COUNTeR 1, 
MODE0, 
BCD 
9062 
EE 
i'B 
OUT 
OX,AL 


0063 
B0B5 
?9 
I10V 
AL, lJB5H 
; COlINTER 2. 
I'lOOE 2, 
BCD 


0065 
EE 
se 
OUT 
OX,AL 


0066 
BAOOFF 
81 
MO\I 
OX,eFF08H 
; LOAD COUNTER0 (1lm$) 


8969 
BOA8 
82 
MIJ\I 
AL0A8H 
,LS8 
ll96B 
E[ 
83 
OUT 
OX,AL 


006C 8961 
84 
I'lO\l 
AL,61H 
.•I'fSB 


006E EE 
85 
OUT 
OX,fl. 


006F 
BAOCFF 
86 
110\1 
DX,0FFOCH 
; L(JlD COllIHER 2 (25£C) 


0072 
B900 
87 
I'IOV 
AL,~ 
,LSfJ 


0074 
EE 
88 
OUT 
OX,AL 


0075 
flOO2 
89 
1'1011 
AL02H 
.o~ 


0077 
EE 
90 
OUT 
OX,AL 


0078 
1lA16FF 
91 
5ET532: 
110\1 
ox. 9FF16H 
; 8253 
12 COOROL lOt) 


0078 
803B 
92 
MO\I 
AL,3BH 
;C~_~TER 
O. ~ 
5,BCD 


8lI7D EE 
93 
OUT 
OX, AL 


OOlE 1:l07fJ 
94 
1'10\1 
AL 7BH 
; COUNTER1, 
I'IOOE 5, 
BCD 


eese 
EE 
9S 
~JT 
OX,AL 


0081 
Il0BB 
96 
MOV 
AL9BBH 
; COUNTER2, 
I100E 5. 
BCD 
0083 EE 
97 
OUT 
OX,AI.. 


0084 
BA10FF 
98 
MlN 
OX,eFFiBH 
,LOAD COlfHER 
8 (5st.C) 
088r 
Bll5e 
99 
/kJII 
AL,59H 
.oL~ 


8089 
EE 
100 
OUT 
OX,ft. 


908A El900 
101 
1'10\1 
ALOOH 
; I'fSB 


0ll8C EE 
102 
OUT 
Ox.AL 
8lI8D BA12FF 
193 
~\1 
ox. 0FF12H 
; LlR> 
COONTI:R 1 (1SE(;) 


9lI90 
BOO9 
184 
I'IOV 
AL0BH 
;L58 


A-169 


AP-59 


MC5-86 
ASSEt1BLEli 
lC!'j9R 


LOC 
OB..r 
LINE 
50U~:cE 


131392EE 
105 
OUT 
DX.•AL 


!*193 
BltOl 
Hl6 
Ml)v 
RL ij1H 
;W~: 


0095 
EE 
1137 
OUT 
Di;. At. 


0096 
BA14FF 
1138 
1'101/ 
DK. BFF14H 
.;LOlli 
COUNTE::R2 (1. 5SECl 


0099 
B05e 
109 
MOil 
Hl ..5!lH 
· L~B 


009B 
1:1 
1Hl 
OUT 
OX,Rl 


oo9C &101 
111 
1'101/ 
RL 81H 
; MSB 


e09E EE 
112 
OUl 
DX, AL 


113 


114 
8259fl 
HHT I f:LI ZRTI ON 


115 


oo9F 
BAOOFF 
116 
SET59A: 
110\1 
OX. eFFe0H 
; gz'j9A 
00=0 


00A2 
Be13 
117 
MO••• 
AL.13H 
· ICIoI1-L T11'l=0..S=1. IC4=1 


00A4 EE 
118 
em 
OX,RL 


OOR.5 BRe:iFF 
119 
1'101/ 
ox, 0FF0~'H 
; 8259R 
ro0=1 


ooRB 8048 
12ij 
1'10'./ 
AL 
4BH 
; ICIoI2-INTERRUF'T 
TYPE::r2 (12llfO 
Bl,1Afl EE 
121 
OUT 
OX, AL 


ooAB B003 
122 
110••• 
AL 
BiH 
; IC~H-SFNM=0,Bur=0,R(OI=1,MPK=1 


OOAD EE 
123 
OUT 
OX, At. 


00fIE B0EO 
124 
110l' 
At.. eE0H 
; OCWHIASK 
IRS, 6, 7 (NUT USED) 


00Be EE 
125 
OI1T 
DX,AL 


126 
p7 
3279 
INITIALIZflTION 
.:.., 


128 


eOO1 BAEAFF 
129 
S~T79 
MOV 
Oi;,0FFEAH 
;8279 
COMMANDWOROSHNO STATUS 
OOB4 BOOEl 
130 
110••• 
AL, ~300H 
; CLEAR DISPLAY 


ElllB6 EE 
131 
IJJT 
OK. At. 


0087 
EC 
132 
IflIT79: 
IN 
At..DX 
; RERO S1ATlJS 


ll0B8 
DOCB 
1~3 
~ 
AL..1 
· "Oll" 
BIT 
10 
CARR'I 


OOBA 72FB 
134 
IB 
WAIT79 
; JUMP IF 
DISf'LA'I 
IS UNAVAILABLE 


OOBC B007 
135 
t10Y 
AL, 
87H 
; DIGlT 
8 


OOBE EE 
136 
OUT 
DX, AL 


OOBF BAE8FF 
137 
MOl' 
OX. aFFE8H 
; 8279 
DATA WORD 


!JOC2 BOO6 
138 
1101/ 
RL B6H 
; CHARACTER "I" 


llt1C4 EE 
139 
OUT 
OX,RL 


OOC5 BAEAFF 
14B 
1101/ 
DX, llFFEAH 
; 8279 
CO/'IHAND WORD 


ll8CS 8086 
141 
~10\l 
AL.86H 
; DIGIT 
7 


9l!CA EE 
142 
QlT 
OX, AI. 


OOC8 BAESFF 
143 
1'10'" 
[lX, eFFEBH 
; 82i'9 
DATA IoIORfi 


OOCE E:!JS0 
144 
I1Ol' 
AL 5!lH 
; C'HARACTER "R" 


00De EE 
145 
OUT 
OX, A:.. 


00D1 FE: 
146 
STI 
; ENABLE ItlTERRlIPTS 


147 


143 


149 
DUI'lI'lY PROGRAM 


150 


BllD2 EBFE 
151 
DIm'· 
J/'lP 
OUHMY 
; WAIT FOR INTERRUPT 


152 


153 


00D4 R3B200 
154 
SRVE: 
I'IOY 
AXTEl'P, AX 
; SAYE AX 
8007 
58 
155 
POP 
AX 
; POP CAlL 
RETURN ADDRESS 


BOO8 A3!J000 
116 
MOI/ 
STACKL AX 
; SAVE CAlL 
RETURN ADDRESS 


BOOB AIB2BB 
157 
I'IOY 
AX, AXT[I'IP 
; RESTORE AX 
llOOE 50 
1~ 
PlISH 
AX 
; SAllE PROCESSOR STAWS 


BOOF 53 
159 
PUSH 
ax 


A-l70 


AP-59 


HlS-136 
ASSEI1BLER 
TCI'59A 


LOC 
OB; 
LINE 
SOURCE 


OOEB '51 
168 
PUSH 
C:~ 


OOEl 52 
161 
PUSH 
DX 


00E2 
55 
162 
F1JSH 
Ill' 


B8n 
56 
163 
PUSH 
SI 


00E4 
57 
164 
PUSH 
DI 


9llE5 
lE 
165 
PUSH 
DS 


00E6 
B6 
166 
PUSH 
ES 


00E7 
Al000e 
167 
MOil 
AX, STACKl 
.'RESTORE CALL RETURN AOORESS 


OOEA 50 
16~ 
PUSH 
AX 
; PUSH CALL RElIJRN AOORESS 


OOES n 
169 
RET 


170 


OOEC 53 
171 
~1STOR 
POP 
fiX 
; POP CALL RETURN AOOf([5S 


OOED A3eeB8 
172 
MOil 
STACK!' A~'i 
; SAllE CALL RETURN ADDRESS 


OOF8 07 
173 
POP 
ES 
, RESTORE P~OCES5OR STATUS 


99Fl 
IF 
174 
POP 
DS 


0l!lF2 SF 
175 
POP 
DI 


OOF3 5E 
176 
POP 
51 


OOF4 5D 
If? 
POP 
BP 


89F5 
SA 
178 
POP 
DX 


!!llF6 
59 
179 
POP 
C:~ 


00f7 
58 
180 
POP 
8X 


eeF8 
58 
181 
POP 
AX 


il9F9 
A39200 
182 
I'()II 
AXTEI1P, AX 
.•SAVE AX 


OOFC A1B800 
183 
I'lO\l 
AX, STACKl 
.•RESTORE: CfU 
RETlIlN 
ADDRESS 


OOFF 59 
184 
PUSH 
AX 
.•PUSH Cl-";LL RETURN AOORESS 


81ee 
AHl200 
185 
MO',' 
AX, AXTEMP 
.•RESTORE AX 


8U:!3 t3 
186 
RET 


187 


188 


189 
INTERRUPT 72, 
CLEAR DISPLfI'r', 
IRO 3259A 


198 


8184 
E8CDFF 
191 
INTR72 
CfU 
SA~ 
,Rf.NTINE 
TO SfM. 
PROCESSOR Sl ATUS 


B187 
BAEAFF 
192 
HOII 
D~, OFFEAH 
; 827S 
COl'lI1flI{) WORt> 
81BA A9I.l4OO 
193 
MOil 
AL.•DIGIT 
; SELECTED LED DIG Il 


8100 
EE . 
194 
OUT 
DX, AL 


81eE 
BAESFF 
195 
1'1011 
DX,0FFE8H 
.•8279 
DATA 


f:ll11 
8000 
196 
!'KJV 
AI..,OOfI 
.•BLANK OUT DIGIT 


81B 
I,E 
197 
OUT 
DX,AL 


8114 
EBDSFF 
15:8 
CALL 
RESTOR 
.;ROUTINE TO RES10RE PROCESSOR STfi1US 


8117 
CF 
199 
IRET 
; RETlJRN FI<OM I NTERRUI'l 


2ee 


201 


292 
I NTEI<RUPTn 
IRl 
8259A 


293 
8118 
EBB9FF 
204 
INTR73: 
CALL 
SAVE 
; ROUlIt£ 
TO SRl/E PROCESSOR STA1US 


8118 
B1lEAFF 
205 
IlOV 
DX,0FFEAH 
.'8279 
ClJI1I1AN) IO/D 
811E 
11088 
296 
1'10'" 
AL 8lJH 
; LED lIISPLAII 
DIGIT 
1 


8128 
A20400 
207 
1'1011 
DIGIT, 
Al 


8123 
EE 
298 
OUT 
OX, AI.. 


8124 
BAEllFF 
289 
1'1011 
OX, IlrFESH 
; 82i'9 
DATA 
8127 
B896 
210 
I'lO\l 
AL06H 
.•CHflRR(;'IER "1" 


81~ 
EE 
211 
OUT 
DX,AI.. 


812A 
Cl)4I) 
212 
INT 
77 
-'1 I1'lER DELAII FOR LED ON mE 
812C 
E8BDFF 
213 
CAlL 
RESTOR 
; ROOTINE 10 RESTORE PROC.lOSSORSTATUS 
8121- CF 
214 
IRET 
; RETlJlN FROO INTERRUPT 


A-I?I 


AP-59 


MC5-86 AS5EI'1BLER 
TCI59A 


LOC 
OBJ 
LINE 
SOlJRCE 


215 


216 
i 


217 
i 
INTERRUPT 74, 
11<28259f: 


218 


8139 
E8A1FF 
219 
INTR74. 
CALL 
SAVE 
i ROUTINE TO SA\lE f'ROI'...ESSORSl ATUS 


8rn 
BAEAFF 
220 
110',' 
DX,0FFEAH 
i 8279 
[;O/'lI'IAND WIJ1D 


8136 
8981 
221 
I'(l\I 
AL.81H 
; LED DISPLA'~ DIGII 
2 


8138 
11"20480 
222 
!'lO\I 
DIGIT, AI. 


8138 
EE 
223 
OUT 
DX,AI. 


8EC 
BliE8FF 
224 
MOil 
OX,9FFESH 
i 8279 DA1A 


013F B95B 
225 
11011 
AL 581: 
.;CHARACTER "2" 


8141 
EE 
226 
OUT 
DX, AI. 


8142 
l'D4D 
227 
INT 
(7 
; Tl~R 
DELfiV FOR LED ON TII'IE 


8144 
E8ASFF 
228 
CAlL 
RESTOR 
,ROUTINE 
TO RESTORE ~OCESSOR SiATUS 


8147 cr 
229 
IR!:l 
i RETURN FROM INTERRUPT 


238 
i 


231 
232 
INTERRUPT 75.. IR3 8259A 


233 
; 


8148 
E889FF 
234 
INTR75 
CAlL 
SA'IE 
i ROUTINE 10 SAllE. PROCESSORSTATUS 


8148 
BAHH 
235 
/'to\I 
OX,8FFEAH 
; 8279 
COMMffID WORD 


814E Be82 
236 
1'1(1\1 
AI.,82H 
; LEO DISPlA'!' 
DWI r 3 


8150 
A20489 
237 
MOl,' 
DIGIT. AI. 


il153 
EE 
238 
OUT 
OX,AI. 


8154 BAE8FF 
239 
Il(J\I 
DX,aFFESH 
; 8279 
DATA 
8157 
8841- 
249 
I'IOV 
AL4FH 
i CHARACTER "3" 


8159 
EE 
241 
OUT 
DX,AI. 


81~ 
CD4I) 
242 
INT 
77 
i mER 
DELAV FOR LED ON TIl'E 
815\; 
E8SDFF 
24~ 
CALL 
RESTOR 
; ROUTINE TO RESTORE PROCESSORSTffTUS 


815F CF 
244 
IRET 
i REWRN FROM INl EI<RUPT 


245 
i 


246 
; 


247 
i 
INTERRI~T 
76, 
IR48259R 


243 
; 


8168 
E871FF 
249 
INTR76: 
CALL 
SAI,'E 
i ROUTINE TO SAVE PROCESSORSlATUS 


8163 
BAEAFF 
259 
/'to\I 
DX,9FFEAH 
i 8279 
COMMANDWORD 


9166 
8883 
251 
i'IOV 
AI.,83H 
; LED DISPLAY IHGlT 
4 


8168 A28499 
2S2 
i'IOV 
DIGIT.. AI. 


9168 
EE 
253 
OUT 
DX,AI. 


816C BAEBFF 
254 
I'KJV 
DX, aFFESH 
; 82(9 
DATA 


916F B866 
255 
PlO\I 
AL66H 
.;CHARfll;TE.R "4" 


8171 
EE 
2S6 
OUT 
DX,AI. 


9172 
CD4D 
257 
INT 
77 
.;TlMER DELAV FOR LE.D ON nME 


9174 
E875FF 
2S8 
CAlL 
RESTOR 
iROUTlNE 
TO RESTORE PROCESSORSlATUS 


9177 
CF 
259 
IRET 
; RETURN FROM INlERRlfT 


269 
; 


261 
; 


262 
I NTERRlJPTn. 
TI~R 
OELA\', 
SOFTWAREC~TROLLED 


263 
91('8 
BAllAfF 
264 
HlTR77. 
MOl,' 
DX, llFF0AH 
; LOf1I) CWNTER 1 8253 
11 (258 
I'ISEC) 


8178 
8825 
265 
1'1011 
AL2SH 
iLSB 
8171> EE 
266 
WT 
Di<.AI. 


817E B899 
267 
I«lV 
AL llIJi 
itlSB 
9188 
EE 
268 
OUT 
DX,AI. 


8181 
CF 
269 
lRET 
i RUURN FROI'I INTERRUPT 


A-I72 


LOC 
OB.J 
LINE 
SOURCE 


270 


271 


272 
CODE 
ENDS; 


').,.,. 
C.I _' 
274 


0000 
275 
END 
START 


S'mBOL 
TABLE 
LI~,T1N(j 
------ ----- ------- 


NfiME 
T"'PE 
\!ALlIE 
ATTRIBUTES 


??SEG 
Sl:.GMENT 
SIZE=0OOlJH 
PARA PUBLIC 


AXTEMP 
\I WOR(l 
0£102H 
(lATA 


CODE. 
SEGMENT 
SIZE =0182H 
PARA 
[iflTA 
SEGMENT 
SIZE=OOe5H 
PAr/A 


DIGIT 
\I B'r'TE 
0004H 
DATA 


DliMMY 
L NEAl' 
01<lD2H 
CODE 


EXTRA 
SEGMENT 
SIZE=I<lBSH 
PARA 


INTR72 
L NEAl" 
91il4H 
COOE 


!NTP?] 
L NEAR 
91181·: 
CODE 


INTR74 
L NEHR 
01313H 
CODE 


INTR75 
L NEAR 
i.l148H 
CODE 


INTR76 
L NEAR 
9160H 
CODE 


INTR77 
L NEAR 
0178H 
CODE 


RESTOR 
L NEAR 
OOECH 
COOE 


SAY£. 
L NEqR 
OO(l4H 
CODE 


SET531 
L NEAR 
OOSAH 
CODE 


:.£TS32 
L NEAR 
0073>1 
CODE 


SETS9A 
L NEAR 
009fH 
COOE 


5£T79 
L NEAR 
0i.lB1H 
CODE 


STACl<l 
II WORD 
i.lOOBH 
DATA 


STRRT 
L NEAR 
OOOOH 
CODE 


TP72CS 
II WORD 
9122H 
EXTRA 


TP?2IP 
y WOf?[) 
912!JH 
EXTRA 


TP73CS 
II WORD 
0126H 
EXTRA 


TP73IP 
V WfR) 
0124H 
EXTRA 


TP?4CS 
Y WORD 
812AH 
EXTRA 


TP74IP 
'y' WOR(I 
9128B 
EXTRA 


TP75CS 
Y WOR(, 
i.l12EH 
EXTRA 


TP7SIP 
V WORl) 
012CH 
EXTRA 


TP76CS 
'y' WORD 
0B2H 
EXTRA 


TP76IP 
Y WORD 
0BttH 
EXTRA 


TP77CS 
II IoIORD 
9136H 
EXTRA 


TP77IP 
II WORD 
0134H 
EXTRA 


TYPES 
L NEAR 
OO12H 
CODE 


WAIT79 
L NEAR 
eeB7H 
COOl: 


F:SSEI'lBLY COMPLETE, 
I«l 
ERR~S 
FOOl{) 


iSBC 80/20 and iSBC 80/20-4 Single Board Computer 
Hardware 
Reference 


Manual. 
98-317C. 


iSBcrM 
86/12 Single 
Board 
Computer 
Hardware 
Reference 
Manual, 


9800645A. 
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A significant 
measure of the power and flexibility 


of the Intel OEM Computer Product Line can be 
attributed 
to the design of the Intel MULTIBUS 


system bus. The bus structure provides a common 
element 
for communication 
between 
a wide 
variety of system modules which include: Single 
Board Computers, 
memory, digital, 
and analog 
I/O expansion boards, and peripheral controllers. 


The purpose of this application 
note is to help you 
develop a working knowledge ofthe Intel MULTI- 
BUS specification. 
This knowledge is essential for 


configuring 
a system 
containing 
multiple 
mod- 
ules. 
Another purpose is to provide you with the 
information necessary to design a bus interface for 
a slave module. One ofthe tools that will be used to 
achieve this goal is the complete description of a 
MULTIBUS slave design example. Other portions 
of this 
application 
note 
provide 
an in depth 


examination 
of the bus signals, operating charac- 
teristics, and bus interface circuits. 


This application 
note was originally 
written in 
1977. Since 1977, the MULTIBUS specification 
has been significantly 
expanded to cover opera- 
tion with both 8 and 16-bit system modules and 
with an auxiliary 
power bus. 
This application 
note now contains 
information 
on these 
new 
MULTIBUS specification features. 


In addition, 
a detailed MULTIBUS specification 
has also been published which provides the user 
with further information 
concerning MULTIBUS 
interfacing. 
The MULTIBUS specification 
and 


other useful documents are listed in the overleaf of 
this note under Related Intel Publications. 


II. MULTIBUSTM 
SYSTEM 
BUS 
DESCRIPTION 


The Intel MULTIBUS signal lines can be grouped 
in the following categories: 
20 address lines, 16 
bidirectional 
data 
lines, 
8 multilevel 
interrupt 
lines, and several bus control, timing and power 
supply 
lines. 
The address 
and data 
lines are 
driven by three-state 
devices, while the interrupt 


and some other control lines are open-collector 
driven. 


Modules that use the MULTIBUS system bus have 
a master-slave relationship. 
A bus master module 
can drive the command and address lines: it can 
control the bus. 
A Single Board Computer is an 


example of a bus master. 
A bus slave cannot 


control 
the bus. 
Memory and 
I/O 
expansion 
boards are examples of bus slaves. 
The MULTI- 
BUS architecture 
provides for both 8 and 16-bit 


bus masters and slaves. 


Notice that a system may have a number of bus 
masters. 
Bus arbitration 
results when more than 
one master requests control ofthe bus at the same 
time. A bus clock is usually provided by one of the 
bus masters 
and may be derived independently 


from the processor clock. The bus clock provides a 
timing 
reference 
for resolving 
bus contention 
among multiple requests from bus masters. 
For 
example, a processor and a DMA (direct memory 
access) module may both request control of the 
bus. This feature allows different speed masters to 
share resources on the same bus. Actual transfers 
via 
the bus, however, proceed asynchronously 


with respect to the bus clock. Thus, the transfer 
speed 
is dependent 
on the 
transmitting 
and 
receiving devices only. 
The bus design prevents 


slow master modules from being handicapped 
in 
their attempts 
to gain control of the bus, but does 


not restrict the speed at which faster modules can 
transfer data via the same bus. Once a bus request 
is granted, single or mUltiple read/write 
transfers 


can proceed. The most obvious applications for the 
master-slave 
capabilities 
of the bus are multi- 
processor configurations 
and high-speed 
direct- 


memory-access (DMA) operations. 
However, the 


master-slave 
capabilities 
of the bus are by no 


means limited to these two applications. 


This section defines the signal lines that comprise 
the Intel MULTIBUS system bus. These signals 
are contained on either the PI or P2 connector of 
boards compatible 
with the MULTIBUS specifi- 


cation. 
The PI signal lines contain the address 


data, 
bus control, bus exchange, 
interrupt 
and 


power supply lines. The P2 signal lines contain the 
optional auxiliary 
signal lines. 
Most signals on 


the bus are active-low. For example, a low level on 
a control signal on the bus indicates active, while a 
low level on an address or data signal on the bus 
represents logic "1" value. 


NOTE 


In this application 
note, a signal will be 


designated active-low by placing a slash (I) 
after the mnemonic for the signal. 


Appendix A contains a pin assignment 
list of the 


following signals: 


· Initialization 
signal; resets the entire system to 


a known inter~al state. INIT I may be driven by 
one of the bus masters or by an external source 
such as a front panel reset switch. 


20 address lines; used to transmit the address of 
the memory location or I/O port to be accessed. 
The lines are labeled ADROI 
through 
ADR9/, 


ADRAI 
through ADRFI 
and ADRlOl 
through 
ADR13/. 
ADR131 
is the most significant 
bit. 
8-bit masters 
use 16 address 
lines (ADROI 
- 


ADRF/) 
for memory addressing 
and 8 address 


lines (ADROI 
- ADR7/) 
for I/O port selection. 
16-bit masters 
use all twenty a-ddress lines for 
memory 
addressing 
and 
12 address 
lines 


(ADROI - ADRB/) 
for I/O port selection. Thus, 


8-bit masters may address 64K bytes of memory 
and 256 I/O devices while 16-bit masters 
may 
address 
1 megabyte of memory and 4096 I/O 
devices. 
(The 8086 CPU actually 
permits 
16 


address bits to be used to specify I/O devices, 
the MULTIBUS specification, 
however, states 
that only the low order 12 address bits can be 
used to specify I/O ports.) 
In a 16-bit system, 
the ADROI line is used to indicate whether a low 
(even) byte or a high (odd) byte of memory or 
I/O space is being accessed in a word oriented 
memory or I/O device. 


BHENI 


Byte 
High 
Enable; 
the 
address 
control 
line 


which is used to specify that data will be trans- 
ferred on the high byte (DAT81 - DATF I) of the 
MULTIBUS 
data 
lines. 
With current 
iSBC 
boards, 
this signal effectively specifies that a 
word (two byte) transfer is to be performed. This 
signal is used only in systems which incorporate 
sixteen bit memory or I/O modules. 


Inhibit 
RAM 
signal; 
prevents 
RAM memory 
devices from responding to the memory address 
on the system address bus. 
INHlI 
effectively 


allows ROM memory devices to override RAM 
devices 
when 
ROM and 
RAM memory 
are 


assigned the same memory addresses. 
INHlI 


may also be used to allow memory mapped I/O 
devices to override RAM memory. 


INH21 


Inhibit 
ROM 
signal; 
prevents 
ROM memory 


devices from responding to the memory address 
on the system address bus. 
INH21 
effectively 


allows auxiliary 
ROM (e.g., a bootstrap 
pro- 
gram) to override ROM devices when ROM and 
auxiliary 
ROM memory are assigned the same 


memory addresses. 
INH21 may also be used to 


allow memory mapped I/O devices to override 
ROM memory. 


16 bidirectional 
data lines; used to transmit 
or 


receIve information 
to or from a memory loca- 


tion or I/O port. OATF I being the most signifi- 
cant bit. 
In 8-bit systems, only lines DATOI 
- 


DAT7I are used (DAT7I being the most signi- 
ficant bit). In 16-bit systems, either 8 or 16lines 
may be used for data transmission. 


BCLKI 


Bus clock; the negative 
edge (high to low) of 


BCLKI 
is used to synchronize 
bus priority re- 


solution circuits. BCLKI 
is asynchronous 
to the 


CPU clock. It has a 100ns minimum period and 
a 35'7<,to 65%duty cycle. BCLKI 
may be slowed, 


stopped, or single stepped for debugging. 


Constant 
clock; a bus signal which provides a 


clock signal of constant frequency for unspeci· 
fied general use by modules on the system bus. 
CCLKI 
has a minimum period of 100 ns and a 


35% to 65%duty cycle. 


Bus priority in signal; indicates to a particular 
master module that no higher priority module 
is requesting use of the system bus. 
BPRNI 
is 


synchronized 
with BCLK/. 
This signal is not 


bused on the backplane. 


Bus priority 
out signal; used with serial (daisy 


chain) bus priority resolution schemes. 
BPRO/ 


is passed 
to the BPRN/ 
input of the master 
module with the next lower bus priority. BPRO/ 
is synchronized with BCLK/. This signal is not 
bused on the backplane. 


Bus 
busy 
signal; 
an open collector line driven 
by the bus master currently in control to indicate 
that the bus is currently in use. BUSY/prevents 
all other master modules from gaining control 
ofthe bus. BUSY/is synchronized with BCLK/. 


Bus 
request 
signal; 
used with a parallel 
bus 
priority 
network to indicate 
that 
a particular 
master module requires use of the bU:sfor one 
pr more data tran!ifers. BREQ/ is synchronized 
with BCLK/. 
This signal is not bused on the 


backplane. 


Common 
bus 
request; 
an 
open-collector 
line 


which is driven by all potential 
bus masters 


and is used to inform the current bus master 
that another 
master wishes to use the bus. 
If 


CBRQi is high, it indicates to the bus master 
that no other master is requesting the bus, and 
therefore, the present bus master can retain the 
bus. This saves the bus exchange overhead for 
the current master. 


A bus 
master 
provides 
separate 
read/write 


command 
signals 
for memory and 110 devices: 
MRDC/, 
MWTC/, 
rORC/ 
and 
roWC/, 
as ex- 


plained below. 
When a read/write 
command is 


active, the address signals must be stabilized at all 
slaves on the bus. 
For this reason, the protocol 


requires 
that 
a bus master 
must issue address 


signals (and data signals for a write operation) at 
least 50 ns ahead of issuing a read/write command 
to the bus, initiating 
the data transfer. 
The bus 
master must keep address signals unchanged until 
at least 
50 ns after the read/write 
command is 
turned off, terminating 
the data transfer. 


A bus slave must provide an acknowledge signal to 


the bus master 
in response 
to a read or write 


command signal. 


MRDC/ 


Memory 
read 
command; 
indicates 
that 
the 


address of a memory location has been placed 
on the system address lines and specifies that 
the contents 
(8 or 16 bits) of the addressed 


location are to be read and placed on the system 
data bus. MRDC/ is asynchronous 
with respect 


to BCLK/. 
. 


Memory 
write 
command; 
indicates 
that 
the 


address of a memory location has been placed 
on the system address lines and that data (8 or 
16 bits) has been placed on the system data bus. 
MWTC/ specifies that the data is to be written 
into the addressed memory location. MWTC/ is 
asynchronous 
with respect to BCLK/. 


I/O 
read command; 
indicates that the address 


of an input port has been placed on the system 
address bus and that the data (8 or 16 bits) at 
that input port is to be read and placed on the 
system data bus. rORC/ is asynchronous 
with 


respect to BCLK/. 


I/O 
write command; 
indicates that the address 


of an output port has been placed on the system 
address bus and that the contents ofthe system 
data bus (8 or 16 bits) are to be output to the 
address 
port. 
rowc/ 
is asynchronous 
with 


respect to BCLK/. 


Transfer 
acknowledge 
signal; 
the required 


response of a slave board which indicates that 
the 
specified 
read/write 
operation 
has 
been 


completed. That is, data has been placed on, or 
accepted 
from, 
the system 
data 
bus lfnes. 


XACK/ is asynchronous with respect to BCLK/. 


used with a parallel 
interrupt 
resulutiun 
net- 


wurk. 
INTO. has the highest 
priority, 
while 
INT7/ 
has 
luwest 
priority. 
Interrupt 
lines 
shuuld be driven with open cullectur drivers. 


Interrupt 
acknuwledge; 
an interrupt 
acknuwl- 


edge line (lNTA/), 
driven by the bus master, 
requests 
the transfer 
of interrupt 
information 


onto the bus from slave priority interrupt 
con- 
trollers (8259s or 8259As). The specific informa- 
tion 
timed 
onto 
the 
bus 
depends 
upon 
the 
implementation 
of the interrupt 
scheme. 
In 


general, 
the leading 
edge of INTAI 
indicates 
that the address bus is active while the trailing 
edge indicates 
that data \s present on the data 
lines. 


MULTI BUS 
P2 Signal 
Lines 
- 
The signals 
cuntained 
un the MULTIBUS 
P2 auxiliary 
con- 
nectur 
are 
used 
primarily 
by uptiunal 
puwer 
back-up 
circuitry 
fur memory 
prutection. 
P2 
signals 
are 
nut 
bused 
on the 
backplane, 
and 
therefure, 
require 
a separate 
connectur 
fur each 
board using the P2 signals. 
Present iSBC buards 
have a slut in the card edge and should be used 
with a keyed P2 edge connector. 
Use of the P2 
signal lines is uptional. 


AC 
Low; this signal 
generated 
by the power 
supply goes high when the AC line voltage 
drops below a certain voltage (e.g., 103v AC in 
1l5v AC line voltage systems) indicating 
D.C. 


power will fail in 3 msec. ACLO goes low when 
all D.C. voltages return to approximately 
95'PlI 
of the regulated value. This line must be pulled 
up by the optional standby 
power source, if one 
is used. 


Power fail interrupt; 
this signal interrupts 
the 
processor 
when 
a power failure 
occurs, it is 
driven by external 
power fail circuitry. 


Power 
fail 
sense; 
this line is the output 
of a 
latch which indicates 
that a power failure has 
occurred. 
It is reset by PFSR/. 
The power fail 


sense latch i::;part 
uf external 
power fail cir- 
cuitry 
and 
must 
be puwered by the standby 
power ::;ource. 


Puu'er fail sel/se reset; this line is used to reset 
the power fail sense latch (PFSN/). 


Memury 
protect; 
prevents 
memory 
op~ration 


during 
periud uf uncertain 
DC power, by in- 


hibiting 
memory requests. 
MPROi 
is driven 
by external puwer fail circuitry. 


Address 
latch 
enahle; 
generated 
by the CPU 
(8085 or 8086) to provide an auxiliary 
address 


latch. 


Auxiliary 
Reset; 
this externally 
generated 
sig- 
nal initiate::; a power-up sequence. 


Bus 
master 
wait 
state; 
this 
signal 
indicates 
that the processor is in a wait state. 


Reserved 
- 
Several PI and P2 connector 
bu::; 
pins are unused. 
However, they should be regard- 
ed as reserved 
for dedicated 
use in future Intel 
products. 


Power 
Supplies 
- 
The power supply bus pin::; 
are detailed 
in Appendix 
A which contains 
the 
pin assignment 
of signals 
on the MULTIBUS 
backplane. 


It is the 
designer's 
responsibility 
to provide 
adequate 
bulk decoupling on the board to avoid 
current surges on the power supply lines. It is also 
recommended 
that 
you provide high frequency 


decoupling for the logic on your board. Values of 
22uF for +5v and +12v pins and 10uF for -5v and 
-12v pins are typical on iSBC boards. 


Beyond the definition of the MULTIBUS Bignals 
themselves, 
it is important 
to examine 
the 
operating 
characteristics 
of the bUBo The AC 
requirements 
outline the timing of the bus signals 


and in particular, define the relationBhips between 
the various bus signals. On the other hand, the DC 
requirements 
specify 
the bus driver 
character- 


istics, maximum 
bus loading per board, and the 


pull-up/down 
resistors. 


The AC requirements 
are best presented 
by a 


discussion 
of the relevant 
timing 
diagrams. 
Appendix 
B contains 
a list of the MULTIBUS 


timing specifications. 
The following sectionB will 
discuss 
data transfers, 
inhibit 
operations, 
inter- 
rupt operations, 
MULTIBUS multi-master 
opera- 
tion and power fail considerations. 


Data Transfers 
- Data transfers on the MULTI- 


BUS system 
bus occur with a maximum 
band- 
width of 5 MHz for single or multiple read/write 
transfers. 
Due to bus arbitration 
and memory 


access time, a typical maximum transfer 
rate is 


often on the order of 2 MHz. 


Read Data 


Figure 
1 shows 
the read operation 
AC timing 
diagram. 
The address must be stable (tAS) for a 


minimum 
of 50 ns before command 
(IaRC/ 
or 
MRDC/). 
This time is typically used by the bus 


interface 
to decode the address and thus provide 
the required 
device selects. 
The device selects 


establish 
the data 
paths 
on the user system in 
anticipation 
of the 
strobe 
signal 
(command) 


which will follow. The minimum command pulse 
width is 100 ns. The address must remain stable 
for at least 50 ns following the command (tAH)' 
Valid data should not be driven onto the bus prior 
to command, 
and must not be removed until the 


command is cleared. The XACK/ signal, which is 
a response 
indicating 
the specified 
read/write 
operation 
has been completed, must coincide or 
follow both the read access and valid data (tDXL)' 
XACK/ must be held until the command is cleared 
(tXAH)· 


Write Data 


The write operation AC timing diagram 
is shown 


in Figure 2. During a write data transfer, 
valid 


data 
must be presented 
simultaneously 
with 
a 


stable address. 
Thus, the write data setup time 
(tDSJ has the same requirement 
as the address 


setup time (tASJ. The requirement for stable data 
both 
before 
and 
after 
command 
(lOWC/ 
or 


MWTC/J enables 
the bus interface 
circuitry 
to 


latch data on either the leading or trailing edge of 
command. 
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A 16-bit master may transfer data on the MULTI- 
BUS 
data 
lines 
using 
8-bit 
or 16-bit 
paths 


depending 
on whether 
a byte or word (2 byte) 


operation 
has been specified. 
(A word transfer 


specified with an odd I/O or memory address will 
actually be executed as two single byte transfers.) 
An 8-bit master may only perform byte transfers 
on the MULTIBUS data lines DATO/ - DAT7I. 


In 
order 
to maintain 
compatibility 
with 
older 
8-bit masters 
and slaves, a byte swapping 
buffer 


is included 
in all new 16-bit masters 
and 16-bit 


slaves. In the iSBC product line, all byte transfers 
will take place on the low 8 data lines DATO/ - 
DAT7I. 
Figure 3 contains 
a example of 8/16-bit 


rhla 
driver 
logic for ]f,-hit master 
and 
slave 


systems. 
In the 8/16-bit 
system, there are three 


sets 
of buffers; 
the 
lower byte buffer 
which 


accesses DATOI - DAT7I, the upper byte buffer 
which accesses 
DAT81 - DATF/, 
and the swap 
byte buffer which accesses the MULTIBUS data 
lines 
DATOI 
- DAT7I 
and 
transfers 
the data 


to/from 
the on-board data bus lines D8 - DF. 


Figure 4 summarizes the 8 and 16-bit data paths 
used for three types of MULTI BUS transfers. 
Two 


signals control the data transfers. 


Byte High Enable (BHEN/) 
active indicates that 


the bus is operating 
in sixteen 
bit mode, and 


Address Bit 0 (ADRO/) defines an even or odd byte 
transfer 
address. 


On the first type of transfer, 
BHENI 
is inactive, 


and ADROI is inactive indicating 
the transfer of 


an even eight bit byte. The transfer takes place 
across data lines DATOI - DAT7I. 


On the second type oftransfer, 
BHEN I is inactive, 


and ADROI is active indicating 
the transfer of a 
high (odd) byte. On this type of transfer, the odd 
(high) byte is transferred 
through the Swap Byte 


Buffer to DATOI - DAT7I. This makes eight bit 
and sixteen bit systems compatible. 
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The 
third 
type 
of transfer 
is a 16 bit (word) 
transfer. 
This 
is indicated 
by BHENI 
being 


active, and ADRO/ being inactive. On this type of 
transfer, 
the 
low (even) 
byte 
is transferred 
on 


DATal 
- DAT7I 
and 
the 
high 
(odd) byte 
is 


transferred 
on DAT81 - DATF/. 


Note that 
the condition 
when both BHEN/ 
and 


ADROI are active is not used with present iSBC 
boards. 
This condition could be used to transfer a 
high odd byte of data on DATH/ - DATF/, 
thus 


eliminating 
the need for the swap byte buffer. 


However, this is not a recommended transfer type, 
because it eliminates 
the capability 
of communi- 
cating with 8-bit modules. 


Inhibit Operations 
- 
Bus inhibit operations are 
required by certain bootstrap and memory mapped 
I/O 
configurations. 
The purpose of the inhibit 


operation is to allow a combination ofRAM, ROM, 
or memory 
mapped 
I/O 
to occupy 
the 
same 


memory address space. In the case of a bootstrap, 
it may be desirable to have both ROM and RAM 
memory occupy the same address space, selecting 
ROM instead 
of RAM for low order memory only 


when the system is reset. A system designed to use 


memory mapped I/O, which has actual memory 
occupying 
the 
memory 
mapped 
I/O 
address 


space, may need to inhibit RAM or ROM memory 
to perform its functions. 
There are two essential requirements 
for a success- 


ful inhibit operation. 
The first is that the inhibit 


signal must be asserted as soon as possible, within 
a maximum 
of 100 ns (tCI), after stable address. 


The second requirement 
for a successful 
inhibit 


operation is that the acknowledge must be delayed 
(tXACKB) 
to allow 
the inhibited 
slave 
to ter- 


minate 
any 
irreversible 
timing 
operations 
in- 
itiated by detection of a valid command prior to its 
inhibit. 


This situation 
may arise because a command can 
be asserted within 50 ns after stable address (tAS) 
and yet inhibit 
is not required until 100 ns (tID) 


after stable address. 
The acknowledge delay time 


(tXACKB) is a function 
of the cycle time of the 


inhibited 
slave memory. 
Inhibiting 
the iSBC 016 


RAM 
board, for example, requires a minimum 
of 


1.5 usec. 
Less time is typically 
needed to inhibit 
other memory modules. For example, the iSBC 104 
board requires 475 ns. 
Figure 5 depicts a situation 
in which both RAM 
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and 
PROM 
memory 
have 
the 
same 
memory 
addresses. 
In this case, PROM inhibits 
RAM, 
producing 
the effect of PROM overriding 
RAM. 
After address is stable, local selects are generated 
for both the PROM and the RAM. The PROM local 
select 
produces 
the 
INHlI 
signal 
which 
then 
removes the RAM local select and its driver enable. 
Because the slave RAM has been inhibited after it 
had already 
begun its cycle, the PROM XACKI 


must be delayed (tXACKB) until after the latest 
possible 
acknowledgement 
from 
the 
RAM 
(tXACKA)· 


Interrupt 
Operations 
- The MULTIBUS inter- 
rupt lines INTOI - INT71 are used by a MULTI- 
BUS master to receive interrupts 
from bus slaves, 


other bus masters or external logic such as power 
fail logic. A bus master may also contain internal 
interrupt 
sources which 
do not require the bus 
interrupt 
lines to interrupt 
the master. 
There are 
two interrupt 
implementation 
schemes 
used by 
bus interrupts, 
Non Bus Vectored Interrupts 
and 
Bus Vectored 
Interrupts. 
Non Bus 
Vectored 
Interrupts 
do not convey interrupt 
vector address 
information 
on the bus. Bus Vectored Interrupts 
are interrupts 
from slave Priority Interrupt 
Con- 
trollers (PICs) which do convey interrupt 
vector 


Non Bus Vectored Interrupts 


Non Bus Vectored Interrupts 
are those interrupts 
whose interrupt vector address is generated by the 
bus master 
and do not require the MULTIBUS 
address 
lines for transfer 
of the interrupt 
vector 
address. The interrupt vector address is generated 
by the interrupt 
controller 
on the master 
and 
transferred 
to the processor over the local bus. The 
source of the interrupt can be on the master module 
or on other bus modules, in which case the bus 
modules 
use the MULTIBUS 
interrupt 
request 
lines (INTOI - INT7/) to generate 
their interrupt 
requests 
to the bus master. 
When an interrupt 
request line is activated, the bus master performs it 
own interrupt 
operation 
and processes the inter- 
rupt. 
Figure 6 shows an example 
of Non Bus 
Vectored Interrupt 
implementation. 


Bus VeCtored Interrupts 


Bus Vectored Interrupts 
(Figure 7) are those inter- 


rupts which transfer 
the interrupt 
vector address 
along 
the MULTIBUS 
address 
lines 
from the 
slave to the bus master using the INTAI command 
signal for synchronization. 
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When an interrupt request from the MULTIBUS 
interrupt lines INTO/ -INT7/ occurs, the interrupt 
control 
logic on the bus master 
interrupts 
its 


processor. 
The processor 
on the bus master 


generates an INTAI command which freezes the 
state of the interrupt 
logic on the MULTIBUS 


slaves for priority resolution. The bus master also 
locks (retains 
the bus between bus cycles) the 
MULTIBUS 
control 
lines 
to guarantee 
itself 


consecutive 
bus cycles. 
After the first INTAI 


command, the bus master's interrupt control logic 
puts 
an interrupt 
code on to the MULTIBUS 


address lines ADR81 -ADRA/. The interrupt code 
is the address of the highest priority active inter- 
rupt request line. At this point in the Bus Vectored 


Interrupt procedure, two different sequences could 
take place. 
The difference occurs, because the 


MULTIBUS 
specification 
can support 
masters 


which 
generate 
one additional 
INTAI 
(8086 


masters) or two additional 
INTA/s 
(8080A and 
8085 masters). 


If the bus master generates one additional INTAI, 
this second INTAI 
causes the bus slave interrupt 


control logic to transmit 
an interrupt vector 8-bit 


pointer on the MULTIBUS data lines. The vector 
pointer is used by the bus master to determine the 
memory address of the interrupt service routine. 


If the 
bus master 
generates 
two additional 
INTAI s, these two INTAI 
commands 
allow the 


bus slave to put a two byte interrupt vector address 
on to the MULTIBUS data lines (one byte for each 
INTA/). 
The interrupt 
vector address is used by 
the bus master to service the interrupt. 


The MULTIBUS 
specification 
provides for only 
one type of Bus Vectored Interrupt 
operation in a 
given system. 
Slave boards which have an 8259 
interrupt 
controller are only capable of 3 INTAI 


operation 
(2 additional 
INTA/s 
after the first 
INTA/). 
Slave boards with the 8259A interrupt 
controller 
are capable 
of either 
2 INTAI 
or 3 
INTAI 
operation. 
All slave boards 
in a given 
system must operate in thesameway(2 
INTA/sor 
3 INTA/s) 
if Bus Vectored Interrupts 
are to be 
used. 
However, 
the MULTIBUS 
specification 
does provide for Bus Vectored Interrupts 
and Non 
Bus Vectored I~terrupts 
in the same system. 


MULTIBU8 
Multi-Master 
Operation 
- 
The 
MULTI BUS system bus can accommodate several 
bus masters on the same system, each one taking 
control of the bus as it needs to affect data trans- 
fers. The bus masters request bus control through 
a bus exchange sequence. 


Two bus exchange priority resolution techniques 
are discussed, 
a serial technique 
and a parallel 
technique. 
Figures ,8 and 9 illustrate 
these two 
techniques. 
The bus exchange 
operation 
dis- 
cussed later is the same for both techniques. 


Serial Priority Technique 


Serial priority resolution is accomplished 
with a 
daisy chain ~echnique (see Figure 8). The priority 
input (BPRN/) 
of the highest 
priority 
master is 
tied to ground. The priority output (BPRO/) 
of the 


highest 
priority 
master 
is then connected to the 
priority input (BPRN/) of the next lower priority 
master, and so on. Any master generating 
a bus 
request will set its BPROI 
signal high to the next 
lower priority master. 
Any master seeing a high 
signal on its BPRNI line will sets its BPROI line 
high, thus passing 
down priority information 
to 
lower priority masters. 
In this implementation, 


the bus request line (BREQ/) is not used outside of 
the 
individual 
masters. 
A limited 
number 
of 
masters can be accommodated 
by this technique, 
due to gate delays through the daisy chain. 
Using 
the current 
Intel MULTIBUS 
controller chip on 
the master boards up to 3 masters may be accom- 
modated if a BCLKI 
period of 100 ns is used. 
If 
more bus masters are required, either BCLKI 
must 
be slowed or a parallel priority technique used. 


Parallel Priority Technique 


In the parallel priority technique, 
the priority is 
resolved in a priority resolution circuit in which 
the highest priority BREQI input is encoded with 
a priority encoder chip (74148). This coded value is 
then decoded with a priority decoder chip (748138) 
to activate 
the appropriate 
BPRNI 
line. 
The 
BPROI lines' are not used in the parallel priority 
scheme. 
However, since the MULTIBUS 
back- 
plane contains 
a trace from the BPRNI signal of 
one card slot to the BPROI signal of the adjacent 
lower card slot, the BPROI must be disconnected 
from the bus on the board or the backplane trace 
must be cut. A practical 
limit of sixteen masters 
can be accommodated 
using the parallel priority 
technique due to physical bus length limitations. 
Figure 
9 contains 
the schematic 
for a typical 
parallel resolution network. Note that the parallel 
priority 
resolution 
network 
must 
be externally 
supplied. 


MULTIBUS 
Exchange 
Operation 
- 
A timing 
diagram 
for the MULTIBUS exchange operation 
is shown 
in Figure 
10. 
This 
implementation 
example 
uses a parallel 
resolution 
scheme, how- 
ever, the timing would be basically 
the same for 
the serial resolution scheme. 


In this example, 
master 
A has been assigned 
a 
lower priority than master B. The bus exchange 
occurs because master B generates 
a bus request 
during a time when master A has control of the 
bus. 


The 
exchange 
process 
begins 
when 
master 
B 
req uires the bus to access some resource such as an 
110 or memory module while master A controls the 
bus. 
This internal 
request is synchronized 
with 
the 
trailing 
edge 
(high 
to low) of BCLKI 
to 
generate a bus request (BREQ/). 
The bus priority 
resolution circuit changes the BPRNI 
signal from 
active (low) to inactive 
(high) for master 
A and 
from inactive 
to active for master 
B. 
Master A 
must first complete the current 
bus command 
if 
one is in operation. 
After master A completes the 
command, 
it sets BUSY I inactive 
on the next 
trailing edge ofBCLK/. 
This allows the actual bus 


exchange 
to occur, because master 
A has relin- 
quished control of the bus, and master B has been 
granted 
its BPRN/. 
During this time, the drivers 


for master 
A are disabled. 
Master B must take 
control of the bus with the next trailing 
edge of 
BCLKI 
to complete the bus exchange. 
Master B 
takes control by activating 
BUSY I and enabling 
its drivers. 


It is possible for master A to retain control of the 
bus and prevent 
master 
B from getting 
control. 


Master A activates the Bus Override (or Bus Lock) 
signal 
which keeps BUSYI active allowing 
con- 
trol 
of the bus to stay 
with 
master 
A. 
This 
guarantees 
a master 
consecutive 
bus cycles for 
software 
or hardware 
functions 
which 
require 
exclusive, continuous 
access to the bus. 


Note that in systems with only a single master it is 
necessary to ground the BPRNI pin of the master, 
if slave boards are to be accessed. 
In single board 
systems 
which use a CPU board capable 
of Bus 
Vectored Interruptoperation, 
the BPRNI 
pin must 
also be grounded. 


In a single master system bus transfer 
efficiency 


may be gained if the BUS OVERRIDE signal is 
kept active continuously. 
This permits the master 
to maintain 
control of the bus at all times, there- 


fore saving the overhead ofthe master reacquiring 
the bus each time it is needed. 


The CBRQI 
line may be used by a master 
in 
control of the bus to determine 
if another 
master 
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requires the bus. If a master currently in control of 
the bus sees the 
CBRQI 
line inactiVe, it will 
maintain 
control of the bus between adjacent bus 


accesses. Therefore, when a bus access is required, 
the master saves the overhead of reacquiring the 
bus. If a current bus master sees the CBRQI 
line 


active, it will then, relinquish 
control of the bus 


after the current bus access and will contend for 
the bus with the other master(s) requiring the bus. 
The relative priorities of the masters 
will deter- 
mine which master receives the bus. 


Note that exceptforthe 
BUS OVERRIDE state, no 


single master may keep exclusive control of the 
bus. 
This is true because it is impossible fOl;the 


CPU on a master to ,require continuous access to 
the bus. Other lower priority masters will always 
be able to gain access to the bus between accesses 
of a higher priority master. 


Power" Fail Considerations 
- The MULTIBUS 


P2 connector signals provide a means of handling 
power failures. 
The circuits required for power 
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failure detection and handling 
are optional and 


must be supplied by the user. 
Figure 11 shows 


the timing of a power fail sequence. 


The power supply monitors the AC power level. 
When power drops below an acceptable value, the 
power supply raises ACLO which tells the power 
fail logic that a minimum ofthree milliseconds will 
elapse before DC power will fall below regulated 
voltage levels. 
The power fail logic sets a sense 


latch (PFSN /) and genera tes an interrupt (PFIN /) 
to the processor so the processor can store its 
environment. 
After a 2.5 millisecond timeout, the 


memory protect signal (MPRO/) is asserted by the 
power fail logic preventing any memory activity. 
As power falls, 
the memory 
goes on standby 


power. 
Note that 
the power fail logic must be 


powered from the standby source. 


As the AC line revives, the logic voltage level is 
monitored by the power supply. 
After power has 


been at its operating 
level for one millisecond 


minimum, the power supply sets the signal ACLO 
low, beginning 
the restart 
sequence. 
First, the 


memory protect line (MPRO/) then the initialize 
line (INIT /) become inactive. The bus master now 
starts running. 
The bus master checks the power 


fail latch (PFSN /) and, ifit finds it set, branches to 


a power up routine which resets the latch (PFSR/), 
restores the environment, 
and resumes execution. 


Note that INIT/ is activated only after DC power 
has risen to the regulated voltage levels and must 
stay low for five milliseconds minimum before the 
system is allowed to restart. 
Alternatively,INIT/ 


may be held low through an open collector device 
by MPRO/. 


How the power failure equipment is configured is 
left to the system designer. 
The backup power 


source may be batteries 
located on the memory 


boards 
or more elaborate 
facilities 
located off- 


board. 
The 
location 
of the 
power- fail 
logic 


determines which MULTIBUS power fail lines are 
used. Pins on the P2 connector have been specified 
for the power failure functions for use as needed. 


To further clarify the location and use ofthe power 
fail circuitry, an example of a typical power fail 
system block diagram 
is shown in Figure 12. A 


single board computer and a slave memory board 
are contained in the system. It is desired to power 
the memory circuit elements of the memory board 
from auxiliary power. The single board computer 
will remain 
on the main power supply. 
To ac- 


complish this, user supplied power fail logic and 


an auxiliary power supply have been included in 
the system. 


The single board computer is powered from the PI 
power lines 
and 
accesses 
the P2 signal 
lines 


PFINI, 
PFSNI 
and PFSRI 
(only the P2 signal 


lines used by a particular 
functional 
block are 
shown on the block diagram). 
The PFSRI line is 
driven from two sources: a front panel switch and 
the single board computer. The front panel switch 
is used during normal power-up to reset the power 
fail sense latch. 
The single board computer uses 
the PFSRI line to reset the latch during a power-up 
sequence after a power failure. 
Current 
single 
board 
computers 
must 
access 
the PFSNI 
and 


PFSRI 
signals 
either 
directly 
with 
dedicated 


circuitry and a P2 pin connection or through the 
parallel I/O lines with a cable connection from the 
parallel I/O connector to the P2 connector. 


The slave memory board uses both the PI and P2 
power lines, the P2 power lines are used (at all 
times) to power the memory circuit elements and 
other support circuits, the PI power lines power all 
other circuitry. 
In addition, 
the MPROI 
line is 
input and used to sense when memory contents 
should be protected. 


The power fail logic contains the power fail sense 
latch, and uses the PFSRI 
and ACLO lines for 


inputs and the PFINI 
PFSNI, 
and MPROI 
lines 
for outputs. 
The power fail logic must be powered 


by the P2 power lines. 


DC Requirements 
- The drive and load charac- 


teristics of the bus signals are listed in Appendix 
C. The physical locations of the drivers and loads, 
as well as the terminating 
resistor value for each 


bus line, are also specified. Appendix D contains 
the MULTIBUS power specifications. 


MUL TIBUS™ Slave 
Interface 
Circuit 
Elements 


There 
are three basic elements 
of a slave bus 


interface: 
address 
decoders, 
bus 
drivers, 
and 
control signal logic. This section discusses each of 
these elements in general terms. A description of a 
detailed 
implementation 
of a slave interface 
is 


presented in a later section of this application note. 


Address 
Decoding 
- 
This 
logic decodes the 
appropriate 
MULTIBUS address 
bits into.RAM 
requests, ROM requests, or I/O selects. Care must 
be taken in the design of the address decode logic 
to ensure flexibility in the selection of base address 
assignments. 
Without this flexibility, restrictions 
may be placed upon various 
system 
configura- 
tions. 
Ideally, switches and jumper connections 
should 
be associated 
with the decode logic to 
permit field modification of base address assign- 
ments. 


The initial step in designing 
the address decode 
portion of a MULTIBUS interface is to determine 
the required number of unique address locations. 
This 
decision 
is influenced 
by the fact 
that 
address 
decoding is usually done in two stages. 


The first stage 
decodes the base address, 
pro- 
ducing 
an 
enable 
for the second stage 
which 
generates 
the actual 
device selects for the user 


logic. 
A convenient 
implementation 
of this two 
stage decoding scheme utilizes a pair of decoders 
driven by the high order bits of the address for the 
first stage and a second decoder for the low order 
bits of the address bus. This technique forces the 
number of unique address locations to be a power 
of two, based at the address decoded by the first 
stage. 
Consider the scheme illustrated 
in Figure 


13. 


As shown in Figure 13,the address bits A4 -ABare 
used to produce switch selected outputs of the first 
stage of decoding. 
The lout 
of 8 binary decoders 


have been used. The top decoder decodes address 
lines A4 - A7. and the bottom decoder decodes 
address lines A8 -AB. If only address lines AO-A7 
are being used for device selection, as in the case of 
I/O 
port selection in 8-bit systems, 
the bottom 
decoder may be disabled by setting switch S2 to the 
ground position. 
Address lines A7 and AB drive 
enable 
inputs 
E2 or E3 of the decoders. 
The 
address 
lines AO - A3 enter 
the second stage 
address decoder to produce 8 user device selects. 
The second stage decoder must first be enabled by 
an address that corresponds to the switch-selected 
base address. 


Address decoding must be completed before the 
arrival of a command. 
Since the command may 
become active within 50 ns after stable address, 
the decode logic should be kept simple with a 
minimal number of layers of logic. Furthermore, 
the timing is extremely critical in systems which 
make use of the inhibit lines. 


A linear or unary select scheme in which no binary 
encoding of device address (e.g., address 
bit AO 
selects device 0, address bit Al selects device 1, 
etc.) is performed is not recommended because the 
scheme 
offers 
no protection 
in case multiple 
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devices are simultaneously 
selected, and because 
the addressing 
within such a system is restricted 
by the extent of the address space occupied by such 
a scheme. 


Data 
Bus 
Drivers 
- 
For user designed 
logic 
which simply receives data from the MULTIBUS 
data lines, this portion of the bus interface logic 
may only consist of buffers. 
Buffers are required 
to ensure that maximum allowable bus loading is 
not exceeded by the user logic. 


In systems 
where the user designed 
logic must 
place data onto the MULTIBUS data lines, three- 
state drivers are required. These drivers should be 
enabled 
only when 
a memory 
read command 
(MRDC/) or an I/O read command 
(lORC/) 
is 
present and the module has been addressed. 


When both the read and write functio'ns are re- 
quired, parallel bidirectional bus drivers (e.g., Intel 
8226,8287, etc.) are used. A note of caution must be 
included for the designer 
who uses this type of 
device. 
A problem may arise if data hold time 
requirements 
must 
be satisfied 
for user 
logic 
following write operations. 
When bus commands 
are used to directly produce both the chip select for 
the bidirectional bus driver and a strobe to a latch 
in the user logic, removal of that signal may not 
provide the user's latch with adequate data hold 
time. Depending on the specifics of the user logic, 
this 
problem 
may 
be solved 
by permanently 
enabling 
the data buffer's receiver circuits 
and 
controlling only the direction of the buffers. 


Control 
Signal 
Logic - The control signal logic 
consists of the circuits that forward the I/O and 
memory read/write 
commands to their respective 
destinations, 
provide 
the bus with 
a transfer 
acknowledge 
response, 
and 
drive 
the 
system 
interrupt lines. 


Bus Command Lines 


The MULTI BUS information 
transfer 
protocol 
lines 
(MRDC/, 
MWTC/, 
lORD/. 
and 
lOWC/) 


should be buffered by devices with very high speed 
switching. 
Because the bus DC requirements 
specify that each board may load these lines with 
2.0 mA, Schottky devices are recommended. 
LS 
devices are not recommended 
due to their poor 
noise immunity. 
The commands should be gated 


with a signal indicating the base address has been 
decoded to generate read and write strobes for the 
user logic. 


Transfer Acknowledge Generation 


The user interface transfer 
acknowledge genera- 
tion logic provides 
a transfer 
acknowledge 
re- 
sponse, XACK/, to notify the bus master that write 
data provided by the bus master has been accepted 
or that read data it has requested is available on 
the MULTIBUS data lines. XACK/ allows the bus 
master to conclude its current instruction. 


Since XACK/ timing requirements depend on both 
the CPU of the bus master and characteristics 
of 
the user logic, a circuit is needed which will provide 
a range of easily modified acknowledge responses. 


The transfer acknowledge signals must be driven 
by three-state drivers which are enabled when the 
bus interface 
is addressed 
and 
a command 
is 
present. 


Interrupt 
Signal Lines 


The asynchronous 
interrupt 
lines must be driven 
by open collector devices with a minimum drive of 
16 mA. 


In a typical Non Bus Vectored InteQ"upt system, 
logic must be provided to assert and latch-up an 
interrupt 
signal. 
In addition 
to driving 
the 
MULTIBUS interrupt 
lines, the latched interrupt 
signal would be read by an 110 operation such as 
reading the module's status. 
The interrupt signal 
would be cleared by writing to the status register. 


III. MULTIBUSTM SLAVE 
DESIGN 
EXAMPLE 


A MULTIBUS 
slave design 
example 
has been 
included in this application 
note to reinforce the 
theory previously discussed. 
The design example 
is of general purpose 110 slave interface. 
This 
design example could easily be modified to be used 
as a slave 
memory 
interface 
by buffering 
the 
address 
signals 
and 
using 
the 
appropriate 
MULTIBUS memory commands. 
In addition, to 
help the reader better understand 
an application 
for an 110 slave interface, two Intel 8255A Parallel 
Peripheral 
Interface (PPI) devices are shown con- 
nected to the slave interface. 


The design 
example 
is shown 
in both 8/16-bit 
version and an 8-bit version. The 8/16-bit version 


is an 110 interface 
which will permit 
a 16-bit 
master to perform 8 or 16 bit data transfers. 
8-bit 
masters 
may also use the 8/16-bit version of the 
design example to perform 8-bit data transfers. 


The 8-bit version of the design example may be 
used by both 8 or 16-bit masters, 
but will only 
perform 8-bit data transfers. 
It does not contain 
the 
circuitr.y 
required 
to perform 
16-bit data 
transfers. 


Both the 8/16-bit version and the 8-bit version of 
the design example were implemented on an iSBC 
905 prototype board. 
The schematics 
for each of 
the examples are given in Appendices F and G. 


This 
section 
describes 
the organization 
of the 
slave 
interface 
from 
two points 
of view, the 
functional 
point of view and the programming 
characteristics. 
First, 
the 
principal 
functions 
performed by the hardware are identified and the 
general data flow is illustrated. 
This point of view 
is intended 
as an introduction 
to the detailed 
description provided in the next section; Theory of 
Operation. 
In the 
second 
point 
of view, the 
information needed by a programmer to access the 
slave is summarized. 


Functional 
Description 
- 
The function of this 
110 slave is to provide the bus interface logic for 
general purpose 110 functions 
and for two Intel 
8255A Parallel Peripheral Interface (PPI) devices. 
Eight device selects (port addresses) are available 
for general purpose 110 functions. 
One of these 
device select lines is used to read and reset the state 
of an interrupt 
status 
flip-flop, the other seven 
device selects 
are unused 
in this 
design. 
An 
additional 
eight 110 device port addresses 
are 
used by the two 8255A devices; four 110 port 
addresses 
per 8255A (three 110 port address 
for 
the three parallel ports A, B, and C and the fourth 
110 port address for the device control register). 


Figure 14 contains 
a functional 
block diagram of 
the slave design example. 
This block diagram 
shows the fundamental 
circuit elements of a bus 
slave: 
bidirectional 
data 
bus drivers/receivers, 


address decoding logic and bus control logic. Also 
shown is the address 
decoding logic for the low 
order four bits, the interrupt logic which is selected 
by this decoding logic, and the two 8255A devices. 
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Programming 
Characteristics 
- 
The slave 
design example provides 16 110 port addresses 
which may be accessed by user software. 
The 
base address of the 16 contiguous port addresses 
is selected by wire wrap connections on the proto- 
type board. 
The wire wrap connections 
specify 
address 
bits ADR4/ - ADRB/. 
They allow the 
selection 
of a base 
address 
on any 
16 byte 
boundary. 
Twelve address bits (ADRO/ -ADRB/) 
are used since 16-bit (8086 based) masters use 12 
bits to specity 110 port addresses. If an 8 bit (8080 
or 8085based) master is used with this slave board, 
the high order address bits (ADR8/ -ADRB/) must 
not be used by the decoding circuits; a wire wrap 
jumper position (ground position) is provided for 
this. 


The 16 110 port addresses 
are divided into two 
groups of8 port addresses by decoding address line 
ADR3/. 
Port addresses 
XXO- XX7 are used for 
general 
I/O 
functions 
(XX indicates 
any hexi- 
decimal digit combination). 
Port address XXOis 
used for accessing the interrupt status flip-flop and 


port addresses 
XXI - XX7 are not used in this 
example. 
Port addresses XX8 - XXF are used for 
accessing the PPIs. 
If port addresses XX8 . XXF 
are selected, then ADRO/ is used to specify which 
of two PPIs are selected. 
If the address is even 
(XX8,XXA, XXC,or XXE)then one PPI is selected. 
If the address is odd (XX9, XXB, XXD, or XXF), 
then the other PPI is selected. ADRl/ and ADR2/ 
are 
connected 
directly 
to the PPIs. 
Table 
1 
summarizes 
the 110 port addresses 
of the slave 


design example. 
Note that if a 16-bit master is 


used, it is possible to access ,the slave in a byte or 
word mode. 
If word access is used with port 
address 
XX8, XXA, XXC, or XXE, then 
16 bit 
transfers 
will occur between the PPIs 
and the 


master. 
These 16 bit transfers 
occur because an 
even address has bel?nspecified and the MULTI- 
BUS BHEN/ 
signal 
indicates 
that 
a 16-bit 


transfer is requested. 


In the preceding section, each of the slave design 
example 
functional 
blocks was identified 
and 
briefly explained. This section explains how these 
functions are implemented. 
For detailed circuit 


information, refer to the schematics in Appendices 
F and G. The schematic in Appendix F is on a 
foldout page so that the following text may easily 
be related to the schematic. 


The discussion ofthe theory ofoperation is divided 
into five segments, 
each of which discusses 
a 


different function performed by the MULTIBUS 
slave design example. 
The five segments are: 


1. Bus address decoding 


2. Data buffers 


3. Control signals 


4. Interrupt 
logic 


5. PPI operation 


Each of these topics are discussed with regard to 
the 
8/16-bit 
version 
of the design 
example; 


followed by a discussion of the circuit elements 
which are required by the 8-bit version of the 
interface. 
. 


Bus Address 
Decoding 
- Bus address decoding 


is performed by two 82051 out of8 binary dp.coders. 
One decoder (A3) decodes address bits ADR8/ - 
ADRB/ 
and 
the second decoder (A2) decodes 


address bits ADR4/ . ADR7/. 
The base address 


I/O 
PORT 
ADDRESS 
BYTEACCESS 


XXO 


XX1 -XX? 


XX8 


XX9 


XXA 


XXB 


XXC 


XXD 


XXE 


XXF 


XXO 


XX2 - XX6 


XX8 


XXA 


XXC 


XXE 


Table 
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SLAVE 
DESIGN 
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PORT 
ADDRESSES 


READ 


Bit'0 = Interrupt Status 


Unused 


. Parailel Port A, Even ",Pi 


Parallel Port A, Odd PPI 


Parallel Port B, Even PPI 


Parallel Port B, Odd PPI 


Parallel Port C, Even PPI 


Par.allelPort C, Odd PPI 


Illegal Condition 


Illegal Condition 


Bit 0 = Interrupt Status 


Unused 


Parallel Port A, Even and Odd PPls 


Parallel Port B, Even and Odd PPls 
Parallel Port C, Even and Odd PPls 


Illegal Condition 


Reset Interrupt Status 


Unused 


Parallel Port A, Even PPI 


Parallel Port A, Odd PPI 


Parallel Port S, Even PPI 


Parallel Port B, Odd PPI 


Parallel Port C, Even PPI 


Parallel Port C, Odd PPI 


Control, Even PPI 


Control, Odd PPI 


Reset Interrupt Status 


Unused 


Parallel Port A, Even and Odd PPts 


Parallel Port B, Even and Odd PPls 


Parallel Port C, Even and Odd PPls 


Control, Even and Odd PPls 


selected is determined 
by the position 
of wire wrap 


jumpers. 
The 
outputs 
of the 
two decoders 
are 
ANDed together 
to form the BASE AUR SELECTI 


signal. 
This 
signal 
specifies 
the 
base 
address 
for a group of Hi I/O ports. 
Using 
the wire ""rap 


jumper 
positions 
shown 
in the schematic, 
a base 
address' 
of E3 has been selected. 
Therefore, 
this 


- MULTIBUS 
slave board will respond 
to I/O port 


addresses 
in the E30 - EJF range. 


If this slave board is to be used with 8-bit MULTI- 
BUS masters, 
the high prder address 
bits must not 


be decoded. 
Therefore, 
the 
wire 
wr,ap jumper 


which 
selects 
the output 
of decoder 
A3 must 
be 
placed 
in the top (ground) 
position 
(pin 10 of gate 
A9 to ground). 


The low order 4 address 
lines (ADROI 
-AURJ/) 
are 
buffered 
and 
inverted 
using 
74LS04 
inverters. 


These 
address 
lines 
are 
input 
to 
an 
8205 for 


decoding 
a chip select for the interrupt 
logic; the 
address 
lines are also used directly 
by the PPIs. 


LS-Series logic is required 
for buffering 
to meet the 
MULTIBUS 
specification 
for IlL (low level input 


current). 
S-Series or standard 
series logic will not 


meet this specification.' 


Address 
decoder 
A4 is used to decode addresses 
EJU - E:J7. 'The CSOI output of this decoder is used 
to select the interrupt 
logic, thus I/O port address 
EJO is used to read and reset the interrupt 
latch. 


The remaining 
outputs 
from decoder 
A4 (CSlI 
- 


CS7 I) are not used in this example. 
They would 


normally 
be 'used 
to select 
other 
functions 
in a 


slave board with more capability. 
Note that in the 
schematic 
shown 
in Appendix 
G for the 
il-bit 
version 
of this 
slave 
design 
example, 
the high 
order 
(AD Rill 
- ADRB/) 
address 
decoder 
is not 
included 
and the BHENI 
signal 
is not used. 


Data 
Buffers 
- 
Intel 
il2il7 il-bit 
parallel 
bi- 


directional 
bus drivers 
are used for the MULTI- 
BUS data 
lines DATUI - UATF/. 
In the il/16-bit 
version 
of the 
slave 
board, 
three 
il2il7 drivers 
are used. 


When an 8-bit data 
transfer 
is requested, 
either 


driver 
A5, which 
is connected 
to on-board 
data 


lines DO- D7, or driver A6, which is connected to 
on-board data lines D8 . DF, is used. 
If a byte 
transfer is requested from an even address, driver 
A5 will be selected. If a byte transfer from an odd 
address is requested, driver A6 will be selected. All 
byte transfers 
take place on MULTIBUS 
data 
lines 
DATOI 
. DAT7I. 
When a word (16-bit) 
transfer is requested from an even address, drivers 
A5 and A7will be used. Note that if a user program 
requests 
a word transfer 
from an odd address, 
16·bit masters 
in the iSBC product 
line will 
actually perform two byte transfer 
requests. 


The logic which 
determines 
the chip selection 


(8287 input signal OE, output enable) signals for 
the bus drivers 
uses the low order address 
bit 


(ADRO/) 
and 
the 
buffered 
Byte 
High 
Enable 
signal 
(BHENBLI). 
Note that the MULTIBUS 
signal BHENI 
has been buffered with an 74LS04 
inverter. 
This is done to meet the bus address line 
loading specification. 
The SWAP BYTEI 
signal 
which is generated is qualified by the BD ENBLI 
signal and used to select the bus drivers. 


The steering pin for the 8287 drivers is labelled T 
(transmit) 
and is driven by the signal RD. When 
an input (read) request is active or when neither a 
read 
or write command 
is being 
serviced, 
the 
direction of data transfer of the 8287 will be set for 
B toA. 


The 8287 drivers are set to point IN (direction B to 
A) when no MULTIBUS I/O transfer command is 
being serviced for two reasons. 
First, ifthe driver 
were pointed OUT (direction A to B) and a write 
command occured, it would be necessary 
to turn 
the buffers IN and set the OE (output enable) 
signal active before the data could be transferred 
to the on-board bus. 
A possibility 
of a "buffer- 
fight" could occur in some designs if the OE signal 
permitted 
an 8287 to drive the MULTIBUS data 
lines momentarily 
before the steering signal could 
switch the direction of the 8287. In this case, both 
the MULTI BUS master 
and the slave would be 
driving the data lines; this is not recommended. 
(In this particular 
design, the steering signal will 
always 
stabilize 
before the OE signal 
becomes 
active.) 


The second reason the driver is pointing IN when 
no command is present is due to the "data valid 
after WRITE" requirements 
of the 8255As. The 
8255A requires that data remain on its data lines 
for 30 ns after the WRITE command (WR at the 
8255A) is removed. This requirement will be met if 
the direction of the 8287 drivers is not switched 


when the MULTIBUS IOWCI 
signal is removed 
(WRTI 
could have been used to steer the 8287 
instead of RD); and if the capacitance 
of the on- 
board data bus lines is sufficient to hold the data 
values on the bus after the 8287 OE signal and the 
8255A PPI WRTI signal go inactive. Theon-board 
data 
bus may easily be designed such that the 
capacitance 
of the lines is sufficient to meet the 30 
ns data hold time requirement. 
In addition, the 
current leakage of all devices connected to the on· 
board bus must be kept small to meet the 30 ns data 
hold time requirement. 


The 8-bit version of this design example uses only 
one 8287 instead of the three required by the 8/16- 
bit version. The logic required to control the swap 
byte buffer is also not necessary. 
The chip select 
signal used for the 8287 is the BD ENBLI 
signal. 


Control 
Signals 
- 
The MULTIBUS 
control 
signals 
used by this slave 
design 
example 
are 
IORC/, 
IOWC/, 
and XACKI. 
IORCI 
and IOWCI 


are qualified by the BASE ADR SELECTI signal 
to form the signals RD and WRT. RD and WRT 
are used to drive the interrupt 
logic, the PPI logic 
and the XACKI 
(transfer 
acknowledge) logic. 


For the XACKI 
logic RD and WRT are ORed to 
form the BD ENBLI 
signal which is inverted and 
used to drive the CLEAR pin of a shift register. 
When the slave board is not being accessed, the 
CLEAR pin of the shift register will be low (BD 
ENBLI 
is high). 
This causes the shift register to 
remain cleared and all outputs of the shift register 
will be low. When the slave board is accessed, the 
CLEAR pin will be high, and the A and B inputs 
(which are high) will be clocked to the output pins 
by CCLKI. 
To select a delay for the XACKI 
signal, 
a jumper must be installed 
from one of the shift 
register 
output pins to the 8089 tri-state 
driver. 


Each of the shift register 
output pins select an 
integer multiple of CCLKI 
periods for the signal 
delay. 
Since the CCLKI 
signal is asynchronous, 


the actual 
delay selected may only be specified 
with a tolerance 
of one CCLKI 
period. 
In this 
example 
a delay 
of 3 - 4 CCLKI 
periods 
was 
selected; with 
a CCLKI 
period of 100 ns, the 
XACKI 
delay would occur somewhere within the 
range 
of 300 - 400 ns from the time when the 
CLEAR signal goes high. 


The control signal logic used in the 8-bit version of 
the slave design example is identical to the logic 
used in the 8/16-bit version. 


Interrupt 
Logic 
- 
The interrupt 
logic uses a 
74S74 flip-flop to latch an asynchronous 
interrupt 
request from some external logic. The Q outp~t 
of the INTERRUPT REQUEST LATCH is output 
through 
an 
open collector gate 
to one of the 
MULTIBUS 
interrupt 
lines. 
The state 
of the 
INTERRUPT 
REQUEST 
LATCH is transferred 
to the INTERRUPT 
STATUS LATCH when a 
read command 
is performed 'on I/O port BASE 
ADDRESS+O (E30 for the jumper configuration 
shown). The Q output ofINTERRUPT 
STATUS 
LATCH is used to drive data line DOof the on- 
board data bus by using an 8089 tri-state driver. 
If a user program performs an INPUT from I/O 
port E30, data bit 0 will be set to 1 if ~he INTER- 
RUPT REQUEST LATCH is set. 


The purpose of INTERRUPT STATUS LATCH is 
to minimize the possibility 
of the asynchronous 
interrupt 
occuring while the interrupt 
status 
is 
being read by a bus master. 
If the latch was not 
included in the design and an asynchronous 
inter- 
rupt 
did occur while a bus' master 
is reading 
MULTIBUS data line DATOI, 
a data buffer on the 
master 
could go into a meta-stable 
state. 
By 
adding 
the extra latch, which is clocked by the 
IORDI 
command for I/O port E30, the possibility 
of data line DATOI 
changing during a bus master 
read operation is eliminated. 


The INTERRUPT 
REQUEST LATCH is cleared 
when a user program performs an 0UTPUT to I/O 
port E30. 


This 
interrupt 
structure 
assumes 
that 
several 
interrupt 
sources may exist on the same MULTI- 
BUS interrupt line (for example, INT3/). 
When the 
MULTIBUS master gets interrupted, 
it must poll 
the possible sources of the interrupt received and 
after determining 
the source of the interrupt, 
it 
must clear the INTERRUPT 
REQUEST LATCH 
for that particular 
interrupt source. 


The interrupt 
.logic for the 8-bit version of the 
design example is identical to the interrupt logic of 
the 8/16-bit 
version of the design example. 


PPI Operation 
- Two 8255A Parallel Peripheral 
Interface 
(PPI) devices are shown interfaced 
to 
the slave design example logic. One PPI is con- 
nected to the on-board data bus lines DO- D7 and 
is addressed 
with the even I/O 
port addresses 
E38, E3A, E3C, and E3E. 
The second PPI is 
connected to data bus lines D8 -DF and is address- 
ed with the odd I/O 
port addresses 
E39, E3B, 


E3D, and E3F. The even or odd I/O port selection 
is controlled by using the ADRO address line in 
the chip select term of the PPIs. 
In addition, the 
odd PPI 
(All) 
is selected when the BHENBL 
term is high. 
This occurs when the MULTIBUS 
signal 
BHENI 
is low indicating 
that 
a word 
(16-bit) I/O instruction 
is being executed. 
When 
a word I/O instruction 
is executed, both PPIs will 
perform the I/O operation specified. 


The specifications 
of the 8255A device state that 
the address lines AO and' A1 and the chip select 
lines must be stable before the RD or WR lines are 
activated. 
The MULTIBUS specification address 
set-up time of 50 ns and the short gate propagation 
delays in this design assure that the address lines 
are stable before RD or WR are active. 


The data 
hold requirements 
of the 8255A were 
discussed in a previous section. The 8255A speci- 
fication states that data will be stable on the data 
bus lines a maximum 
of 250 ns after a READ 
command. 
This specification 
was used to select 
the delay for the XACKI 
signal. 


The PPI operation 
for the 8-bit version 
of the 
design eXilmple is slightly different than that used 
for the 8/16-bit 
version. The chip select signal for 
the bottom PPI does not use the BHENBL term 
since 16-bit data transfers are not possible with an 
8-bit I/O slave board. 
Also, the chip select and 
address signals have been swapped so the top PPI 
occupies I/O 
address 
range 
X8 - XB, and the 
bottom PPI occupies I/O address range XC-XF (X 
is the base address 
of the 8-bit version). 
This 
swapping of the address lines was not necessary; 
however, it was thought to be more convenient to 
access the PPIs in two groups of 4 contiguous I/O 
port addresses. 


This application 
n~ie has shown the structure of 
the Intel MULTIBUS system bus. The structure 
supports a wide range of system modules from the 
Intel OEM Microcomputer Systems product line 
that 
can be extended with the addition 
of user 
designed 
modules. 
Because the user designed 
modules are no doubt unique to particular applica- 
tions, a goal of this application 
note has been to 
describe in detail the singular common element - 
the 
bus 
interface. 
Materiai 
has 
also 
been 
presented to assist the systems designer to under- 
standing 
the bus 
functions 
so that 
successful 
systems integration 
can be achieved. 


APPENDIX A 


PIN ASSIGNMENT 
OF BUS SIGNALS ON MULTIBUS BOARD P1 CONNECTOR 


, 


(COMPONENT 
SIDE) 
(CIRCUIT SIDE) 


PIN 
MNEMONIC 
DESCRIPTION 
PIN 
MNEMONIC 
DESCRIPTION 


1 
GND 
Signal GND 
2 
GND 
Signal 
GND 
3 
+5V 
+ 5Vdc 
4 
+5V 
+ 5Vdc 
POWER 
5 
+5V 
+ 5Vdc 
6 
+5V 
+ 5Vdc 
SUPPLIES 
7 
+12V 
+12Vdc 
8 
+12V 
+12Vdc 


9 
-5V 
-5Vdc 
10 
-5V 
-5Vdc 
11 
~ND 
Signal GND 
12 
GND 
Signal GND 


13 
BCLK! 
Bus Clock 
14 
INIT/ 
Initialize 
15 
BPRN! 
Bus Pri. In 
16 
BPRO! 
Bus Pri. Out 
BUS 
17 
BUSY! 
Bus Busy 
18 
BREOI 
Bus Request 


CONTROLS 
19 
MRDC! 
Mem Read Cmd 
20 
MWTC! 
Mem Write Cmd 
21 
10RC! 
I!O Read Cmd 
22 
10WCI 
110 Write Cmd 
23 
XACK! 
XFER Acknowledge 
24 
INH1! 
Inhibit 1 disable 
RAM 


BUS 
25 
Reserved 
26 
INH2! 
Inhibit 2 disable 
PROM or ROM 


CONTROLS 
27 
BHEN! 
Byte High Enable 
28 
AD10! 


AND 
29 
CBRO! 
Common 
Bus Request 
30 
AD11! 
Address 


ADDRESS 
31 
CCLKI 
Constant Clk 
32 
AD12! 
Bus 
33 
INTA! 
Intr Acknowledge 
34 
AD131 


35 
INT6! 
Parallel 
36 
INT7! 
Parallel 


INTERRUPTS 
37 
INT41 
Interrupt 
38 
INT51 
Interrupt 
39 
INT2! 
Requests 
40 
INT31 
Requests 
41 
INTOI 
42 
INT1! 


43 
ADRE! 
44 
ADRF! 


45 
ADRC! 
46 
ADRD! 
47 
ADRAI 
Address 
48 
ADRB/ 
Address 


ADDRESS 
49 
ADR8! 
Bus 
50 
ADR9! 
Bus 
51 
ADR6! 
52 
ADR7! 
53 
ADR4! 
54 
ADR5! 
55 
ADR2! 
56 
ADR3! 
57 
ADRO! 
58 
ADR1! 


59 
DATEI 
60 
DATF! 
' 
!' 


61 
DATC! 
62 
DATD! 
63 
DATA! 
Data 
64 
DATB! 
Data 


DATA 
65 
DAT8! 
Bus 
66 
DAT9! 
Bus 
67 
DAT6! 
68 
DAT7! 
69 
DAT4! 
70 
DAT5! 
71 
DAT2! 
72 
DAT3! 
73 
DATO! 
74 
DAT1! 


75 
GND 
Signal GND 
76 
GND 
Signal GND 
77 
Reserved 
78 
Reserved 
POWER 
79 
-12V 
-12Vdc 
80 
-12V 
-12Vdc 
SUPPLIES 
81 
+5V 
+5Vdc 
82 
+5V 
+ 5Vdc 
83 
+5V 
+5Vdc 
84 
+5V 
+ 5Vdc 
85 
GND 
Signal GND 
86 
GND 
Signal GND 


All 
Mnemonics 
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APPENDIX 
A (Continued) 


P2 CONNECTOR 
PIN ASSIGNMENT 
OF OPTIONAL 
BUS 
SIGNALS 


(COMPONENT 
SIDE) 
(CIRCUIT SIDE) 


PIN 
MNEMONIC 
DESCRIPTION 
PIN 
MNEMONIC 
DESCRIPTION 


1 
GND 
Signal GND 
2 
GND 
Signal GND 
3 
5 VB 
+ 5V Battery 
4 
5 VB 
+ 5V Battery 
5 
Reserved 
6 
VCCPP 
+ 5V Pulsed Power 
7 
-5 VB 
-5V Battery 
8 
-5 VB 
-5V Battery 
9 
Reserved 
10 
Reserved 
11 
12 VB 
+ 12V Battery 
12 
12 VB 
+ 12V Battery 


13 
PFSRf 
Power Fail Sense Reset 
14 
Reserved 
15 
-12 VB 
-12V Battery 
16 
-12 VB 
-12V Battery 
17 
PFSNI 
Power Fail Sense 
18 
ACLO 
ACLow 
19 
PFINI 
Power Fail Interrupt 
20 
MPROI 
Memory Protect 


21 
GND 
Signal GND 
22 
GND 
Signal GND 
23 
+15V 
+15V 
24 
+15V 
+15V 
25 
-15V 
-15V 
26 
-15V 
-15V 
27 
PAR11 
Parity 1 
28 
HALTI 
Bus Master HALT 
29 
PAR21 
Parity 2 
30 
WAITI 
Bus Master WAIT STATE 
31 
1\ 
32 
ALE 
Bus Master ALE 
33 
34 
Reserved 
35 
36 
Reserved 
37 
38 
AUX RESETI 
Reset switch 
39 
40 
40 
42 
43 
Reserved 
44 
45 
46 
47 
48 
49 
50 
Reserved 
51 
52 
53 
54 
55 
56 
57 
56 
59 
I 
60 


Notes: 


1. PFIN, on slave modules, 
if possible, 
should have the option of connecting 
to INTOI on P1. 
2. All undefined 
pins are reserved 
for future use. 


All Mnemonics 
© Intel Corporation 
1978 
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Parameter 
Description 
Minimum 
Maximum. 
Units 


; 
. 


tBCY 
Bus Clock Period 
100 
DC. 
ns 


tBW 
Bus Clock Width 
0.35 tBCY 
065 tBCY 
. 


tSKEW 
BCLK/skew 
3 
ns 


tpD 
Standard 
Bus 
3 
Propagation 
Delay 


tAS 
Address 
Set-Up Time 
50 
ns 
(at Slave Board) 


tDS 
Write Data Set· 


" 


50 
ns 


Up Time 


tAH 
Address 
Hold Time 
50 
ns 


tDHW 
Write Data Hold Time 
50 
ns 


tDXL 
Read Data Set 
0 
ns 


Up Time To XACK 


tDHR 
Read Data Hold Time 
0 
65 
ns 


tXAH 
Acknowledge 
Hold 
0 
65 
, 
ns 
Time 


tXACK 
Acknowledge 
Time 
0 
tTOUT 
ns 


tCMD 
Command 
Pulse 
. 
100 
tTOUT 
ns 


Width 


tlD 
Inhibit Delay 
0 
100 
ns 
(Recommend 
< 100 ns) 


tXACKA 
Acknowledge 
Time of 
tIAD+ 50 ns 
tTOUT 
of an Inhibited 
Slave 


tXACKB 
Acknowledge 
Time of 
15 
tTOUT 
IJ.s 
an Inhibiting 
Slave 


tlAD 
Acknowledge 
Disable 
0 
100 
ns 


from Inhibit (An 
(arbitrary) 
internal 
parameter 
on 
an inhibited 
slave; 
used to determine 
tXACKA Min.) 


tAIZ 
Address 
to Inhibits 
100 
ns 
High delay 


tlNTA 
INTAI Width 
250 
ns 


tCSEP 
Command 
Separation 
100 
ns 


APPENDIX B (Continued) 


BUS TIMING SPECI,=ICATIONS SUMMARY 


Parameter 
Description 
Minimum 
Maximum 
Units 


tBREOL 
IBCLKI 
to BREOI 
0 
35 
ns 


Low Delay 


tBREOH 
IBCLKI 
to BREOI 
0 
35 
ns 


High Delay 


tBPRNS 
BPRNI 
10 IBCLKI 
22 
ns 


Setup Time 


tBUSY 
BUSY I delay 
0 
70 
ns 


from IBCLKI 


tBUSYS 
BUSY Ito IBCLKI 
25 
ns 


Setup Time 


tBPRO 
IBCLKI 
to BPROI 
0 
40 
~s 


(CLK to Priority Out) 


tBPRNO 
BPRNI to BPROI 
0 
30 
ns 


(Priority 
In to Out) 


ICBRO 
IBCLK/to 
CBROI 
0 
60 
ns 


(CLKto Common 
Bus Request) 


tCBROS 
CBROI to IBCLKI 
35 
ns 


Setup Time 


tCPM 
Central 
Priority 
0 
tBCy-tBREO 
Module Resolution 
-2tPD 
Delay (Parallel 
-tBPRNS 
Priority) 
-tSKEW 


tCCY 
C-clock Period 
100 
110 
ns 


tcw 
C-clock Width 
0.35 tCCY 
0.65tCCY 
ns 


tlNIT 
INIT/Width 
5 
ms 


tlNITS 
INIT/to 
MPROI 
100 
ns 
Setup Time 


tPBD 
Power Backup 
0 
200 
ns 
Logic Delay 


tPFINW 
PFINI Width 
2.5 
ms 


tMPRO 
MPROI Delay 
20 
2.5 
ms 


tACLOW 
ACLOI Width 
3.0 
ms 


tPFSRW 
PFSRI Width 
100 
ns 


tTOUT 
Timeout 
Delay 
5 
00 
ms 


IDCH 
D.C. Power Supply 
30 
ms 


Hold from ALCOI 


tDCS 
D.C. Power Supply 
5 
ms 
Setup to ACLOI 


APPENDIX 
C 


BUS DRIVERS, 
RECEIVERS, 
AND 
TERMINATIONS 


Driver 
1,3 
Receiver 
2,3 
Termination 


Bua 5lgnala 
location 
Type 
10l 
10H 
Co 
location 
III 
IIH 
CI 
location 
Type 
R 
Units 


Mlnma 
Mln"" 
MUpt 
M1xma 
M.x~. MUpl 


DATOI-OATFI 
Masters 
TRI 
16 
-2000 
300 
Masters 
-0.8 
125 
18 
, place' 
Pullup 
22 
KQ 


(16 lines) 
and Slaves 
and Slaves 


AOROI-AORBI, 
Masters 
TRI 
16 
-2000 
300 
Slaves 
-08 
125 
18 
1 place 
Pullup 
22 
KQ 


BHENI 
(21 lines) 


MROC/,MWTCI 
Masters 
TRI 
32 
-2000 
300 
Slaves 
-2 
125 
18 
1 place 
Pull up 
, 
KQ 


(Memory; 
memory- 
mapped 
110) 


10RC/,IOWCI 
Masters 
TRI 
32 
-2000 
300 
Slaves 
-2 
125 
18 
1 place 
Pullup 
1 
KQ 


(110) 


XACKI 
Slaves 
TRI 
32 
-2000 
300 
Masters 
-2 
125 
18 
I place 
Pullup 
510 
Q 


INH1/,INH2I 
Inhibiting 
OC 
16 
- 
300 
Inhibited 
-2 
50 
18 
1 place 
Pullup 
, 
KQ 


Slaves 
Slaves 
(RAM, PROM, 
ROM, Memory- 
Mapped 
110) 


BClKI 
I place 
TTl 
48 
-3000 
300 
Master 
-2 
125 
18 
Mother- 
To + 5V 
220 
Q 


(Master 
us) 
board 
ToGNO 
330 
Q 


BREal 
Each 
TTl 
5 
-400 
60 
Central 
2 
50 
18 
Central 
Pull up 
I 
KQ 


Master 
Priority 
Priority 


Module 
Module 
(not req) 


BPROI 
Each 
TTl, 
5 
-400 
60 
Next Master 
-1.6 
50 
18 
(not req) 


Master 
. 
in Serial 
Priority 
Chain at 
-' 
its BPRNI 


BPRW 
Parallel: 
TTl 
5 
-400 
300 
Master 
-2 
50 
Inot req) 


Central 
, 


Priority 
Module 
.... 


Sertal:Prev 
Masters 
BPROI 


BUSY/. 
CBRa 
All Masters 
OC 
32 
- 
300 
All Masters 
-2 
50 
18 
1 place 
Pullup 
1 
KQ 


INITI 
Master 
OC. 
32 
- 
300 
All 
-2 
50 
18 
1 place 
PulllJp 
2.2 
KQ 


CClKI 
1 place 
TTl 
48 
-3000 
300 
Any 
-2 
125 
18 
Mother- 
To + 5V 
220 
Q 


board 
ToGNO 
330 
Q 


INTAI 
Masters 
TRI 
32 
-2000 
300 
Slaves 
-2 
125 
18 
1 place 
Pullup 
I 
KQ 


(Interrupting 


1/0) 


INTO/-INT7I 
Slaves 
OC 
16 
- 
300 
Masters 
-1.6 
40 
18 
1 place 
Pull up 
, 
KQ 


18lines) 


PFSRI 
User's 
Fron 
TTl 
16 
-400 
300 
Slaves, 
-1.6 
40 
18 
I place 
Pullup 
1 
KQ 


Panel? 
Masters 


PFSNI 
Power 
Back 
TTl 
16 
-400 
300 
Masters 
-1.6 
40 
16 
1 place 
Pullup 
I 
KQ 


Up Unit 


AClO 
Power 
O.C. 
16 
-400 
300 
Slaves, 
-1.6 
40 
18 
1 place 
Pullup 
1 
KQ 
Supply 
Masters 


PFINI 
Power 
Back- 
O.C 
16 
-400 
300 
Masters 
-1.6 
40 
18 
1 place 
Pullup 
1 
KQ 


Up Unit 


MPROI 
Power 
Back- 
TTl 
16 
-400 
300 
Slaves 
-1.6 
40 
18 
1 place 
Pullup 
1 
KQ 


Up Unit 
Masters 


Driver 1,3 


Type 
IOL 
IOH 
Co 


Mlnma 
Mln~a 
MUpf 


IlL 
IIH 
C, 
Location 
Type 


MUma 
MU~a 
MUpl 


User's 
Front 
Panel? 


Notes: 


,. 
Driver 
Requirements 


, OH 
= High Output Current 
Drive 
IOL 
= Low Output 
Current 
Drive 
Co 
= 
Capacitance 
Drive Capability 
TRI 
= 3-State Drive 
O.C. 
= Open Collector 
Driver 
TTL 
= 
Totem-pale 
Driver 


2. 
Receiver 
Requirements 


IIH 
High Input Current 
Load 
IlL 
= Low Input Current 
Load 
CI 
= Caoacitive 
Load 


3. TTL 
low 
slale 
must 
be 2. -0.5v 
bul 
.s:.0.8v 
al the 
receivers 
TTL 
high 
slate 
must 
be2. 
2.Ov but.s:. 5.5v al the 
receivers 


4. 
For 
the 
iSSC 
80/10 and 
Ihe 
iSSC 
8o/10A·use 
only 
a lK 
PUII_Uip resislor 
to +5v for 
SCLKI 
and 
CCLK/lerminalion. 


APPENDIX D 


BUS POWER SPECIFICATIONS 


Standard (P1) 
I 
Optional (P2) 


Analog Power 
Battery Power Backup 


Ground 
+5 
+ 12 
-12 
+15 
-15 
+5 
+12 
-12 
-5 


Mnemonic 
GND 
+5V 
+ 12V 
-12V 
+ 15V 
-15V 
+58 
+128 
-128 
-58 


8us Pins 
P1+ 1,2, 
Pl + 3,4, 
Pl + 7,8 
Pl + 79, P2+ 23, 
P2+ 25, 
P2+ 3,4, 
P2+ 11, 
P2+ 15, 
P2-7,8 


11,12, 
5,6,81, 
80 
24 
26 
5,6 
12 
16 


75,76 
82,83, 


85,86 
84 


Nominal Output 
Ref. 
+5.0V 
+ 12.0V 
-12.0V 
+ 15.0V 
-15.0V 
+5.0V 
+ 12.0V 
-12.0V 
-5.0V 


Tolerance from 
Nominal' 
Ref. 
±5% 
±5% 
±5% 
±3% 
±3% 
±5% 
±5% 
±5% 
±5% 


Ripple 
(Pk-Pk)' 
Ref. 
50 mV 
50 mV 
50 mV 
10mV 
10mV 
50 mV 
50 mV 
50 mV 
50 mV 


Transient 
Response 
500/Ls 
500/Ls 
500/Ls 
100/LS 
100/Ls 
500/Ls 
500/Ls 
500/Ls 
500/Ls 
Time' 


Transient 
Deviation' 
±10% 
±10% 
±10% 
± 10% 
± 10% 
± 10% 
±10% 
±10% 
±10% 


NOTES: 


1. Tolerance 
is worst case, including 
initial 
voltage setting 
line and load effects 
of power source, temperature 
drift, and any additional 
steady 


state influences. 


2. As measured 
over any bandwidth 
not to exceed 0 to 500 kHz. 


3. As measured 
from the start of a load change to the time an output 
recovers within 
± 0.1 % of final voltage. 


4. Measured as the peak deviation 
from the initial 
voltage. 


APPENDIX 
E 


MECHANICAL 
SPECIFICATIONS 


MUL llBUS 
CONNECTOR: 
8I-PIN, 0.156 SPACING 
coe VFB01E43DOOAl 
VIKING 
2VH43/1ANES 


AUXILIARY 
CONNECTOR: 
GO·PIN.0.100 SPACING 


COC VPB01BJOOOOAl 
11 HJ1tl30 
AMP PE5·14559 


EJECTOR TYPE: 
SCANBE -5203 


BUS DRIVERS AND RECEIVERS SHOULD 8E lOCATED 
AS CLOSE AS POSSIBLE TO 


THEIR RESPECTIVE MUL TIBUS PIN CONNeCTIONS 
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inter 


iAPX 86/10 
16-BIT HMOS MICROPROCESSOR 


8086/8086-2/8086-1 


• 14 Word, by 16-Bit Register Set with 
Symmetrical Operations 


• Bit, Byte, Word, and Block Operations 


• 8 and 16-Bit Signed and Unsigned 
Arithmetic in Binary or Decimal 
Including Multiply and Divide 


• Range of Clock Rates: 
5 MHz for 8086, 
8 MHz for 8086-2, 
10 MHz for 8086-1 


• MULTIBUS™ System Compatible 
Interface 


• Direct Addressing Capability to 1 
MByte of Memory 


• Architecture Designed for Powerful 
Assembly Language and Efficient High 
Level Languages. 


The 
Intel 
iAPX 86/10 
high 
performance 
16-bit 
CPU is available 
in three 
clock 
riltes: 
5, 8 and 
10 MHz. The CPU is 


implemented 
in N-Channel, 
depletion 
load, silicon 
gate technology 
(HMOS), and packaged 
in a 40-pin 
CerDIP package. 


The iAPX 86/10 operates 
in both single 
processor 
and multiple 
processor 
configurations 
to achieve 
high 
performance 
levels. 


DATA. 


POINTER. 
AND 


INDEX 
REGS 


III WOROS) 


8US 
INTERFACE 
UNIT 
I 
RElOCATION 
l 


REGISTER 
fiLE 


SEGMENT 
REGISTERS 
AND 


INSTRUCTION 


POINTER 


tSWQROSI 


GND 
vcc 


AD14 
AD15 


A013 
A16/S3 


AD12 
A 17/54 


AD11 
A18/55 


A010 
A19/S6 


AD. 
BHE/S7 


AD8 
MN/MX 


AD7 
Rii 


AD6 
RQ/GTO 
(HOLD) 


ADS 
ROfGT1 
(HLOA) 


AD. 
LOCK 
(WA) 


AD3 
52 
(MliO) 


AD' 
51 
(OT/R) 


AD' 
so 
(DEN) 


ADD 
asD 
(ALE) 


NMI 
as, 
(INTA) 


INTR 
TeST 


ClK 
READY 


GND 
RESET 


40 LEAD 


Table 1. Pin Description 


The following 
pin function 
descriptions 
are for iAPX 86 systems in either minimum 
or maximum 
mode. The "Local 


Bus" in these descriptions 
is the direct multiplexed 
bus interface connection 
to the 8086 (without regard to additional 


bus buffers). 


Symbol 
Pin No. 
Type 
Name and Function 


AD15-ADo 
2-16,39 
I/O 
Address Data Bus: These lines constitute 
the time multiplexed 
memoryllO address (T1) 


and data (T2,T3,Tw, T4l bus. Ao is analogous to EiRE for the lower byte of the data bus, 
pins DrDo. It is LOW during T1 when a byte is to be transferred 
on the lower portion of 


the bus in memory or 1/0 operations. 
Eight-bit oriented 
devices tied to the lower half 


would normally use Ao to condition 
chip select functions. 
(See ~.) 
These lines are 


active HIGH and float to 3-state OFF during interrupt acknowledge 
and local bus "hold 


acknowledge." 


A1g1Se, 
35-38 
0 
Address/Status: 
During T1 these are the four most sign i- 
A1s1S5, 
ficant address lines for memory operations. 
During I/O 
A17/S4, 
operations 
these lines are LOW. During memory and I/O 
Al11S4 
A1s'S3 ChafllCter1It1CI 
A1eiS3 
operations, 
status 
information 
is available 
on these 
lines during T2,T3,Tw, and T4. The status of the interrupt 
o (LOW) 
0 
Alternate 
Data 


enable FLAG bit (Ss) is updated at the beginning of each 
0 
, 
Stack 


'(HIGH) 
0 
Code 
or None 


CLK cycle. A17/S4 and A1eiS3are encoded as shown. 
, 
1 
Data 


This information 
indicates 
which 
relocation 
register 
is 
56 is 0 


(LOW) 


presently being used for data accessing. 


These lines float to 3·state OFF during local bus "hold 
acknOWledge." 


BREis7 
34 
0 
Bus High Enable/Status: During T1 the bus high enable 
signal 
(~) 
should 
be used to enable data onto the 
most significant 
half of the data bus, pins D15-D8.Eight- 
~ 
Ao 
Chllfllc1ertltlcI 


bit oriented 
devices 
tied to the upper half of the bus 
0 
0 
Whole 
word 


would normally 
use BHE to condition 
chip select func- 
0 
1 
Upper 
byte 
fromf 


tions. 
BHE is LOW during T1 for read, write, and inter- 
to odd 
address 


rupt acknowledge 
cycles when a byte is to be transfer- 
, 
0 
Lower 
byte 
froml 


red on the high portion of the bus. The S7status informa- 


to even 
address 


tion is available during T2,T3,and T4. The signal is active 
, 
, 
None 


LOW, and floats to 3-state OFF in "hold." 
It is LOW dur- 
ing T1 for the first interrupt 
acknowledge 
cycle. 


RD 
32 
0 
Read: Read strobe indicates that the processor Is performing 
a memory of I/O read cy- 


cle, depending 
on the state of the S2 pin. This signal is used to read devices which 


reside on the 8086 local bus. RD is active LOW during T2, T3 and Tw of any read cycle, 
and is guaranteed to remain HIGH in T2 until the 8086 local bus has floated. 


This signal floats to 3-state OFF in "hold acknowledge." 


READY 
22 
I 
READY: is the acknowledgement 
from the addressed memory or I/O device that it will 


complete the data transfer. The READY signal from memory/IO is synchronized 
by the 


8284A Clock Generator to form READY. This signal is active HIGH. The 8086 READY in- 
put is not synch.ronlzed. Correct 
operation 
is not guaranteed 
if the setup and hold 


times are not met. 


INTR 
18 
I 
Interrupt 
Request: is a level triggered 
input which is sampled during the last clock cy- 


cle of each instruction 
to determine 
if the processor 
should enter into an interrupt 


acknowledge 
operation. A subroutine 
Is vectored to via an interrupt vector lookup table 


located in system memory. It can be internally masked by software resetting the inter- 
rupt enable bit. INTR is internally 
synchronized. 
This signal is active HIGH. 


TEST 
23 
I 
TEST: input is examined by the "Wait" 
instruction. 
If the TEST input is LOW execution 


continues, 
otherwise the processor waits in an "Idle" 
state. This input is synchronized 


internally 
during each clock cycle on the leading edge of CLK. 


inter 


Symbol 
Pin No. 
Type 
Name and Function 


NMI 
17 
I 
Non·maskable interrupt: an edge triggered input which causes a type 2 interrupt. A 
subroutine 
is vectored to via an interrupt 
vector lookup table located in system 
memory. NMI is not maskable internally by software. A transition from a lOW to HIGH 
initiates the interrupt at the end of the current instruction. This input is internally syn- 
chronized. 


RESET 
21 
I 
Reset: causes the processor to immediately terminate its present activity. The signal 
must be active HIGH for at least four clock cycles. It restarts execution, as described in 
the Instruction Set description, when RESET returns lOW. RESET is internally syn- 
chronized. 


ClK 
19 
I 
Clock: provides the basic timing for the processor and bus controller. It is asymmetric 
with a 33% duty cycle to provide optimized internal timing. 


Vcc 
40 
Vcc: + 5V power supply pin. 


GND 
1,20 
Ground 


MN/MX 
33 
I 
Mlnlmum/Maxlmum: 
indicates what mode the processor is to operate in. The two 
modes are discussed in the following sections. 


The fol/owing 
pin function 
descriptions 
are for the 808618288system in maximum mode (i.e., MNIMX = VssJ.Only the 
pin functions 
which are unique to maximum mode are described; aI/ other pin functions 
are as described above. 


52,51, 
So 
26-28 
0 
Status: active during T4, T1, and T2 and is returned to the 
passive state (1,1,1)during T3 or during Tw when READY 
52 
51 So 
Cher.cteristies 


is HIGH. This status is used by the 8288 Bus Controller 
OILOW) 
0 
0 
Interrupt 


to generate all memory and I/O access control signals. 
Acknowledge 


Any change by52, 8" 
or SOduring T4 is used to indicate 
0 
0 
1 
Read 
I/O Port 
, 
0 
1 
0 
Write 
1/0 Port 
the beginning of a bus cycle, and the return to the pas- 
o 
1 
1 
Halt 


sive state in T3 or Tw is used to indicate the end of a bus 
'(HIGH) 
0 
0 
Code 
Access 


cycle. 


1 
0 
1 
Read 
Memory 


1 
1 
0 
Write 
Memory 


These signals float to 3-state OFF in "hold acknowl- 


1 
1 
1 
Passive 


edge." These status lines are encoded as shown. 


RQ/GTo, 
30, 31 
I/O 
Request/Grant: pins are used by other local bus masters to force the processor to 
RO/m', 
release the local bus at the end of the processor's current bus cycle. Each pin is 
bidirectional 
with RQ/GTo 
having higher priority than RQ/GT1. RQ/GT has an internal 
pull-up resistor so may be left unconnected. The request/grant sequence is as follows 
(see Figure 9): 


1. A pulse of 1 ClK wide from another local bus master indicates a local bus request 
("hold") to the 8086 (pulse 1). 


2. During aT4orT) clock cycle, a pulse 1ClK wide from the 8086to the requesting master 
(pulse 2), indicates that the 8086 has allowed the local bus to float and that it will enter 
the "hold acknowledge" state at the next ClK. The CPU's bus interface unit is discon- 
nected logically from the local bus during "hold acknowledge." 


3. A pulse 1 ClK wide from the requesting master indicates to the 8086 (pulse 3) that 
the "hold" request is about to end and that the 8086 can reclaim the local bus at the 
next ClK. 


Each master-master exchange of the local bus is a sequence of 3 pulses. There must 
be one dead ClK cycle after each bus exchange. Pulses are active lOW. 


If the request is made while the CPUis performing a memory cycle, it will releasethe local 
bus during T4 of the cycle when all the following conditions are met: 


1. Request occurs on or before T2. 
2. Current cycle is not the low byte of a word (on an odd address). 
3. Current cycle is not the first acknowledge of an interrupt acknowledge sequence. 
4. A locked instruction is not currently executing. 


inter 


Symbol 
Pin No. 
Type 
Name and Function 


If the local bus is idle when the request is made the two possible events will follow: 


1. Local bus will be released during the next clock. 
2. A memory cycle will start within 3 clocks. Now the four rules for a currently active 


memory cycle apply with condition number 1 already satisfied. 


LOCK 
29 
0 
LOCK: output indicates that other system bus masters are not to gain control of the 
system bus while LOCK is active LOW. The LOCK signal is activated by the "LOCK" 
prefix instruction and remains active until the completion of the next instruction. This 
signal is active LOW, and floats to 3-state OFF in "hold acknowledge." 


as1, aso 
24,25 
0 
Queue Status: The queue status is valid during the CLK 
cycle after which the queue operation is performed. 
as, and aso provide status to allow external tracking of 
the internal 8086 instruction queue. 
, 


The fol/owing 
pin function 
descriptions 
are for the 8086 in minimum 
mode (i.e., MN/MX 
= Vee). Only/he 
pin functions 
which 


are unique 
to minimum 
mode are described; 
aI/ other pin functions 
are as described 
above. 
. 


MilO 
28 
0 
Status line: logically equivalent to S2in the maximum mode. It is used to distinguish a 
memory access from an I/Oaccess. M/JObecomes valid in the T4 preceding a bus cycle 
and remains valid until the final T4 of the cycle (M= HIGH, 10= LOW). M/JO floats to 
3-state OFF in local bus "hold acknowledge." 


WR 
29 
0 
Write: indicates that the processor is performing a write memory or write I/O cycle, 
depending on the state of the M/JOsignal. WR is active for T2,T3 and Tw of any write cy- 
cle. It is active LOW, and floats to 3-state OFF in local bus "hold acknowledge." 


INTA 
24 
0 
INTA is used as a read strobe for interrupt acknowledge cycles. It is active LOW during 
T2,T3 and Tw of each interrupt acknowledge cycle. 


ALE 
25 
0 
Address Latch Enable: provided by the processor to latch the address into the 8282/ 
8283 address latch. It is a HIGH pulse active during T, of any bus cycle. Note that ALE 
is never floated. 


DT/R 
27 
0 
Data Transmit/Receive: needed in minimum system that desires to use an 8286/8287 
data bus transceiver. It is used to control the direction of data flow through the 


, 
transceiver. Logically DT/R is equivalent to S; in the maximum mode, and its timing is 
the same as for M/iO.(T= HIGH, R= LOW.)This signal floats to 3-state OFF in local bus 
"hold acknowledge." 


DEN 
26 
0 
Data Enable: provided as an output enable for the 8286/8287 in a minimum system 
which uses the transceiver. DEN is active LOWduring each memory and I/Oaccess and 
for INTA cycles. For a read or INTA cycle it is active from the middle of T2until the mid- 
dle of T4, while for a write cycle it is active from the beginning of T2until the middle of 
T4. DEN floats to 3-state OFF in local bus "hold acknOWledge." 


HOLD, 
31,30 
I/O 
HOLD: indicates that another master is requesting a local bus "hold." To be acknowl- 


HLDA 
edged, HOLD must be active HIGH. The processor receiving the "hold" 
request will 
issue HLDA(HIGH)as an acknowledgement in the middle of a T4 or T1clock cycle. Simul- 
taneous with the issuance of HLDA the processor will float the local bus and control 
lines. After HOLD is detected as being LOW,the processor will LOWer HLDA, and when 
the processor needs to run another cycle, it will again drive the local bus and control 
lines. 


-- 
The same rules as for RQIGTapply regarding when the local bus will be released. 
. . 


HOLD is not an asynchronous input. External synchronization should be provided if the 
system cannot otherwise guarantee the setup time. 


inter 


The internal functions of the iAPX 86/10 processor are 
partitioned logically into two processing units. The first is 
the Bus Interface Unit (BIU) and the second is the Exe- 
cution 
Unit (EU) as shown in the block 
diagram of 


Figure 1. 


These units can interact directly but for the most part 
perform as separate asynchronous operational process- 
ors. The bus interface unit provides the functions related 
to instruction fetching and queuing, operand fetch and 
store, and address relocation. This unit also provides the 
basic bus control. The overlap of instruction pre-fetching 
provided by this unit serves to increase processor perfor- 
mancethrough improved bus bandwidth utilization. Up to 
6 bytes of the instruction stream can be queued while 
waiting for decoding and execution. 


The instruction 
stream queuing mechanism allows the 


BIU to keep the memory utilized very efficiently. When- 
ever there is space for at least 2 bytes in the queue, the 
BIU will attempt a word fetch memory cycle. This greatly 
reduces "dead time" on the memory bus. The queue 
acts as a First-In-First-Out (FIFO) buffer, from which the 
EUextracts instruction bytes as required. If the queue is 
empty (following a branch instruction, for example), the 
first byte into the queue immediately becomes available 
to the EU. 


The execution 
unit receives pre·fetched instructions 


from the BIU queue and provides un-relocated operand 
addresses to the BIU. Memory operands are passed 
through the BIU for processing by the EU,which passes 
results to the BIU for storage. See the Instruction Set 
description 
for further 
register set and architectural 


descriptions. 


MEMORY 
ORGANIZATION 
The processor provides a 20-bit address to memory which 
locates the byte being referenced. The memory is orga- 
nized as a linear array of up to 1 million bytes, addressed 
as OOOOO(H) 
to FFFFF(H).The memory is logically divided 


into code, data, extra data, and stack segments of up to 
64K bytes each, with each segment falling on 16-byte 
boundaries. (See Figure 3a.) 


All memory 
references 
are made relative 
to base 


addressescontained in high speedsegment registers. The 
segment types were chosen based on the addressing 
needsof programs. The segment register to be selected is 
automatically chosen according to the rules of the follow- 
ing table. All information in one segment type share the 
same logical attributes (e.g. code or data). By structuring 
memory into relocatable areas of similar characteristics 
and by automatically selecting segment registers, pro- 
grams are shorter, faster, and more structured. 


Word (16-bit) operands can be located on even or odd 
address boundaries and are thus not constrained 
to 


even boundaries as is the case in many 16-bit com- 
puters. For address and data operands, the least signifi· 
cant byte of the word is stored in the lower valued 
address location and the most significant 
byte in the 


next higher address location. The BIU automatically per· 
forms the proper number of memory accesses, one if 
the word operand is on an even byte boundary and two if 
it is on an odd byte boundary. Except for the perfor· 
mance penalty. this double access is transparent to the 
software. This performance penalty does not occur for 
instruction fetches, only word operands. 


Physically, the memory is organized as a high bank 
(D15-Dal and a low bank (07-00) 
of 512K 8·bit bytes 


addressed in parallel by the processor's address lines 


A19- A1. Byte data with even addresses is transferred on 
the 07-00 bus lines while odd addressed byte data (Ao 
HIGH) is transferred on the 015-08bus lines.The process- 
or provides two enable signals, SHE and Ao,to selectively 
allow reading from or writing into either an odd byte 
location, even byte location, or both. The instruction 
stream is fetched from memory as words and isaddressed 
internally by the processor to the byte level as necessary. 


Memory 
Segment 
Register 
Segment 
Reference 
Need 
Used 
Selection 
Rule 


Instructions 
CODE (CS) 
Automatic with all instruction prefetch. 


Stack 
STACK (SS) 
All stack pushes and pops. Memory references relative to BP 
base register except data references. 


Local Data 
DATA(OS) 
Data references when: relative to stack, destination of string 
operation, or explicitly overridden. 


External (Global) Data 
EXTRA(ES) 
Destination of string operations: Explicitly selected using a 
segment override. 


inter 
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In referencing word data the BIU requires one or two 
memory cycles depending on whether or not the start- 
ing byte of the word is on an even or odd address, 
respectively. Consequently, in referencing word oper· 
ands performance can be optimized by locating data on 
even address boundaries. This is an especially useful 
technique for using the stack, since odd address refer- 
ences to the stack may adversely affect the context 
switching 
time for interrupt processing or task multi- 


plexing. 


Certain locations in memory are reserved for specific 
CPU operations (see Figure 3b.) locations 
from address 


FFFFOH through FFFFFH are reserved for operations 
including a jump to the initial program loading routine. 
Following RESET,the CPU will always begin execution 
at location FFFFOH where the jump must be. locations 
OOOOOHthrough 
003FFH are reserved for 
interrupt 


operations. Each of the 256 possible interrupt types has 
its service routine pointed to by a 4-byte pointer element 


consisting of a 16-bit segment address and a 16-bit off- 
set address. The pointer elements are assumed to have 
been stored at the respective places in reserved memory 
prior to occurrence of interrupts. 


RESET BOOTSTRAP 
PROGRAM 
JUMP 


INTERRUPT 
POINTER 
FOR TYPE 255 


INTERRUPT 
POINTER 
FOR TYPE 
1 


INTERRUPT 
POINTER 
FOR TYPE 0 


The requirements for supporting minimum and maximum 
iAPX 86/10 
systems are sufficiently different that they 


cannot be done efficiently 
with 40 uniquely defined 


pins. Consequently, the 8086 is equipped with a strap 
pin (MN/MX) which defines the system configuration. 
The definition of a certain subset of the pins changes 
dependent on the condition 
of the strap pin. When 


MN/MX pin is strapped to GND, the 8086 treats pins 24 
through 31 in maximum mode. An 8288 bus controller 
interprets status information coded into 50,5,,82 to gen- 
erate bus timing and control signals compatible with 
the MUl TIBUS™ architecture. When the MN/MX pin is 
strapped to Vcc, the 8086 generates bus control signals 
itself on pins 24 through 31, as shown in parentheses in 
Figure 2. Examples of minimum mode and maximum 
mode systems are shown in Figure 4. 
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The 86/10 has a combined address and data bus com- 
monly referred to as a time multiplexed bus. This tech· 
nique provides the most efficient 
use of pins on the 
processor while permitting the use of a standard 40-lead 
package. This "local bus" can be buffered directly and 
used throughout the system with address latching pro· 
vided on memory and I/O modules. In addition, the bus 
can also be demultiplexed at the processor with a single 
set of address latches if a standard non-multiplexed bus 
is desired for the system. 


Each processor bus cycle consists of at least four ClK 
cycles. These are referred to as T" T2, T3 and T4 (see 
Figure 5). The address is emitted from the processor 
during T1 and data transfer occurs on the bus during T3 
and T4'T2is used primarily for changing the direction of 
the bus during read operations. In the event that a "NOT 
READY" indication 
is given by the addressed device, 
"Wait" states (Tw) are inserted between T3and T4. Each 
inserted "Wait" state is of the same duration as a ClK 
cycle. Periods can occur between 8086 bus cycles. 
These are referred to as "Idle" states (T,)or inactive ClK 
cycles. The processor uses these cycles for internal 
housekeeping. , 


During T, of any bus cycle the ALE (Address latch 
Enable) signal is emitted (by either the processor or the 
8288 bus controller, depending on the MN/MX strap). At 
the trailing edge of this pulse, a valid address and cer- 
tain status information for the cycle may be latched. 


Status bits Sa, 5" 
and S2are used, in maximum mode, 


by the bus controller to identify the type of bus transac- 
tion according to the following table: 


52 
S, 
So 
CHARACTERISTICS 
a (lOW) 
a 
a 
Interrupt Acknowledge 
a 
a 
1 
Read I/O 
a 
1 
a 
WriteI/O 
a 
1 
1 
Halt 
1 (HIGH) 
a 
a 
Instruction 
Fetch 
1 
a 
1 
Read Data from Memory 
1 
1 
a 
Write Data to Memory 
1 
1 
1 
Passive (no bus cycle) 


Status bits S3 through S7 are multiplexed 
with high- 
order address bits and the BHE signal, and are therefore 
valid during T2 through T4. S3 and S4 indicate which 
segment register (see Instruction Set description) was 
used for this bus cycle in forming the address, accord- 
ing to the following table: 


54 
53 
CHARACTERISTICS 
a (lOW) 
a 
Alternate Data (extra segment) 
a 
1 
Stack 
1 (HIGH) 
a 
Code or None 
1 
1 
Data 


S5isa reflection of the PSWinterrupt enable bit. S6=0and 
S7is a spare status bit. 


In the 86/10,I/Ooperations can address up to a maximum 
of 64K I/O byte registers or 32K I/O word registers. The 
I/O address appears in the same format as the memory 
address on bus lines A15-Ao. The address lines A19-A16 
are zero in I/O operations. The variable I/O instructions 
which use register OXasa pointer havefull address capa- 
bility while the direct I/O instructions directly address one 
or two of the 256 I/O byte locations in page 0 of the I/O 
address space. 


I/O ports are addressed in the same manner as memory 
locations. Even addressed bytes are transferred on the 
DrDo bus lines and odd addressed bytes on D'5-Da. 
Care must be taken to assure that each register within 
an 8-bit peripheral located on the lower portion of the 
bus be addressed as even. 


inter 
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Processor initialization or start up is accomplished with 
activation (HIGH) of the RESETpin. The 8086 RESETis 
required to be HIGH for greater than 4 ClK cycles. The 
8086will terminate operations on the high-going edge of 
RESET and will remain dormant as long as RESET is 
HIGH. The low-going transition of RESET triggers an 
internal reset sequencefor approximately 10ClK cycles. 
After this interval the 8086 operates normally beginning 
with the instruction in absolute locatjon FFFFOH(see 
Figure3Bl.Thedetailsof this operation arespecified inthe 
Instruction Set description of the MCS-86 Family User's 
Manual.The RESETinput is internally synchronized tothe 
processor clock. At initialization the HIGH-to-lOW trans- 
ition of RESET must occur no sooner than 50 IJ-S after 
power-up, to allow complete initialization of the 8086. 


NMI may not be asserted prior to the 2nd ClK cycle fol- 
lowing the end of RESET. 


Interrupt operations fall into two classes; software or 
hardware initiated. The software initiated interrupts and 
software aspects of hardware interrupts are specified in 
the Instruction Set description. Hardware interrupts can 
be classified as non-maskable or maskable. 


Interrupts result in a transfer of control to a new pro- 
gram location. A 256-element table containing address 
pointers 
to the interrupt 
service program locations 


resides in absolute 
locations 
0 through 3FFH (see 


Figure 3b), which are reserved for this purpose. Each 
element in the table is 4 bytes in size and corresponds 
to an interrupt "type". An interrupting device supplies 
an 8-bit type number, during the interrupt acknowledge 


sequence, which is used to "vector" 
through the ap- 


propriate element to the new interrupt service program 
location. 


NON·MASKABLE INTERRUPT (NMI) 


The processor provides a single non·maskable interrupt 
pin (NMI)which has higher priority than the maskable in- 
terrupt request pin (INTR).A typical use would be to ac- 
tivate a power failure routine. The NMI is edge-triggered 
on a lOW·to·HIGH transition. The activation of this pin 
causes a type 2 interrupt. (See Instruction Set descrip· 
tion.) 


NMI is required to have a duration in the HIGH state of 
greater than two ClK cycles, but is not required to be 
synchronized to the clock. Any high-going transition of 
NMI is latched on-chip and will be serviced at the end of 
the current instruction or between whole moves of a 
block-type instruction. 
Worst case response to NMI 


would be for multiply, divide, and variable shift instruc· 
tions. There is no specification on the occurrence of the 
low-going edge; it may occur before, during, or after the 
servicing of 
NMI. Another high-going edge triggers 


another response if it occurs after the start of the NMI 
procedure. The signal must be free of logical spikes in 
general and be free of bounces on the low-going edge to 
avoid triggering extraneous responses. 


MASKAI3LE INTERRUPT (INTR) 


The 86/10provides a single interrupt request input (INTR) 
which can be masked internally by software with the 
resetting of the interrupt enable FLAG status bit. The 
interrupt request signal is level triggered. It is internally 
synchronized during each clock cycle on the high-going 
edge of ClK. To be responded to, INTR must be present 
(HIGH)during the clock period preceding the end of the 
current instruction 
or the end of a whole move for a 


block-type instruction. 
During the interrupt 
response 


sequence further interrupts are disabled_The enable bit 
is reset as part of the response to any interrupt (INTR, 
NMI, software interrupt or single-step), although the 
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Figure 6. Interrupt 
Acknowledge 
Sequence 
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nllt"'U~I. 
Until 
me 010 r-LAI.:i::; 
register is restored the 
enable bit will be zero unless specifically 
set by an 
instruction. 


During the response sequence (figure 6) the processor 
executes 
two 
successive 
(back-to-back) 
interrupt 
acknowledge cycles. The 8086 emits the LOCK signal 
from T2 of the first bus cycle until T2 of the second. A 
local bus "hold" 
request will not be honored until the 
end of the second bus cycle. In the second bus cycle a 
byte is fetched from the external interrupt system (e.g., 
8259A PIC) which identifies 
the source (type) of the 
interrupt. This byte is multiplied by four and used as a 
pointer into the interrupt vector lookup table. An INTR 
signal left HIGH will be continually responded to within 
the limitations of the enable bit and sample period. The 
INTERRUPTRETURNinstruction includes a FLAGS pop 
which returns the status of the original interrupt enable 
bit when it restores the FLAGS. 


When a software "HALT" 
instruction 
is executed the 
processor indicates that it is entering the "HALT" state 
in one of two ways depending upon which mode is 
strapped. In minimum mode, the processor issues one 
ALE with no qualifying bus control signals. In Maximum 
Mode, the processor issues appropriate HALT status on 
828180 and the 8288 bus controller issues one ALE. The 
8086 will not leave the "HALT" state when a local bus 
"hold" 
is entered while in "HALT". 
In this case, the 
processor reissues the HALT indicator. An interrupt 
request or RESETwill force the 8086 out of the "HALT" 
state. 


READ/MODIFY/WRITE 
(SEMAPHORE) 
OPERATIONS 
VIA LOCK 


The LOCK status information is provided by the proc- 
essor when directly consecutive bus cycles are required 
during the execution of an instruction. This provides the 
processor with the capability of performing read/modify/ 
write operations on memory (via the Exchange Register 
With 
Memory instruction, 
for example) without 
the 
possibility 
of another system bus master receiving 


intervening 
memory cycles. This is useful in multi- 
processor system configurations 
to accomplish "test 
and set lock" operations. The LOCK signal is activated 
(forced LOW) in the clock cycle following 
the one in 
which 
the 
software 
"LOCK" 
prefix 
instruction 
is 
decoded by the EU. lt is deactivated at the end of the 
last bus cycle of the instruction following the "LOCK" 
prefix instruction. While LOCK is active a request on a 
RQ/GTpin will be recorded and then honored at the end 
of the LOCK. 


EXTERNAL 
SYNCHRONIZATION 
VIA TEST 


As an alternative 
to the interrupts 
and general I/O 
capabilities, 
the 
8086 provides 
a 
single 
software- 
testable input known as the TESTsignal. At any time the 
program may execute a WAIT instruction. If at that time 
the TEST signal is inactive (HIGH), program execution 
becomes suspended while the processor waits for TEST 


repeatedly until that time. This activity does not con- 
sume bus cycles. The processor remains in an idle state 
while waiting. All 8086 drivers go to 3-state OFF if bus 
"Hold"is 
entered. If interrupts are enabled, they may 
occur while the processor is waiting. When this occurs 
the processor fetches the WAIT instruction 
one extra 
time, processes the interrupt, and then re-fetches and 
re-executes the WAIT instruction 
upon returning from 
the interrupt. 


Typical 
system 
configurations 
for 
the 
processor 
operating in minimum mode and in maximum mode are 
shown in Figures 4a and 4b, respectively. In minimum 
mode, the MN/MX pin is strapped to Vcc and the proc- 
essor emits bus control signals in a manner similar to 
the 8085.In maximum mode, the MN/MX pin is strapped 
to Vss and the processor emits coded status informa- 
tion which the 8288 bus controller 
uses to generate 
MULTISUS compatible bus control signals. Figure 5 il- 
lustrates the signal timing relationships. 


AH 
AL 
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CH 
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DL 
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01 
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STACK 
POINTER 


BASE POINTER 


SOURCE 
INDEX 


DESTINATION 
INDEX 


CODE 
SEGMENT 


DATA SEGMENT 


STACK 
SEGMENT 


EXTRA 
SEGMENT 


SYSTEM 
TIMING 
- 
MINIMUM 
SYSTEM 


The read cycle begins in T1 with the assertion of the 
Address Latch Enable (ALE) signal. The trailing (lOW- 
going) edge of this signal is used to latch the address 
information, which is valid on the local bus at this time, 
into the 8282/8283 latch. The SHE and Ao signals 
address the low, high, or both bytes. From T1 to T4 the 
MilO signal indicates a memory or I/O operation. At T2 
the address is removed from the local bus and the bus 
goes to a high impedance state. The read control signal 
is also asserted at T2. The read (RD) signal causes the 
addressed device to enable its data bus drivers to the 
local bus. Some time later valid data will be available on 
the bus and the addressed device will drive the READY 
line HIGH. When the processor returns the read signal 


inter 


to a HIGH level, the addressed device will again 3-state 
its bus drivers. If a transceiver (8286/8287)is required to 
buffer the 8086 local bus, signals DTiR"and DENare pro- 
vided by the 8086. 


A write cycle also begins with the assertion of ALE and 
the emission of the address. The M/iO signal is again 
asserted to indicate a memory or I/O write operation. In 
the T2 immediately following the address emission the 
processor 
emits 
the 
data 
to 
be written 
into 
the 


addressed location. This data remains valid until the 
middle of T4. During T2, T3, and Tw the processor asserts 
the write control signal. The write (WR)signal becomes 
active at the beginning of T2 as opposed to the read 
which is delayed somewhat into T2 to provide time for 
the bus to float. 


The BHE and Ao signals are used to select the proper 
byte(s) of the memoryliO word to be read or written 
according to the following table: 


SHE 
AO 
CHARACTERISTICS 


0 
0 
Whole word 


0 
1 
Upperbyte froml 
to odd address 


1 
0 
Lower byte froml 
to evenaddress 


1 
1 
None 


I/O ports are addressed in the same manner as memory 
location. Even addressed bytes are transferred on the 
DrDo 
bus lines and odd addressed bytes on 0'5-08' 


The basic difference 
between the interrupt acknowl- 


edge cycle 
and a read cycle 
is that 
the interrupt 


acknowledge signal (INTA) is asserted in place of the 


read (Frn) signal and the address bus is floated. (See 
Figure 6.) In the second of two successive INTA cycles, 
a byte of information is read from bus lines DrDo 
as 


supplied by the interrupt system logic (i.e., 8259A Prior· 
ity Interrupt Controller). This byte identifies the source 
(type) of the interrupt. It is multiplied by four and used 
as a pointer into an interrupt vector lookup table, as 
described earlier. 


For medium size systemsthe MN/MX pin is connected to 
Vssand the 8288Bus Controller is added to the systemas 
well asan828218283 latch for latching the systemaddress, 
and a 8286/8287 transceiver to allow for bus loading 
greater than the 8086is capable of handling. Signals ALE, 
DEN, and DTifj are generated by the 8288 instead of the 
processor in this configuration although their timing re- 
mains relatively the same.The 8086status outputs (82, 5" 
and So) provide type-of-cycle information and become 
8288 inputs. This bus cycle information specifies read 
(code, data, or I/O),write (data or I/O), interrupt acknowl- 
edge, or software halt. The 8288 thus issues control 
signals specifying memory read or write, I/O read or write, 
or interrupt acknowledge. The 8288provides two types of 
write strobes, normal and advanced, to be applied as re- 
quired. The normal write strobes have data valid at the 
leading edge of write. The advanced write strobes have 
the sametiming as readstrobes, and hence data isn't valid 
at the leading edge of write. The 8286/8287transceiver 
receives the usual T and OE inputs from the 8288's DTifj 
and DEN. 


The pointer into the interrupt 
vector table, which is 


passed during the second INTA cycle, can derive from 
an 8259A located on either the local bus or the system 
bus. If the master 8259A Priority Interrupt Controller is 
positioned on the local bus, aITL gate is required to 
disable the 8286/8287transceiver when reading from the 
master 
8259A 
during 
the 
interrupt 
acknowledge 
sequence and software "poll". 


inter 


Ambient Temperature 
Under Bias 
O°C to 70°C 
Storage Temperature 
-65°C 
to + 150°C 


Voltage on Any Pin with 
Respect to Ground .......•.......... 
- 1.0 to + 7V 


Power Dissipation 
2.5 Watt 


"NOTICE: Stresses above those listed under "Absolute 
Maximum Ratings" may cause permanent damage to the 
device. This is a stress rating only and functional 
opera- 
tion of the device at these or any other conditions 
above 


those indicated in the operational sections of this specifi- 
cation 
is not implied. 
Exposure 
to absolute 
maximum 


rating conditions 
for extended periods may affect device 


reliability. 


D.C. CHARACTERISTICS 
(8086: 
TA= O°Cto 70°C, Vce = 5V ± 10%) 
(8086-1: TA = O°Cto 70°C, Vee = 5V ± 5%) 
(8086-2: TA= O°Cto 70°C, Vee = 5V ± 5%) 


Symbol 
Parameter 
Min. 
Max. 
Units 
TestConditions 


Vil 
Input Low Voltage 
-0.5 
+0.8 
V 


V1H 
Input High Voltage 
2.0 
Vee+ 0.5 
V 


VOL 
Output Low Voltage 
0.45 
V 
IOl=2.5 mA 


VOH 
Output High Voltage 
2.4 
V 
IOH= -400 
",A 


Ice 
Power Supply Current: 8086 
340 
8086-1 
360 
mA 
TA= 25°C 
8086-2 
350 


III 
Input Leakage Current 
± 10 
",A 
OV,,;:V1N,,;:Vec 


ILO 
Output Leakage Current 
±10 
",A 
0.45V", VOUT'" Vee 


Vel 
'Clock Input Low Voltage 
-0.5 
+0.6 
V 


VeH 
Clock Input High Voltage 
3.9 
Vee+1.0 
V 


Capacitance 
of Input Buffer 


C1N 
(All input except 
15 
pF 
fc= 1 MHz 
ADo - AD15, RQ/GT) 


C10 
Capacitance 
of I/O Buffer 
15 
pF 
fc= 1 MHz 
(ADo- AD15, RQ/GT) 


(8086-1: 
TA = ooe to 70oe, Vcc = 5V ± 5%) 


(8086-2: 
TA = ooe to 70oe. Vcc = 5V ± 5%) 


MINIMUM 
COMPLEXITY 
SYSTEM 
TIMING 
REQUIREMENTS 


Te.t 
Symbol 
Parameter 
8086 
8086-1 (Preliminary) 
8086-2 
Unit. 
Condition. 


Min. 
Max. 
Min. 
Max. 
Min. 
Max. 


TCLCL 
CLK Cycle Period 
200 
500 
100 
500 
125 
500 
ns 


TCLCH 
CLKLowTime 
(% TCLCL)-15 
(%TCLCL)-14 
(% TCLCL)-15 
ns 


TCHCL 
CLK High Time 
(V,TCLCL)+2 
(V, TCLCL)+6 
(V,TCLCL)+2 
ns 


TCH1CH2 
CLK Rise Time 
10 
10 
10 
ns 
From 1.0Vto 


3.5V 


TCL2CLI 
CLK Fall Time 
10 
10 
10 
ns 
From 3.5V to 


1.0V 


TDVCL 
Data in Setup Time 
30 
5 
20 
ns 


TCLDX 
Data in Hold Time 
10 
10 
10 
ns 


TR1VCL 
ROY Setup Time 
35 
35 
35 
ns 


into 8284A (See 


Notes I, 2) 


TCLRIX 
ROY Hold Time 
0 
0 
0 
ns 


into 8284A (See 


Notes 1. 2) 


TRYHCH 
READY Setup 
(% TCLCL)-15 
53 
(% TCLCL)-15 
ns 
Time :nto 8086 


TCHRYX 
READY Hold Time 
30 
20 
20 
ns 
into 8086 


TRYLCL 
READY Inactive 
to 
-8 
-10 
-8 
ns 
CLK (See Note 3) 


THVCH 
HOLD Setup Time 
35 
20 
20 
ns 


TINVCH 
INTR, NMI. TEST 
30 
15 
15 
ns 
Setup Time (See 


Note 2) 


TILtH 
Input Rise Time 
20 
20 
20 
ns 
From 0.8V to 
(Except 
CLK) 
2.0V 


TIHIL 
Input Fall Time 
12 
12 
12 
ns 
From 2.0Vto 


(Except 
CLK) 
0.8V 


inter 


A.C. CHARACTERISTICS 
(Continued) 


TIMING 
RESPONSES 


Ta.t 
Symbol 
Paramatar 
8088 
8086·1 (Prallmlnary) 
8088-2 
Unite 
Condition. 


Min. 
Max. 
Min. 
Max. 
Min. 
Max. 


TCLAV 
Address 
Valid Delay 
10 
110 
10 
50 
10 
60 
ns 


TClAX 
Addrass 
Hold Time 
10 
10 
10 
ns 


TCLAZ 
Address 
Float 
TClAX 
60 
10 
40 
TClAX 
50 
ns 


Delay 


TLHLL 
ALE Width 
TCLCH-20 
TCLCH-10 
TCLCH-10 
ns 
I 


TCLLH 
ALE Active 
Delay 
60 
40 
50 
ns 


TCHLL 
ALE Inactive 
Delay 
85 
45 
55 
ns 


TLlAX 
Address 
Hold Time 
TCHCL-10 
TCHCL-10 
TCHCL-10 
ns 


to ALE Inactive 


TCLDV 
Data Valid Delay 
10 
110 
10 
50 
10 
60 
ns 
·Cl = 20-100 pF 


TCHDX 
Data Hold Time 
10 
10 
10 
ns 
for all 8086 Out- 


puts (In addi· 
TWHDX 
Data Hold Time 
TCLCH-30 
TCLCH-25 
TCLCH-30 
ns 
tion to 8086 self- 
AfterWR 
load) 


TCVCTV 
Control Active 
10 
110 
10 
50 
10 
70 
ns 


Delay 1 


TCHCTV 
Control Active 
10 
110 
10 
45 
10 
60 
ns 


. Delay 2 


TCVCTX 
Control Inactive 
10 
110 
10 
50 
10 
70 
ns 


Delay 


TAZRL 
Address 
Float to 
0 
0 
0 
ns 


READ Active 


TCLRL 
iffi Active 
Delay 
10 
165 
10 
70 
10 
100 
ns 


TCLRH 
RD Inactive 
Delay 
10 
150 
10 
60 
10 
60 
ns 


TRHAV 
AD Inactive to Next 
TCLCL-45 
TCLCL-35 
TCLCL-40 
ns 


Address 
Active 


TCLHAV 
HLDA Valid Delay 
10 
160 
10 
60 
10 
100 
ns 


TRLRH 
RDWid1h 
2TCLCL-75 
2TCLCL-4O 
2TCLCL-50 
ns 


TWLWH 
WRWidth 
2TCLCL-60 
2TCLCL-35 
2TCLCL-4O 
ns 


TAVAL 
Address 
Valid to 
TCLCH-60 
TCLCH-35 
TCLCH-4O 
ns 


ALE Low 


TOLOH 
Output 
Rise Time 
20 
20 
20 
ns 
From 0.8Vto 


2.0V 


TOHOL 
Output 
Fall Time 
12 
12 
12 
ns 
From 2.0Vto 


0.8V 


NOTES: 


1. 
Signal 
at 8284A 
shown 
for 
reference 
only. 


2. 
Setup 
requirement 
for 
asynchronous 
signal 
only 
to guarantee 
recognition 
at next 
eLK. 


3. 
Applies 
only 
to T2 state. 
(8 ns into 
T3). 


inter 


READ 
CYCLE 


(NOTE 
1) 


iWR. 
INTA = VON) 


inter 


INTA 
CYCLE 
OTfR 


(NOTES 
1 & 3) 


RD. WR=VOH 
!Rl!:=Vod 


WRITE 
CYCLE 


<NOTE 1) 


(RD,INTA, 
DTIR=YOH) 


SOFlWARE 
HALT- 


RO, WR, INTA = YOH 


OT/R = INDETERMINATE 


NOTES: 
1. All signals switch between VOHand VOL unless otherwise 
specified. 
2. RDY is sampled 
near the end of T2' T3' Tw to determine 
if Tw machines 
states are to be inserted. 
3. Two INTA cycles run back-to-back. 
The 8086 LOCAL ADDR/DATA BUS is floating 
during 
both IN1A cycles. Control 
signals shown 
for second INTA cycle. 
4. Signals at 8284A are shown for reference only. 
5. All timing 
measurements 
are made at 1.5V unless otherwise 
noted. 


MAX MODE SYSTEM 
(USING 
8288 BUS CONTROLLER) 
TIMING 
REQUIREMENTS 


Te.t 
Symbol 
P.remeter 
8088 
8088·1 (Prellmlnery) 
8088·2 (Preliminary) 
Unit. 
Condition. 


Min. 
Ma". 
Min. 
Max. 
Min. 
Ma". 


TCLCL 
CLK Cycle Period 
200 
500 
100 
500 
125 
500 
ns 


TCLCH 
CLK Low Time 
(%TCLCL)-15 
(% TCLCL)-14 
(% TCLCL)-15 
ns 


TCHCL 
CLK High Time 
(V, TCLCL) +2 
(V, TCLCL) +6 
(V, TCLCL) + 2 
ns 


TCH1CH2 
CLK Rise Time 
10 
10 
10 
ns 
From 1.0Vto 


3.5V 


TCL2CL1 
CLK Fall Time 
10 
10 
10 
ns 
From 3.5Vto 


1.0V 


TDVCL 
Data in Setup Time 
30 
5 
20 
ns 


TCLDX 
Data In Hold Time 
10 
10 
10 
ns 


TR1VCL 
ROY Setup Time 
35 
35 
35 
ns 


into 6264A (See 


Notes 1, 2) 


TCLR1X 
ROY Hold Time 
0 
0 
0 
ns 


into 6264A (See 


Notes 1, 2) 


TRYHCH 
READY Setup Time 
(% TCLCL)-15 
53 
(% TCLCL)-15 
ns 


into 8086 


TCHRYX 
READY Hold Time 
30 
20 
20 
ns 


into 8086 


TRYLCL 
READY Inactive 
to 
-6 
-10 
-6 
ns 


CLK (See Note 4) 


T1NVCH 
Setup Time for 
30 
15 
15 
ns 


Recognition 
(INTR, 
- 


NMI, 'fEST) (See 


Note 2) 


TGVCH 
RQ/GT Setup Time 
30 
12 
15 
ns 


TCHGX 
Rei Hold Time into 
40 
20 
30 
ns 


6066 


TILIH 
Input Rise Time 
20 
20 
20 
ns 
From O.BV to 


(Except 
CLK) 
2.0V 


TIHIL 
Input Fall Time 
12 
12 
12 
ns 
From 2.0Vto 


(Except 
CLK) 
O.BV 


NOTES: 


1. 
Signal 
at 8284A 
or 8288 
shown 
for 
reference 
only. 


2. 
Selup 
requirement 
for 
asynchronous 
signal 
only 
to guarantee 
recognition 
et next 
eLK. 


3. 
Applies 
only 
to T3 and 
wait 
states. 


4. 
Applies 
only 
to T2 state 
(8 ns into 
T3). 


inter 


Teet 
Symbol 
Peramatar 
8086 
8086-1 (Preliminary) 
8086-2 (Preliminary) 
Unite 
Conditione 


Min. 
Max. 
Min. 
Max. 
Min. 
Max. 


TCLML 
Command 
Active 
10 
35 
10 
35 
10 
35 
ns 


Delay (See Note 1) 


TCLMH 
Command 
Inactive 
10 
35 
10 
35 
10 
35 
ns 


Delay (See Note 1) 


TRYHSH 
READY Active 
to 
110 
45 
65 
ns 


Status 
Passive (See 


Note 3) 


TCHSV 
Status Active 
Delay 
10 
110 
10 
45 
10 
60 
ns 


TCLSH 
Status 
Inactive 
10 
130 
10 
55 
10 
70 
ns 


Delay 


TCLAV 
Address 
Valid 
10 
110 
10 
50 
10 
60 
ns 


Delay 


TCLAX 
Address 
Hold Time 
10 
10 
10 
ns 


TCLAZ 
Address 
Float Delay 
TCLAX 
80 
10 
40 
TCLAX 
50 
ns 


TSVLH 
Status Valid to ALE 
15 
15 
15 
ns 


High (See Note 1) 


TSVMCH 
Status Valid to 
15 
15 
15 
ns 


MCE High (See 


Note 1) 


TCLLH 
CLK Low to ALE 
15 
15 
15 
ns 


Valid (See Note 1) 


TCLMCH 
CLK Low to MCE 
15 
15 
15 
ns 


High (See Note 1) 


TCHLL 
ALE Inactive 
Delay 
15 
15 
15 
ns 
Cl ~ 20-100 pF 
(See Note 1) 
for all 8086 Out- 


TCLMCL 
MCE Inactive 
Delay 
15 
15 
15 
ns 
puts (In addi· 


(See Note 1) 
tion to 8086 self- 


load) 


TCLDV 
Data Valid 
Delay 
10 
110 
10 
50 
10 
60 
ns 


TCHDX 
Data Hold Time 
10 
10 
10 
ns 


TCVNV 
Control 
Active 
5 
45 
5 
45 
5 
45 
ns 
Delay (See Note 1) 


TCVNX 
Control 
Inactive 
10 
45 
10 
45 
10 
45 
ns 


Delay (See Note 1) 


TAZRL 
Address 
Float to 
0 
0 
0 
ns 


Read Active 


TCLRL 
RD Active 
Delay 
10 
165 
10 
70 
10 
100 
ns 


TCLRH 
RD Inactive 
Delay 
10 
150 
10 
60 
10 
80 
ns 


TRHAV 
RD Inactive 
to 
TCLCL-45 
TCLCL-35 
TCLCL-40 
ns 
Next Address 
Active 


TCHDTL 
Direction Control 
50 
50 
50 
ns 


Active 
Delay (See 


Note1) 


TCHDTH 
Direction Control 
30 
30 
30 
ns 
Inactive 
Delay (See 


Note 1) 


TCLGL 
GT Active 
Delay 
0 
85 
0 
45 
0 
50 
ns 


TCLGH 
GT Inactive 
Delay 
0 
85 
0 
45 
0 
50 
ns 


TRLRH 
RDWidth 
2TCLCL-75 
2TCLCL-40 
2TCLCL-50 
ns 


TOLOH 
Output 
Rise Time 
20 
20 
20 
ns 
From 0.8V to 


2.0V 


TOHOL 
Output 
Fall Time 
12 
12 
12 
ns 
From 2.0Vto 


0.8V 


inter 


1 


ALE (8288 OUTPUT) 


SEE NOTE 
5 


ROY (8284A 
INPUn 


inter 


T, 
T, 
T, 
T. 


Tw 


\\~---- 


TCHDX- 


DATA 


TCVNX- 


TCLMH 


-TClMH 


FlOAT 


TC~DX 


~ 
/r--------.\.\.-- 
_ 


NOTES: 
1. All signals switch between VOH and VOL unless otherwise 
specified. 


2. RDY is sampled near the end of T2, T3' Tw to determine 
if Tw machines states are to be inserted. 


3. Cascade address is valid between first and second INTA cycle. 
4. Two INTA cycles run back-to-back. 
The 8086 LOCAL ADDRJDATA BUS is floating 
during 
both INTA cycles. Control 
for pointer 
address is shown for second INTA cycle. 
5. Signals at 8284A or 8288 are shown for reference only. 
6. The issuance of the 8288 command 
and control 
signals (MRDC, MWTC, AMWC, IORC, IOWC, AIOWC, INTA and DEN) lags the 
active high 8288 CEN. 
7. All timing 
measurements 
are made at 1.5V unless otherwise 
noted. 


8. Status inactive in state just prior to T4. 


inter 


Cl'\ 
r\ 
I :3rL",.:::::: 
I "9"" 
: 


NOTE: 
1. SETUP REQUIREMENTS 
FOR ASYNCHRO- 


NOUS SIGNALS ONLY TOGUAAANTEE 
RECOGNITION 


AT NEXT eLK 


"'Ou-ADo 
A,.S.-A.t/SJ 
S;.S1,SO 
"".= 
~S7 


Cl'~7LJCR2C<cces 


I 
'" 
!--THVCH 
I 
I 


-i 
,~ 


It' 
-* 
TCCAl 
AO's· .••OO. 
A,tlSt-A,&!S3. 


~S1MIii5 
DTIA".~,i5EN 


inter 


ORTl TRANSFER 


MOV· "VI: 
7 8543tl 
0 
715 
4 3Z • II 
1& 5 4 3 Z I II 
l' 
5 43 
t1 
(I 
DEC 
lIter,mlnl: 
11543! 
I II 
115 
4 3! 
I II 
715432 
I II 
115 
4 3 2 I II 


Re\l,sltrlmemorv 
lolhom 
register 
1 II II II 1 lid 
w 
• od .., ". 
Register/memory 
, 
1 1 1 1 \ 
1 w 
modO 
01 


Immediate 
roreglster/merno/y 
" 
II II 0 11 w 
madOOO 
<I. 
dala 
Aeo,ste, 
10tool 
reg 
I 


Immel:h'lt 
rareg,ster 
101' 
• .., 
data" 
.•• , 
IlliCllangeSIQR 
11111 
II 1 1 w 
mCHlO 11 


Memory 
10 accumulator 
1010000 
••• 
adelr·low 
adelr·hlgh 


CI' 
COIlPIr,: 


Accumu'1I10f 
lomemOly 
1 II 1 II II II 1 w 
add,-IOw 
add,·hlljlh 


Reglsln/memory10 
sellmenl'fg,SIfr 
10001110 
modO'f9 
Rtlllste"lMmoryanclre14l,sler 
011111 
II d w 
• od .., <I• 


Segment.,g,sl" 
roreglster/memory 
10001100 
modDrf1jJ 
<I. 
Immed"te 
W,III reg,ste,/memo,y 
10 
II II II 0 5 w 
moll I 1 1 ". 


dalltls.Ot 


ImmedIate 
wllhaccumul,lOf 
II 11"110 
w 
PUSN·PaIl: 
.u.SASCllaclIIlS\IOlsublr.l(1 
0111 
II 
1 1 I 


"'vlsle,/lIIfmory 
11111111 
IIlOlS 11 
0 
<I. 
au 
Oec,m,IJdrusl 
'0' 
sublr,c! 
00101111 


RegISle, 
01010 
.., 
.Ul 
MulllplYlunslllnedl 
1 I 1 1011 
W 
modI 
0 0 
<I. 


Se;menlt'O'Sler 
000119110 
1.lH.. 
Inleg,. 
mull'ply 
lSlgnedl 
1 I 1 101 
I W 
mod 
I 0 
I 
<I. 


lA. 
ASCltJd.usllor 
muttlply 
" 
0 
I 0 100 
00001010 
PCI,&l'tt: 


OIVOlwlCl'lunslljln'dl 
111 
1011 
W 
moO I 1 0 ". 
ReglSle,/memory 
100011 
1 1 
modO 
0 0 ". 
IOtV'nlege,chvlCletsogn'dl 
111 
101 
I W 
Rtolstel 
01011 
.., 


laD 
ASCtr'dlusttord'v'de 
11010101 
00001010 


Segmenl 
reglSler 
000reOl11 
CIW 
Converl 
byte 
10 worll 
10011000 


ICIiG-hclllnljll: 
CWO Conv,rl 
worO 10 double 
worO 
100 
" 
00 
I 
l 


ReglSler/memory 
wlIllreglsler 
1000011 
w 
.od ..• 
<I• 


Reg,Sler 
wilhilccumulillor 
10010 
..• 


IN·tnllul'rom: 


fl.,dporl 
: " 
10010 
•• 
: 
1I0r! 


\/Jtlilblepo'l 
I I 101 
lOw 


lOGIC 


OUT" 
OUIPUI 10; 
1I0Tlnve'l 
1111011 
•• 
1IIOlI010 
<I. 


f'.edPO'l 
11 
100 
I 1 W 
"'" 
Sill/ill 
SIII',loglCiltlJlltllmellCtell 
110100v 
•• 
1OllId100 
o. 
VJflJblep.o<l 
UI 
SIIotlt09,ut'I\l1l1 
110100y 
•• 
mod 
10 
1 
<I. 
Xl..n-Tr,nsl,lebyl,loAl 
1 10 
I 0 1 " 
SAI 
S/lltl JfllllmellC 
uglll 
I 
10100 
v w 
mod 
I I 
I 
<I. 
W-lo,dEAlorlglSltr 
10001101 
mod .., 
100Ao"letell 
IIOIOOv", 
modOOO 
<I. 


lDl·loJdl/OlnlerloOS 
11000101 
mod .., 11m I 
IlDlllolJl,"ghl 
110 
100 
y W 
mall 0 0 
1 ". 
lfl·loJdpolnl~loE5o 
11000100 
.., 
IClRolJtell'l'ollljll'lurryltJglt1t 
1 10100 
v w 
modO 
1 0 
<I. 


LAlIf-loJdAHw"ltllJOI 
10011111 
IClAotJlelhrouohUllyr'!lhl 
110100y 
•• 
modO 
1 I 
~-SloleAHlnloltJOS 
10011110 


'UItIf-PushIlJOS 
10011100 
••• 
And: 


PWf-POJIIl,gs 
100 
II 
101 
Reg Imemory 
,ndfeglSler 
loellher 
a 0 
1000 
II W 
.od .., 


lmmelllille10 
'egISle"noemory 
100 
a 0 0 0 w 
modI 
0 a 
<I. 
O,I,!lw 
1 


tmmeO,JlelOilcwmutill0' 
00 
10010 
W 


TEST 
And lUflCUln 
10 Illgl. 
no rllull: 
ARITHMETIC 
fltg,SlellmemoryilndreglSler 
10000 
lOw 
.od .., <I• 
ADa· 
•••••: 
tmmelllilltdillililndretliSttrlmemo'y 
" 
1 1011 
w 
modO 
0 0 
<I. 
d,l'l'w 
1 


Reg Im.mllfYWIII'I,.glll.rIOfllht< 
0000000 
w Imod .., <I• 
tmmell",ed.lJ,nOacc 
•••m •••tillor 
1 0 
I 0 1 0 0 w 
dillil,t 
wI 


tmm.dlille10 
ffglst.,/lIIfmory 
100000 
S W 
moOO 
0 0 
<I. 
d,l, 
dal"llw 
01 


tmmedlJle10 
JCcumutJlO' 
0000010 
W 
dJlill'w 
1 
'" ." 
fIt\lfmemory 
,nd'egISlt< 
loellller 
0000 
I 0 d W 
mod .., <I. 
ADe - 1M wltll wry: 
tmmed"le10 
reljl'$lerll'\tfllO'y 
1000000 
•• 
I'l'IOdOOl 
<I. 


lIIo/m8f'l\Of)'WltI'Iflg,ll.rlO,llht, 
000100 
d w 
mod '.. 
<I. 
Immed"te1o 
JCcumulillor 
0000110", 
dill,tlw 
I 


Immad'ille 
10 'tljIlSltl/memory 
10 
a 0 0 0 s w ImodO 
I 0 
<I. 
lIi1lJ"sw 
01 


ImmellllletoiICcum 
•••••'or 
00010 
lOw 
I 
dillil!lw 
1 
,.. 
Eulalln 
••. : 


ReQlmemory,nllle!JlS1erlaelllle. 
001 
10 
0 d w 
'.. 
<I. 


lit 
-1IcAIlIM: 
lmmtlllille 
10 .eg,ste"mtfOlOry 
1 0 0 0 0 0 O. 
moll 1 1 0 ". 
dill'''. 
1 


AtljItslt'/memo.y 
11 
I I 1 \I 
w ImodO 
a 0 ". 
Immed'ille10 
JCc •••mulJl0' 
0011010 
W 


Reo'II.t 
01000 
'.. 


W*ASClIJdjusllor 
alld 
001 
101111 


WoDlc,mal 
JdJust 
lor Jlld 
00100111 


IUI·IuIMtKt: 


Reg/m.mOIY 
.nd't\liSlttlo,lthtr 
001010 
d w 
.od 
'.. ". 
tmmtdl"e 
I.om 
'''Il1sltf/m.mory 
1 0 0 0 0 0 s w 
modI 
0 1 
<I. 
d,l.rllw-Ol 
STRING ••••• IPUUTlOIil 


tmm.di.t.lromiICCum 
••••iIlor 
001011 
0 w 
1Ii11,.lw-I 
RE'-Rep 
•• , 
1 1 " 
0 OIl 


MOVS·Mow 
byle/wo.d 
1010010 
W 


UI 
- •••••• 
wltll 
.,... 
CM'S·Comp,rtbYIf/worll 
101001 
I w 


AIO/fllllIIOIyand 
rIgISltttO'lttl" 
000110 
d. 
.od '.. 
<I. 
SCAS=Sunbyltfwo.d 
1010 
I I 
1 w 


lm/Mdi." 
lTom ,,,,isl./mamory 
1 0 0 0 0 0 S .!modO 
1 I 
<I. 
dillJ 
dJI"'Sw-OI 
lOOS·loJllbyl.f..cj,oALlA)f 
1010110 
W 


Immel!I."fromiICcumulalor 
10001110 
wi 
". 
datJ,lw.1 
STOS-5olo, 
bylffwd 
I.om 
ALIA 
1010 
I 0 I w 


Mnemonics 
©Intel, 
1978 


16543210 
16543210 


11101000 
dlsp·low 


11111111 
mod 010 
rIm 


110011010 
I 
ollseHow 
I 
seg·low 


111111111 
mod 011 
rIm 


Dlrecl w,lhlnsegmeni 


Indirect 
wIthin 
segment 


D,rect 
Intersegment 


JMP 
~ Uncondi!ion.1 
Jump: 


Drrect within 
segmenT 


D,rect WIThin segment·shorl 


Indl,ect 
within 
segmenT 


Direct Inlersegmenl 


16543210 
16543210 
16543110 


dlsp-hlgh 
JIilB/J,ucJump 
on not belowfabove 
101110011 
1 
d,sp 
1 
or eQual 


JNIE/JA 
Jump 
on nol below 
or 
[01 
1 tOll 
1 I 
dlsp 
I 
eQual/above 
ollsel-hlgh 
JNP/JPO-Jump 
on nol pal/par 
odd 
101111011 
I 
d,sp 
I 


seg·hlgh 
JNO Jump 
on not overllow 
101110001 
I 
d.sp 
1 


JNS 
Jump 
on noT Sign 
1011110011 
dlsp 
J 


LOOP LoopCX 
limes 
11 
1 1000 
I 0 1 
dlsp 
I 


1 


LOOPZlLOOPE loop 
while 
ltIo/eQual 
111100001 
I 
dlsp 
I 
dlsp·hlgh 
lOOPNZllOOPNELoop 
.•••.hllenOI 
111100000 
I 
dlSP 
I 
zero/eQual 


JCIl 
Jump 
on ex zero 
111100011 
I 
dlsp 
I 


O!lset.h'llhw:=J 
INT 
Inlerrupt 


seg''''lgh~ 
Typespecilled 
111001101 
I ~ 
TypeJ 
111001100 
I 


INTO InterfUpl0noverllow 
111001110 
1 


IIIU 
Interrupt 
reI urn 
111001111 
1 


REf 
- Return 
Irom CAll: 


Wllhln segmenl 
",Il~'~o~o_o~o~, 
_' 
,-I 
~ 
__ 
-~ 


W.thln 
seg 
addmg 
Immed to SP 
11 
1 0 0 0 0 1 0 I 
data·low 
data.""g£] 


Intelsegmenl 
l=ll~'~O~O~'~o~, 
_' 
f-I 
~ 
~ 


Intersegment. 
adding 
Immediate 
10 Spill 
001010 
I 
data·low 
data·hlgh 
I 


JE/Jl 
Jump 
on eQualizero 
101 
I 10100 
I 
dlsp 


JL/JNGE o~Ue~~a~n less/not 
!lre.lel 
10 
1 1 1 1 1 00 I 
dlsp 


JLE/JNG o~~~~ron 
less or eQual/not 
10 
1 1 1 1 1 1 0 I 
dlSp 


JI/JNAE o~ue~~a~n below/not 
above 
10 
1 I 1 00 
1 0 I 
dlsp 


JIE/JNA 
n~i~go~~ 
below or eQual I 
0 1 1 1 0 1 1 0 
dlsp 


JP/JPE 
Jump 
on paflly/parllyeven 
01111 
C 1 0 
dlsp 


Jo 
Jump 
on over/low 
101110000 
I 


JS=Jumpons'!ln 
101111000 
I 


JNE/JNZ-Jump 
on not eQual/nOlzero 
01 
1 1 0 1 0 1 


JNL/JGE o~ue~~a~n not less/greater 
0 1 1 1 1 1 0 1 


JNLE/JG 9~~~~ron 
not less 01 eQual! 
101 
1 1 1 1 1 1 I 


11 
~ 1 0 1 0 0 1 I 
dlsp-Iow 


1110101t 
dlsp 


11111111 
mod 
1 00 
rIm 


1111010101 
ollsel·tow 
I 
seg·low 


111111111 
(mOd 
101 
rim 


Al 
= 6-bit 
accumulator 


AX = 16-bit 
accumulator 


CX = Count 
register 


OS 
= Data 
segment 


ES 
= Extra 
segment 


Above/below 
refers 
to 
unsigned 
value. 


Greater 
= more 
positive; 


less 
= Jess 
positive 
(more 
negative) 
signed 
values 


if d = 1 then "to" 
reg; jf d = 0 then "from" 
reg 


if w = l1hen 
word instruction; 
jf w = 0 then byle inSlruClion 


if mod 
= 11 then 
rIm 
is 
treated 
as 
a REG 
field 


if mod 
= 00 then 
OISP 
= 0·. 
disp·low 
and 
disp·hlgh 
are 
absent 


if mod 
= 01 
then 
OISP 
= disp-Iow 
sign-extended 
to 
16-bits. 
disp-high 
IS absent 


if mod'" 
10 then 
OISP 
'" disp·high: 
disp·low 


if r 1m ~ 000 then EA ~ (BX) • (51) • olSP 
if rim: 
001 then EA ~ (BX) • (01) • olSP 
if rim ~ 010 then EA : (BP) • (51) • olSP 


if rim ~ 011 then EA : (BP) • (01) • olSP 
if rim: 
100 then EA ~ (51) • olSP 


if r 1m ~ 101 then EA ~ (01) • olSP 
if r 1m ~ 110 then EA : (BP) • olSp· 


if rim: 
111 then EA ~ (BX) • OISP 


OISP 
follows 
2nd· 
byte 
of 
instruction 
(before 
aata 
if required) 


CLCClearcarry 


CMC Complement 
carry 


STCSetcarry 


CLO Cleal 
direction 


STOSeldlfeclron 


CllClearlOlerrupt 


STISellOlerrupl 


HUHalt 


WAlT Wall 


ESCEscape(10elternaldevlcel 


LOCK Bus lock prell. 


1 1 11 
0 1 0 1 


1 11 
1 1001 


11111100 


1 1 1 11 
101 


11111010 


11111011 


11 
11 
10100 
I 


110011011 
! 


111011 
•• 
lFo~imJ 
~OYJ 


il s:w:;: 
01 
then 
16 
bits 
01 immediate 
data 
form 
the 
operand. 


II s:w 
= 11 
then 
an 
immediate 
data 
byte 
IS Sign 
extended 
to 


form 
the 
16-M 
operand 


il v = 0 then 
"count" 
= 1; 
If v = 1 then 
"count" 
in (el) 


x = don't 
care 


I IS used 
for 
stnng 
pnmltlves 
for 
companson 
WIth Z.F FLAG. 


SEGMENT 
OVERRIDE 
PREFIX 


10 0 1 reg 1 1 01 


16-Billw: 
11 
B-Bit (w ~ OJ 


000 
AX 
000 
Al 


001 
CX 
001 
Cl 
010 
OX 
010 
ol 
011 
BX 
011 
Bl 
100 
SP 
100 
AH 
101 
BP 
101 
CH 
no 
51 
110 
oH 
111 
01 
111 
BH 


Segment 


00 
ES 


01 
CS 
10 55 
11 
OS 


10-1:511 MMU~ MI\,;KUPROCESSOR 


(M8086) 
MILITARY 


• Direct Addressing 
Capability 
to 1 
• 8·and 16·Bit Signed and Unsigned 
MByte of Memory 
Arithmetic 
in Binary or Decimal 


Assembly 
Language Compatible 
with 
Including 
Multiply 
and Divide 
• 
8080/8085 
• 5 MHz Clock Rate 


• 14 Word, By 16·Bit Register Set with 
MULTIBUS™ System 
Compatible 
Symmetrical 
Operations 
• Interface 


• 24 Operand Addressing 
Modes 
• Military Temperature 
Range: 
• Bit, Byte, Word, and Block Operations 
-55°C to +125°C 


The Intel~ 
Military 
iAPX 86/10 is a new generation, 
high 
performance 
16-bit 
microprocessor 
implemented 
in N-channel, 


depletion 
load, silicon 
gate technology 
(HMOS). 
and packaged 
in a 40-pin 
CerDIP package. 
The processor 
has attributes 
of 
both 8- and 16-bit microprocessors. 
It addresses 
memory 
as a sequence 
of 8-bit bytes, but has a 16-bit wide physical 
path to 
memory 
for 
high 
performance. 


DATA 
POINTER. 
AND 


INDEX 
REGS 


(' 
WORDS) 


BUS 
INTERFACE 
UNIT 


RElOCATtON 
REGISTER 
FILE 


SEGMENT 


REGISTERS 


AND 


INSTRUCTION 


POINTER 


15 WORDSI 
GND 
VCC 


AD14 
ADTS 


A013 
A16/S3 


AD12 
A17/54 


A011 
A18/55 


AD10 
A19/S6 


AD9 
BHE/S7 


AD8 
MNIMX 


AD7 
Rii 


AD6 
RO/GTO 
(HOLD) 


ADS 
RO/Gn 
(HlOA) 


AD" 
LOCK 
(WR) 


AD3 
52 
(MliO) 


AD2 
51 
(DT/R) 


AD' 
So 
(DEN) 


ADO 
aso 
(ALE) 


NMI 
as, 
(INTA) 


INTR 
TEST 


CLK 
READY 


GND 
RESEr 


T£51-_ 
INT-_ 
NMI-- 


Inte' 
Corporation 
A •• um •• 
No R.sponsibilty 
tor the Ua. 
of Any Circuitry 
Other 
Than Circujlr~ 
Embodied 
in an Intel Product. No Other 
Circuit 
Patent 
Licenl.1 .,. 
Implied. 


'!lINTEL CORPORATION. 1980 
8-25 
AFN-Q'237B 


inter 


18086 
16-81T HMOS MICROPROCESSOR 


INDUSTRIAL 
• Industrial Grade Temperature 
• 24 Operand Addressing Modes 


Range: -40°C 
to +85°C 
• Bit, Byte, Word, and Block Operations 


• Direct Addressing Capability to 1 
MByte of Memory 
• 8· and 16·Bit Signed and Unsigned 


Arithmetic in.Binary or Decimal 
• Assembly Language Compatible 
with 
Including Multiply and Divide 


MCS·80,85® 
• 5 MHz Clock Rate 
• 14 Word, By 16·Bit General Register 
• MUL TIBUS™ System Compatible 
Set 
Interface 


The Intel'" Industrial iAPX 86/10 
is a new generation, high performance microprocessor 
implemented in N-channel, 


depletion load, silicon gate technology (HMOS), and packaged in a 40-pin CerDIP package. The processor has attri· 
butes of both 8· and 16·bit microprocessors. 
It addresses memory as a sequence of 8-bit bytes, but has a 16·bit wide 
physical path to memory for high performance. 


EXECUTION 
UNIT 
BUS 
INTERFACE 
UNIT 


RelOCATION 
I 


REGISTER 
FILE 
REGISTER 
FlU 


oAYA. 


POINTER. 
AND 


INDEX 
REGS 


'I WORDS) 
Vcc 


AD15 


A16/$3 


AU/54 


A18/55 


A191S6 


BHE/S7 


MNfMX 


AD 


AD/GTO 
(HOLD) 


iiQ/m (HLDA) 


LOCK 
(iNA) 


52 
(M/iO) 


51 
(DT/A) 


so 
(DEN) 


aso 
(ALE) 


as, 
(INTA) 


TEST 


READY 


RESET 


TEST-_ 
'0'-_ 
OM'-_ 


iAPX 88/10 
(8088) 
8-BIT HMOS MICROPROCESSOR 


• 8·Bit Data Bus Interface 


• 16·Bit Internal Architecture 


• Direct Addressing Capability to 1 Mbyte 
of Memory 


• Direct Software Compatibility with 
iAPX 86/10 (8086 CPU) 


• 14-Word by 16-Bit Register Set with 
Symmetrical Operations 


• 24 Operand Addressing Modes 


• Byte, Word, and Block Operations 


• 8-Bit and 16-Bit Signed and Unsigned 
Arithmetic in Binary or Decimal, 
Including Multiply and Divide 


• Compatible with 8155·2, 8755A·2 and 
8185·2 Multiplexed 
Peripherals 


The Intel~ iAPX 88/10 is a new generation, 
high performance 
microprocessor 
implemented 
in N-channel, 
depletion 
load, 
silicon 
gate technology 
(HMOS), 
and packaged 
in a 40-pin 
CerDIP package. 
The processor 
has attributes 
of both 8- and 
16-bit 
microprocessors. 
It is directly 
compatible 
with 
iAPX 86/10 software 
and 8080/8085 
hardware 
and peripherals. 


C·BUS 
MIN 
[MAX 1 
MODE 
MODE 


GND 
Vee 


AI. 
AIS 
INSTRUCTION 
A13 
A161S3 
STREAM 
BYTE 
OUEUE 
AI' 
A171$4 


All 
A18/SS 
ES 
Al0 
A19/S6 


BUS 
CS 
A9 
SSO 
(HIGH) 


INTERFACE 
SS 
A8 
MN/MX 
UNIT 
OS 
AD7 
RD 


IP 
ADS 
HOLD 
(ROllrfO) 


ADS 
HlDA 
(I!lllm) 


A·BUS 
AD. 
m 
(lOCK) 


AD3 
loiM 
(80) 


AD' 
DTiR 
(51) 


AH 
Al 
ADI 
Jmj 
(SO) 
BH 
Bl 
ADO 
ALE 
(aSO) 


CH 
Cl 


DH 
Dl 
NMI 
lIITA 
(aSl) 
EXECUTION 


UNIT 
SP 
INTR 
fm 


BP 
ClK 
READY 


SI 
GND 
RESET 


01 
FLAGS 


inter 


Table 1. Pin Description 


The following pin function descriptions are for 8088 systems in either minimum or maximum mode. The "local bus" in 
these descriptions is the direct multiplexed bus interface connection to the 8088 (without regard to additional bus 
bUffers). 


Symbol 
Pin No. 
Type 
Name 
and Function 


AD7-ADO 
9-16 
1/0 
Address 
Data 
Bus: 
These 
lines 
constitute 
the 
time 
multiplexed 
memory/IO 


address 
(T1) and data (T2, T3, Tw, and T4) bus. These lines are active 
HIGH and 


float to 3-state OFF during 
interrupt 
acknowledge 
and local bus "hold 
acknowl- 


edge". 


A15-A8 
2-8,39 
0 
Address 
Bus: 
These lines provide 
address 
bits 8 through 
15 for the entire 
bus 


cycle 
(T1- T4). These 
lines 
do not have to be latched 
by ALE to remain 
valid. 


A 15-A8 
are active 
HIGH and float to 3-state 
OFF during 
interrupt 
acknowledge 


and local 
bus "hold 
acknowledge". 


A19/56, 
A18/55, 
34-38 
0 
Address/Status: 
During 
T1, these 
are the four 
A17/54, 
A16/53 
most significant 
address 
lines for memory 
op- 
erations. 
During 
I/O operations, 
these 
lines are 
LOW. During 
memory 
and I/O operations, 
status 
information 
is available 
on these 
lines 
during 
T2, T3, Tw, and T4. 56 is always low. The status of 
.. 
53 
CHARACTERISTICS 


the interrupt 
enable 
flag 
bit (55) is updated 
at 
OllOW) 
0 
AllemateD". 


0 
, 
Stick 


the beginning 
of each clock 
cycle. 54 and 53 are 
' (HIGH) 
0 
Code 
or None 
, 
, •... 


encoded 
as shown. 
S6 
'$ 0 lLOWI 


This 
information 
indicates 
which 
segment 
reg- 


ister is presently 
being 
used for data accessing. 


These lines float to 3-state OFF during 
local bus 


"hold 
acknowledge". 


RD 
32 
0 
Read: 
Read strobe 
indicates 
that the processor 
is performing 
a memory 
or I/O 


read cycle, 
depending 
on the state of the 10/~ pin or 52. This signal 
is used to 


read devices 
which 
reside on the 8088 local bus. RD is active 
LOW during 
T2, T3 


and Tw of any read cycle, and is guaranteed 
to remain 
HIGH in T2 until the 8088 


local 
bus has floated. 


This signal 
floats 
to 3-state 
OFF in "hold 
acknowledge". 


READY 
22 
I 
READY: 
is the acknowledgement 
from the addressed 
memory 
or I/O device that 


it will 
complete 
the data transfer. 
The RDY signal 
from 
memory 
or I/O is syn- 


chronized 
by the 8284 clock 
generator 
to form 
READY. This 
signal 
is active 


HIGH. The 8088 
READY 
input 
is not synchronized. 
Correct 
operation 
is not 


guaranteed 
if the set up and hold 
times 
are not met. 


INTR 
18 
I 
Interrupt 
Request: 
is a level triggered 
input 
which 
is sampled 
during 
the last 


clock cycle of each instruction 
to determine 
if the processor 
should 
enter into an 
interrupt 
acknowledge 
operation. 
A subroutine 
is vectored 
to via an interrupt 


vector 
lookup 
table 
located 
in system 
memory. 
It can be internally 
masked 
by 


software 
resetting 
the interrupt 
enable 
bit. INTR is internally 
synchronized. 
This 


signal 
is active 
HIGH. 


TE5T 
23 
I 
TEST: 
input 
is examined 
by the "wait 
for test" 
instruction. 
If the TE5T input 
is 


LOW, execution 
continues, 
otherwise 
the processor 
waits in an "idle" 
state. This 


input 
is synchronized 
internally 
during 
each clock 
cycle on the leading 
edge of 


CLK. 


NMI 
17 
I 
Non-Maskable 
Interrupt: 
is an edge 
triggered 
input 
which 
causes 
a type 
2 


interrupt. 
A subroutine 
is vectored 
to via an interrupt 
vector 
lookup 
table located 


in system 
memory. 
NMI is not maskable 
internally 
by software. 
A transition 
from 


a LOW to HIGH initiates 
the interrupt 
at the end of the current 
instruction. 
This 


-, 
input 
is internally 
synchronized. 


inter 


Symbol 
Pin No. 
Type 
Name and Function 


RESET 
21 
I 
RESET: causes the processor 
to immediately 
terminate 
its present 
activity. The 


signal must be active HIGH for at least four clock cycles. It restarts execution, 
as 


described 
in the instruction 
set description, 
when RESET returns 
LOW. RESET 
is internally 
synchronized. 


eLK 
19 
I 
Clock: provides 
the 
basic 
timing 
for the 
processor 
and 
bus controller. 
It is 


asymmetric 
with 
a 33% duty cycle to provide 
optimized 
internal 
timing. 


Vee 
40 
Vee: 
is the +5V 
±10% 
power 
supply 
pin. 


GND 
1,20 
GND: are the ground 
pins. 


MN/MX 
33 
I 
Minimum/Maximum: 
indicates 
what 
mode the processor 
is to operate 
in. The 


two modes 
are discussed 
in the following 
sections. 


The following 
pin function 
descriptions 
are for the 8088 minimum 
mode (i.e., MNIMX = Vee). Only the pin functions 
which 


are unique 
to minimum 
mode 
are described; 
all other 
pin 
functions 
are as described 
above. 


101M 
28 
0 
Status Line: is an inverted 
maximum 
mode 
S2. It is used to distinguish 
a 


memory 
access from 
an 1/0 access. 
101M becomes 
valid in the T4 preceding 
a 
bus cycle and remains 
valid until the final T4 of the cycle (1/0=HIGH, 
M=LOW). 


101M floats 
to 3-state 
OFF in local 
bus "hold 
acknowledge". 


WR 
29 
0 
Write: strobe indicates 
that the processor 
is performing 
a write memory or write 


1/0 cycle, depending 
on the state of the 101M signal. 
WR is active for T2, T3, and 


Tw of any write cycle. It is active LOW, and floats to 3-stateOFF 
in local bus "hold 


acknowledge". 


INTA 
24 
0 
INTA: is used as a read strobe for interrupt 
acknowledge 
cycles. It is active LOW 


during 
T2, T3, and Tw of each interrupt 
acknowledge 
cycle. 


ALE 
25 
0 
Address Latch Enable: 
is provided 
by the processor 
to latch the address 
into 


the 8282/8283 address 
latch. It is a HIGH pulse active during 
clock 
low of T1 of 


any bus cycle. 
Note that ALE is never floated. 


DT/R 
27 
0 
Data Transmit/Receive: 
is needed 
in a minimum 
system that desires to use an 
8286/8287 data bus transceiver. 
It is used to control 
the direction 
of data flow 
through 
the transceiver. 
Logically, 
DT/R 
is equivalent 
to 51 in the maximum 
mode, 
and its timing 
is the same as for 
101M (T=HIGH, 
R=LOW). 
This signal 
floats 
to 3-state 
OFF in local 
"hold 
acknowledge". 


DEN 
26 
0 
Data Enable: 
is provided 
as an output 
enable 
for the 8286/8287 in a minimum 
system which 
uses the transceiver. 
DEN is active LOW during 
each memory 
and 


1/0 access, 
and for INTA cycles. 
For a read or INTA cycle, 
it is active from 
the 


middle 
of T2 until the middle 
of T4, while 
for a write 
cycle, 
it is active from the 
beginning 
ofT2 until the middle ofT4. DEN floats to 3-state OFF during 
local bus 


"hold 
acknowledge". 


HOLD,HLDA 
30,31 
1,0 
HOLD: indicates 
that 
another 
master 
is requesting 
a local 
bus "hold". 
To be 


acknowledged, 
HOLD must be active HIGH. The processor 
receiving 
the "hold" 


request will issue HLDA (HIGH) as an acknowledgement, 
in the middle of a T4 or 


TI clock 
cycle. Simultaneous 
with the issuance 
of HLDA the processor 
will float 


the 
local 
bus and 
control 
lines. 
After 
HOLD 
is detected 
as being 
LOW, the 
processor 
lowers 
HLDA, and when the processor 
needs to run another 
cycle, it 


will 
again 
drive the local 
bus and control 
lines. 


Hold 
is not 
an 
asynchronous 
input. 
External 
synchronization 
should 
be 


provided 
if the system 
cannot 
otherwise 
guarantee 
the set up time. 


SSO 
34 
0 
Status line: is logically 
equivalent 
to SO in th~ 
loiii 
DTl1i: ••• 


CHARACTI."'STIQ 


maximum 
mode. The combination 
of SSO, 101M 
leHIGH) 
0 . 
Inl••• uptA<;k••••.••ledg• 


and DT/R allows 
the system 
to completely 
de- 


0 
, 
FlndllO_t 
, 
0 
~~::ellOpotl 
, 


code the current 
bus cycle status. 


OILOWI 
0 
0 
Cooeacc&Ss 


0 
0 
, 
Reldmemo.y 


0 
, 
0 
~.:~~:emo<y 
0 
, 
, 


inter 


Table 1. Pin Description 
(Continued) 


The following pin function descriptions 
are for the 8088, 8228 system in maximum mode (i.e., MN/MX=GND.) 
Only the pin 


functions 
which are unique to maximum mode are described; all other pin functions 
are as described above. 


Symbol 
Pin No. 
Type 


52, 51, SO 
26-28 
0 


RQ/GTO, 
30, 31 
I/O 
RQ/GT1 


Status: 
is active 
during 
clock 
high 
of T4, T1, 


and 
T2, and 
is returned 
to the 
passive 
state 
(1,1,1) during 
T3 or during 
Tw when 
READY is 
HIGH. This status 
is used by the 8288 bus con- 
troller 
to generate 
all memory 
and I/O access 
control 
signals. 
Any change 
by 52, 51, or SO 
during 
T4 is used to indicate 
the beginning 
of a 
bus cycle, and the return 
to the passive state in 
T3 or Tw is used to indicate 
the end of a bus 
cycle. 


These signals 
float to 3-state OFF during 
"hold 
acknowledge". 
During 
the first clock 
cycle after 
RESET becomes 
active, these signals 
are active 
HIGH. After this first 
clock, 
they float to 3-state 
OFF. 


" 
ii 
iii 
CHAR~~ 


~lLOW) 
0 
0 
InltffuptAeknowltdg. 


0 
, 
A•• d IIOpo<t 


0 
, 
0 
::~:'lIOpofl 
0 
, 
, 


I (HIGH) 
0 
0 
CofH .ccen 
, 
0 
0 
..... _""'" 
, 
, 
0 
W"t.rnemory 
, 
, 
, 
P••••••• 


Request/Grant: 
pins are used by other local bus masters to force the processor 


to release the local bus at the end of the processor's 
current 
bus cycle. Each pin 


is bidirectional 
with RQ/GTO having 
higher 
priority 
than RQ/GT1. RQ/GT has an 


internal 
pull-up 
resistor, 
so may be left 
unconnected. 
The 
request/grant 
se- 
quence 
is as follows 
(See Figure 
8): 


1. A pulse of one CLK wide from another 
local bus master indicates 
a local bus 


request 
("hold") 
to the 8088 (pulse 
1). 


2. During 
a T4 or TI clock 
cycle, 
a pulse one clock 
wide from 
the 8088 to the 


requesting 
master (pulse 2), indicates 
that the 8088 has allowed 
the local bus 


to float 
and that it will enter the "hold 
acknowledge" 
state at the next CLK. 


The CPU's 
bus interface 
unit 
is disconnected 
logically 
from 
the local 
bus 


during 
"hold 
acknowledge". 
The same rules as for HOLD/HOLDA 
apply as for 


when 
the bus is released. 


3. A pulse one CLK wide from the requesting 
master indicates 
to the 8088 (pulse 


3) that the "hold" 
request 
is about 
to end and that the 8088 can reclaim 
the 


local 
bus at the next CLK. The CPU then 
enters 
T4. 


Each master-master 
exchange 
of the local 
bus is a sequence 
of three 
pulses. 


There 
must 
be one idle CLK cycle 
after each bus exchange. 
Pulses are active 


LOW. 


If the request 
is made while the CPU is performing 
a memory 
cycle, it will release 


the local 
bus during 
T4 of the cycle when 
all the following 
conditions 
are met: 


1. Request 
occurs 
on or before 
T2. 


2. Current 
cycle 
is not the low bit of a word. 


3. Current 
cycle 
is not 
the 
first 
acknowledge 
of 
an interrupt 
acknowledge 


sequence. 


4. A locked 
instruction 
is not currently 
executing. 


If the local 
bus is idle when 
the request 
is made the two 
possible 
events 
will 
follow: 


1. Local 
bus will 
be released 
during 
the next clock. 
2. A memory 
cycle will start within 
3 clocks. 
Now the four 
rules for a currently 


active 
memory 
cycle 
apply 
with 
condition 
number 
1 already 
satisfied. 


inter 


Symbol 
Pin No. 
~pe 
Name and Function 


LOCK 
29 
0 
LOCK: indicates that other system bus maste~ 
not to gain control of the 
system bus while LOCK is active (lOW). The LOCK signal is activated by the 


Ii 
"LOCK" prefix instruction and remains active until the completion of the next 
instruction. This signal is active lOW, and floats to 3-state off in "hold acknowl- 
edge". 


a51, a50 
24, 25 
0 
Queue StatuI: 
provide status to allow external 
OS1 
000 
CHAJtACTVI;I'TICI 
tracking of the internal 8088 instruction queue. 
OllOW) 
0 
No opel.llon 
0 
, 
Fir"byll 
01 opl;ode 
f,om queue 


The queue status is valid during the ClK cycle 
1 (HIGHI 
0 
Emplylhlqveul 
, 
, 
SUbMqu,nt 
byt. 
Irom qu_ 


after which the queue operation is performed. 
- 
34 
0 
Pin 34 is always high in the maximum mode. 
't 


Memory Organization 


The processor provides a 20-bit address to memory which 
locates the byte being referenced. The memory is orga- 
nized as a linear array of up to 1 million bytes, addressed 
as OOOOO(H) 
to FFFFF(H).The memory is logicall'y divided 
into code, data, extra data, and stack segments of up to 
64K bytes each, with each segment falling on 16-byte 
boundaries. (See Figure 3.) 


All 
memory 
references 
are made relative 
to base 
addresses contained in high speed segment registers. The 
segment types were chosen based on the addressing 
needs of programs. The segment registerto be selected is 
automatically chosen according to the rules of the follow- 
ing table. All information in one segment type share the 
same logical attributes (e.g. code or data). By structuring 
memory into relocatable areas of similar characteristics 
and by automatically selecting segment registers, pro- 
grams are shorter, faster, and more structured. 


Word (16-bit) operands can be located on even or odd ad- 
dress boundaries. For address and data operands, the 
least significant 
byte of the word is stored in the lower 
valued address location and the most significant byte in 


~FFFFFH 


6'!.. D}CODE SEGMENT 
-L 
XXXXOH 


} STACK SEGMENT 


SEGMENT 
REGISTER 
FilE 


CS 
SS 
OS 


ES 


} DATA SEGMENT 


} EXTRA DATA SEGMENT 


matically 
execute two fetch or write cycles for 16-bit 
operands. 


Certain locations 
in memory are reserved for specific 
CPU operations. 
(See Figure 4.l 
Locations 
from ad- 
dresses 
FFFFOH through 
FFFFFH 
are reserved for 
operations including a jump to the initial system initial- 
ization routine. Following RESET, the CPU will always 
begin execution at location 
FFFFOH where the jump 
must be located. Locations OOOOOH 
through 003FFH are 
reserved for interrupt 
operations. 
Four·byte pointers 
consisting of a 16-bit segment address and a 16-bit off- 
set address direct program flow to one of the 256 possi- 
ble interrupt service routines. The pointer elements are 
assumed to have been stored at their respective places 
in reserved memory prior to the occurrence of inter· 
rupts. 


Minimum 
and Maximum 
Modes 


The requirements 
for supporting 
minimum and maxi- 
mum 8088 systems are sufficiently 
different 
that they 
cannot 
be done efficiently 
with 40 uniquely 
defined 
pins. Consequently, the 8088 is equipped with a strap 
pin (MN/MX) which defines the system configuration. 
The definition 
of a certain subset of the pins changes, 
dependent on the condition of the strap pin. When the 
MN/MX pin is strapped to GND, the 8088 defines pins 24 
through 31 and 34 in maximum mode. When the MN/MX 
pin is strapped to Vcc, the 8088 generates bus control 
signals itself on pins 24 through 31 and 34. 


RESET 
.OOTSTRAP 
FFFFFH 


PROGRAM 
JUMP 
FFFFOH 
•·• 


INTERRUPT 
POINTER 
3FFH 


FOR TYPE 
255 
3FOH 
··· 
INTERRUPT 
POINTER 
7H 


FOR 
TYPE 
1 
'H 
INTERRUPT 
POINTER 
3H 


FOR TYPE 
0 
OH 


Memory 
Segment 
Register 
Segment 
Reference 
Need 
Used 
Selection 
Rule 
Instructions 
CODE (CS) 
Automatic with all instruction prefetch. 
Stack 
STACK (55) 
All stack pushes and pops. Memory references relative to BP 
base register except data references. 


Local Data 
DATA(DS) 
Data references when: relative to stack, destination of string 
operation, or explicitly overridden. 


External (Global) Data 
EXTRA (ES) 
Destination of string operations: Explicitly selected using a 
segment override. 


inter 


The minimum 
mode 8088 can be used with either a 
multiplexed or demultiplexed 
bus. The multiplexed bus 
configuration 
is compatible 
with the MCS-85™ multi- 
plexed bus peripherals (8155, 8156, 8355, 8755A, and 
8185).This configuration (See Figure 5) provides the user 
with a minimum chip count system. This architecture 
provides the 8088 processing power in a highly integrated 
form. 


The demultiplexed 
mode requires one latch (for 64K ad- 
dressability) 
or two latches (for a full megabyte of ad- 
dressing). A third latch can be used for buffering if the 
address bus loading requires it. An 8286 or 8287 trans- 
ceiver can also be used if data bus buffering is required. 
(See Figure 6'>The 8088 provides DEN and DT/R to con- 


trol the transceiver, and ALE to latch the addresses. 
This configuration 
of the minimum mode provides the 
standard demultiplexed 
bus structure 
with heavy bus 
buffering and relaxed bus timing requirements. 


The maximum mode empioys the 8288 bus controller. 
(See~ure 
7.> The 8288 decodes status lines SO,51, 
and 52, and provides the system with all bus control 
signals. Moving the bus control 
to the 8288 provides 
better source and sink current capability to the control 
lines, and frees the 8088 pins for extended large system 
features. Hardware lock, queue status, and two request/ 
grant interfaces are provided by the 8088 in maximum 
mode. These features allow co-processors in local bus 
and remote bus configurations. 
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Figure 7. Fully Buffered 
System Using Bus Controller 
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Bus Operation 


The 8088 address/data 
bus is broken into three parts - 
the lower eight address/data 
bits (ADO-AD?), the middle 
eight address bits (A8-A15), and the upper four address 
bits (A16-A19). The address/data 
bits and the highest 


four address bits are time mUltiplexed. 
This technique 
provides 
the most 
efficient 
use of pins on the proc- 
essor, permitting 
the use of a standard 40 lead package. 


The middle eight address bits are not multiplexed, 
i.e. 


they remain valid throughout 
each bus cycle. 
In add i- 


tion, the bus can be demultiplexed 
at the processor with 
a single 
address 
latch 
if a standard, 
non-multiplexed 
bus is desired for the system. 


Each processor bus cycle consists 
of at least four elK 


cycles. These are referred to as T1, T2, T3, and T4. (See 
Figure 8). The address 
is emitted 
from the processor 


during T1 and data transfer occurs on the bus during T3 
and T4. T2 is used primarily for changing the direction 
of 


the bus during read operations. 
In the event that a "NOT 


READY" indication 
is given by the addressed 
device, 
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"wait" states (Tw)are inserted between T3 and T4. Each 
inserted "wait" state is of the same duration as a ClK 
cycle. 
Periods can occur 
between 8088 driven bus 
cycles. These are referred to as "idle" states (Ti),or inac· 
tive ClK cycles. The processor uses these cycles for in· 
ternal housekeeping. 


During T1 of any bus cycle, the ALE (address latch 
enable) signal is emitted (by either the processor or the 
8288 bus controller, depending on the MN/IVIXstrap). At 
the trailing edge of this pulse, a valid address and cer· 
tain status information for the cycle may be latched. 


Status bits SO,81, and S2are used by the bus controller, 
in maximum mode, to identify the type of bus transac· 
tion according to the following table: 


S2 
S1 
SO 
CHARACTERISTICS 


o (low) 
0 
0 
Interrupt Acknowledge 
0 
0 
1 
Read I/O 
0 
1 
0 
Write 110 
0 
1 
1 
Halt 
1(High) 
0 
0 
Instruction fetch 
1 
0 
1 
Readdata from memory 


1 
1 
0 
Write data to memory 
1 
1 
1 
Passive (no bus cycle) 


Status bits S3 through S6 are multiplexed 
with high 
order address bits and are therefore valid during T2 
through T4, S3 and S4 indicate which segment register 
was used for this bus cycle in forming the address ac· 
cording to the following table: 


S4 
S3 
CHARACTERISTICS 


o (low) 
0 
Alternate data (Extra Segment) 
0 
1 
Stack 
1(High) 
0 
Code or none 
1 
1 
Data 


S5 is a reflection of the PSW interrupt enable bit. S6 is 
always equal to O. 


1/0 Addressing 


In the 8088, I/O operations can address up to a maxi· 
mum .of64K 110 registers. The 110 address appears in the 
same format 
as the memory address on bus lines 
A15-AO. The address lines A19-A16 are zero in 110 
operations. 
The variable 
110 instructions, 
which 
use 


register DX as a pointer, have full address capability, 
while the direct I/O instructions directly address one or 
two of the 256 110 byte locations in page 0 of the 110 ad· 
dress space. 110 ports are addressed in the same man· 
ner as memory locations. 


Designers familiar with the 8085 or upgrading an 8085 
design should note that the 8085 addresses 110 with an 
8·bit address on both halves of the 16·bit address bus. 
The 8088 uses a full 16·bit address on its lower 16 ad· 
dress lines. 


Processor initialization or start up is accomplished with 
activation (HIGH) of the RESETpin. The 8088 RESET is 
required to be HIGH for greater than four clock cycles. 
The 8088 will terminate operations on the high·going 
edge of RESET and will 
remain dormant as long as 


RESETis HIGH. The low-going transition of RESETtrig- 
gers an internal reset sequence for approXimately 7 
clock cycles. After this interval the 8088 operates nor- 
mally, beginning with the instruction 
in absolute loca- 


tion FFFFOH.(SeeFigure 4'> 
The RESETinput Is Inter- 


nally synchronized to the processor clock. At initializa- 
tion, the HIGH to lOW transition of RESETmust occur 
no sooner than 50 JLsafter power up, to allow complete 
initialization of the 8088. 


If INTR is asserted sooner than nine clock cycles after 
the end of RESET,the processor may execute one in- 
struction before responding to the interrupt. 


All 3-state outputs float to 3-state OFF during RESET. 
Status is active in the idle state for the first clock after 
RESETbecomes active and then floats to 3-state OFF. 


Interrupt 
Operations 


Interrupt operations fall into two classes; software or 
hardware initiated. The software initiated interrupts and 
software aspects of hardware interrupts are specified in 
the instruction set description in the 8086 Family User's 
Manual. Hardware interrupts can be classified as non- 
maskable or maskable. 


Interrupts result in a transfer of control to a new pro- 
gram location. A 256 element table containing address 
pointers 
to the interrupt 
service program locations 
resides in absolute locations 0 through 3FFH (see Fig- 
ure 4), which are reserved for this purpose. Each ele· 
ment in the table is 4 bytes in size and corresponds to 
an interrupt "type". An interrupting device supplies an 
8·bit type number, during the interrupt acknowledge se· 
quence, which is used to vector through the appropriate 
element to the new interrupt service program location. 


Non·Maskable 
Interrupt 
(NMI) 


The processor provides a single non·maskable interrupt 
(NMI) pin which has higher priority than the maskable in- 
terrupt request (INTR)pin. A typical use would be to actio 
vate a power failure routine. The NMI is edge-triggered 
on a lOW to HIGH transition. The activation of this pin 
causes a type 2 interrupt. 


NMI is required to have a duration in the HIGH state of 
greater than two clock cycles, but is not required to be 
synchronized to the clock. Any higher going transition 
of NMI is latched on·chip and will be serviced at the end 
of the current instruction 
or between whole moves (2 


bytes in the case of word moves)of a block type instruc· 
tion. Worst case response to NMI would be for multiply, 
divide, and variable shift 
instructions. 
There is no 
specification on the occurrence of the low·going edge; it 
may occur before, during, or after the servicing of NMI. 
Another high·going edge triggers another response if it 
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oounces on tne low-gOing edge to avoid tnggenng ex- 
traneous responses_ 


Maskable 
Interrupt (INTR) 


The 8088 provides a single interrupt request input (INTR) 
which can be masked internally by software with the 
resetting of the interrupt enable (IF) flag bit. The in- 
terrupt request signal is level triggered. It is internally 
synchronized during each clock cycle on the high-going 
edge of CLK.To be responded to, INTR must be present 
(HIGH) during the clock period preceding the end of the 
current instruction 
or the encl of a whole move for a 


block type instruction. 
During interrupt 
response se- 


quence, further interrupts are disabled. The enable bit is 
reset as part of the response to any interrupt (INTR, 
NMI, software interrupt, or single step), although the 
FLAGS register which is automatically pushed onto the 
stack reflects the state of the processor prior to the in- 
terrupt. Until the old FLAGS register is restored, the 
enable bit will be zero unless specifically 
set by an in- 
struction. 


During the response sequence (SeeFigure 9), the proc- 
essor executes two successive (back to back) interrupt 
acknowledge cycles. The 8088 emits the LOCK signal 
(maximum mode only) from T2 of the first bus cycle until 
T2 of the second. A local bus "hold" request will not be 
honored until the end of the second bus cycl'e. In the 
second bus cycle, a byte is fetched from the external in- 
terrupt system (e.g., 8259A PIC) which identifies 
the 


source (type) of the interrupt. This byte is multiplied by 
four and used as a pointer into the interrupt 
vector 
lookup table. An INTR signal left HIGH will be continual· 
Iy responded to within the limitations of the enable bit 
and sample period. The interr.upt return instruction 
in- 


cludes a flags pop which 
returns the status of the 


original interrupt enable bit when it restores the flags. 
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processor indicates that it is entering the HALT state.in 
one of two 
ways, depending 
upon which 
mode is 


strapped. In minimum mode, the processor issues ALE, 
delayed by one clock cycle, to allow the system to latch 
the halt status. Halt status is available on 10/M, DT/R, 
and SSO. In maximum mode, the processor issues ap- 
propriate HALT status on S2, S1, and SO,and the 8288 
bus controller issues one ALE. The 8088 will not leave 
the HALT state when a local bus hold is entered while in 
HALT. In this case, the processor reissues the HALT in- 
dicator at the end of the local bus hold. An interrupt re- 
quest or RESET will force the'8088 out of the HALT 
state. 


Read/Modify/Write 
(Semaphore) 
Operations 


via LOCK 


The LOCK status information is provided by the proc- 
essor when consecutive bus cycles are required during 
the execution of an instruction. This allows the proc- 
essor 
to 
perform 
read/modify/write 
operations 
on 


memory (via the "exchange 
register 
with 
memory" 
instruction), without another system bus master receiv- 
ing intervening memory cycles. This is useful in multi- 
processor system configurations 
to accomplish 
"test 


and set lock" operations. The rnrn< signal is activated 
(LOW) in the clock cycle following 
decoding 
of the 


LOCK prefix instruction. It is deactivated at the end of 
the last bus cycle of the instruction following the LOCK 
prefix. While LOCKis active, a request on a RQ/GTpin will 
be recorded, and then honored at the end of the LOCK. 


External Synchronization 
via TEST 


As an alternative to interrupts,- the 8088 provides a 
single software-testable input pin (TESn. This input is 
utilized by executing a WAIT instruction. 
The single 


\~--~/ 


Figure 9. Interrupt 
Acknowledge 
Sequence 


8-38 
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WAIT instruction Is repeatedly executed until the TEST 
Input goes active (LOW).The execution of WAIT does 
not consume bus cycles on~e the queue is full. 


If a local bus request occurs during WAIT execution, the 
80883-states all output drivers. If interrupts are enabled, 
the 8088 will recognize interrupts and process them. 
The WAIT instruction is then refetched, and reexecuted. 


Basic System Timing 


In minimum mode, the MN/MX pin is strapped to Vcc 
and the processor emits bus control signals compatible 
with the 8085 bus structure. 
In maximum. mode, the 
MN/MX pin is strapped to GND ana the processor emits 
coded status information which the 8288 bus controller 
uses to generate M ULTIBUS compati ble bus control 
signals. 


System Timing 
- 
Minimum 
Syste!T' 


(SeeFigure 8.l 


The read cycle begins in T1 with the assertion of the ad· 
dress latch enable (ALE) signal. The trailing (low going) 
edge of this signal is used to latch the address informa- 
tion, which is valid on the address/data bus (ADO-AD7) 
at this time, into the 8282/8283 latch. Address lines A8 
through A15 do not need to be latched because they re- 
main valid throughout the bus cycle. From T1 to T4 the 
10/M signal indicates a memory or I/O operation. At T2 
the address is removed from the address/data bus and 
the bus goes to a high impedance state. The read con- 
trol signal is also asserted at T2. The read (RD) signal 
causes the addressed device to enable its data bus 
drivers to the local bus. Some time later, valid data will 
be available on the bus and the addressed device will 
drive the READY line HIGH. When the processor returns 
the read signal to a HIGH level, the addressed device 
will 
again 
3·state 
its 
bus drivers. 
If 
a transceiver 
(8286/8287) is required to buffer the 8088 locai bus, 
signals DT/Rand DEN are provided by the 8088. 


A write cycle also begins with the assertion of ALE and 
the emission of the address. The 10/M signal is again 
asserted to indicate a memory or I/O write operation. In 
T2, immediately following 
the address emission, the 
processor emits the data to be written 
into the ad- 
dressed location. This data remains valid until at least 
the middle of T4. During T2, T3, and Tw, the processor 
asserts the write control signal. The write (WR) signal 
becomes active at the beginning of T2, as opposed to 
the read, which is delayed somewhat into T2 to provide 
time for the bus to float. 


The basic difference 
between the interrupt acknowl- 
edge cycle 
and a read cycle 
is that 
the 
interrupt 
acknowledge (INTA) signal is asserted in place of the 
read (RD) signal and the address bus is floated. (See 
Figure 9.l In the second of two successive INTA cycles, 
a byte of information is read from the data bus, as sup- 
plied by the interrupt system logic (i.e. 8259A priority in- 
terrupt controller). This byte identifies the source (type) 
of the interrupt. It is multiplied by four and used as a 
pointer into the interrupt vector lookup table, as de- 
scribed earlier. 


(See Figure 10.l 


For medium complexity systems, the MN/MX pin is con- 
nected to GND and the 8288 bus controller is added to 
the system, as well as an 8282/8283,Jatch for latching 
the system address; and an 8286/8287 transceiver to 
allow for bus loadin'g greater than the 8088 is capable of 
handling. Signals ALE, DEN, and DT/Rare generated by 
the 8288 instead of the processor in this configuration, 
although their timing remains relatively the same. The 
8088 status outputs (52, 51, and SO) provide type of 
cycle information 
and become 8288 inputs. This bus 
cycle information 
specifies 
read (code, data, or I/O), 


write (data or I/O), interrupt acknowledge, or software 
halt. The 8288 thus issues control signals specifying 
memory read or write, I/O read or write, or interrupt 
acknowledge. The 8288 provides two types of write 
strobes, normal and advanced, to be applied as required. 
The normal write strobes have data valid at the leading 
edge of write. The advanced write strobes have the 
same timing as read strobes, and hence, data is not 
valid at the leading edge of write. The 8286/8287trans- 
ceiver receives the usual T and OE inputs from the 
8288's DT/Rand DEN outputs. 


The pointer into the interrupt vector table, which is 
passed during the second INTA cycle, can derive from 
an 8259A located on either the local bus or the system 
bus. If the master 8289A priority interrupt controller is 
positioned on the local bus, a TTL gate is required to 
disable the 8286/8287transceiver when reading from the 
master 8259A during 
the interrupt 
acknowledge 
se- 
quence and software "poll". 


The 8088CPU is an 8-bit processor designed around the 
8086 internal structure. Most internal functions of the 
8088 are identical to the equivalent 8086 functions. The 
8088 handles the external bus the same way the 8086 
does with the distinction 
of handling only 8 bits at a 
time. Sixteen-bit operands are fetched or written in two 
consecutive bus cycles. Both processors will appear 
identical to the software engineer, with the exception of 
execution time. The internal register structure is iden- 
tical and all instructions have the same end result. The 
differences 
between the 8088 and 8086 are outlined 
below. The engineer who is unfamiliar with the 8086 is 
referred to the 8086 Family User's Manual, Chapters 2 
and 4, for function 
description 
and instruction 
set 
information. 


Internally, there are three differences between the 8088 
and the 8086.All changes are related to the 8-bit bus in- 
terface. 


• The queue length is 4 bytes in the 8088, whereas the 
8086 queue contains 6 bytes, or three words. The 
queue was shortened to prevent overuse of the bus by 
the BIU when prefetching instructions. This was reo 
quired because of the additional time necessary to 
fetch instructions 8 bits at a time. 


inter 


• To further optimize the queue, the prefetching algo- 
rithm was changed. The 8088 BIU will fetch a new in- 
struction to load into the queue eae. time there is a 1 
byte hole (space available) in the queue. The 8086 
waits until a 2-byte space is available. 


• The internal execution time of the instruction 
set is 


affected by the 8-bit interface. All 16-bit fetches and 
writes fromlto 
memory take an additional 
four clock 
cycles. The CPU is also limited by the speed of in- 
struction 
fetches. 
This latter 
problem 
only occurs 


when a series of simple operations occur. When the 
more sophisticated 
instructions of the 8088 are being 


used, the queue has time to fill and the execution pro- 
ceeds as fast as the execution unit will allow. 


The 8088 and 8086 are completely software compatible 
by virture of their identical execution 
units. Software 
that is system dependent may not be completely trans· 
ferable, but software that is not system dependent will 
operate equally as well on an 8088 or an 8086. 


The hardware interface of the 8088 contains the major 
differences 
between the two CPUs. The pin assign- 


ments are nearly identical, however, with the following 
functional 
changes: 


• A8-A 15 - 
These pins are only address outputs on the 


8088. These address lines are latched internally and 
remain valid throughout 
a bus cycle 
in a manner 
similar to the 8085 upper address lines. 


• 
BHE has no meaning on the 8088 and has been elimi- 
nated. 


• SSO provides the SO status information 
in the mini- 


mum mode. This output occurs on pin 34 in minimum 
mode only. DTlR, 101M, and SSO provide the complete 
bus status in minimum mode. 


• 
101M has been inverted to be compatible 
with the 


MCS-85 bus structure. 


• ALE is delayed by one clock cycle in the minimum 


mode when entering HALT, to allow the status to be 
latched with ALE. 
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8288 
ROY 
8284 


READY 
8088 


ADT-ADO 


8088 
A15-A8 


RD 


OT/R 


8288 
MRDC 


DEN 


Ambient 
Temperature 
Under 
Bias 
0"°C10 70°C 
Storage 
Temperature 
-65·C 
to 
+ 150·C 
Voltage 
on Any 
Pin with 
Respect 
to Ground 
- 1.0 to + 7V 


Power 
Dissipation 
..............•......... 
2.5 Watt 


device. This is a stress rating only and functional opera- 
tion of the device at these or any other conditions above 
those indicated in the operational sections of this specifi- 
cation is not implied. Exposure to absolute maximum 
rating conditions for extended periods may affect device 
reliability 


Symbol 
Parameter 


.~ 
Min. 
Max. 
Units 
Test 
Conditions 


V1l 
Input 
Low Voltage 
-0.5 
+0.8 
V 


V1H 
Input 
High 
Voltage 
2.0 
Vee+0.5 
V 


VOL 
Output 
Low Voltage 
0.45 
V 
10l = 2.0 mA 


VOH 
Output 
High 
Voltage 
2.4 
V 
10H = 400 f'A 


Ice 
Power 
Supply 
Current 
340 
mA 
TA = 25·C 


III 
Input 
Leakage 
Current 
±10 
f'A 
OV <S VIN 
<S VCC 


Ilo 
Output 
Leakage 
Current 
±10 
f'A 
0.45V '" VOUT '" Vee 


Vel 
Clock 
Input 
Low Voltage 
-0.5 
+0.6 
V 
- 


VeH 
Clock 
Input 
High 
Voltage 
3.9 
Vee+ 
1.0 
V 
..... 


Capacitance 
of Input 
Buffer 


CIN 
(All input 
except 
15 
pF 
fc = 
1 MHz 


ADo-AD? 
RO/Gl) 


C10 
Capacitance 
of 1/0 Buffer 
15 
pF 
fc = 
1 MHz 
(ADo-AD? 
RO/GT) 


A.C. CHARACTERISTICS 
(TA = o·C to 70·C, Vcc 
= 5V ±10%) 


MINIMUM 
COMPLEXITY 
SYSTEM 
TIMING 
REQUIREMENTS 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Conditions 


TCLCL 
CLK Cycle Period 
200 
500 
ns 


TCLCH 
CLK Low Time 
. 
(~ TCLCL)-15 
ns 


TCHCL 
CLK High Time 
(V> TCLCL)+2 
ns 


TCH1CH2 
CLK Rise Time 
10 
ns 
From 
1.0V to 3.5V 


TCL2CLl 
CLK Fall Time 
10 
ns 
From 3.5V to 1.0V 
- 


TDVCL 
Data In Setup 
Time 
30 
ns 


TCLDX 
Data In Hold Time 
10 
ns 


TR1VCL 
RDY Setup 
Time 
into 8284 (See Notes 
1,2) 
35 
ns 


TCLR1X 
RDY Hold Time 
into 8284 (See Notes 
1, 2) 
0 
ns 


TRYHCH 
READY Setup 
Time 
into 8088 
(~ TCLCL)-15 
ns 


TCHRYX 
READY Hold Time 
into 8088 
30 
ns 


TRYLCL 
READY Inactive 
to CLK(See 
Note 3) 
-8 
ns 


THVCH 
HOLD Setup 
Time 
35 
ns 


TINVCH 
INTR, NMI, TEST Setup 
Time 
(See Note 
2) 
30 
ns 


TILIH 
Input 
Rise Time 
(Except 
CLK) 
20 
ns 
From 0.8V to 2.0V 


TIHIL 
Input 
Fall Time 
(Except 
CLK) 
12 
ns 
From 2.0V to 0.8V 


A.C. CHARACTERISTICS 
(Continued) 


TIMING 
RESPONSES' 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Conditions 


TCLAV 
Address 
Valid 
Delay 
10 
110 
ns 


TCLAX 
Address 
Hold 
Time 
I)' 
10 
ns 


TCLAZ 
Address 
Float 
Delay 
TCLAX 
80 
ns 


TLHLL 
ALE Width 
TCLCH-20 
ns 


TCLLH 
ALE Active 
Delay 
80 
ns 


TCHLL 
ALE 
Inactive 
Delay 
85 
ns 


TLLAX 
Address 
Hold 
Time 
to ALE Inactive 
TCHCL-10 
ns 


TCLDV 
Data Valid 
Delay 
10 
110 
ns 
CL = 20-100 
pF for 


TCHDX 
Data 
Hold 
Time 
10 
ns 
all 8088 Outputs 
in addition 
to 
TWHDX 
Data 
Hold 
Time 
After 
WR 
TCLCH-30 
ns 
internal 
loads 
TCVCTV 
Control 
Active 
Delay 
1 


'," 
10 
110 
ns 


TCHCTV 
Control 
Active 
Delay 
2 
10 
110 
ns 


TCVCTX 
Control 
Inactive 
Delay 
10 
110 
ns 


TAZRL 
Address 
Float 
to READ Active 
0 
ns 


TCLRL 
RD Active 
Delay 
10 
165 
ns 


TCLRH 
RD Inactive 
Delay 
10 
150 
ns 


TRHAV 
RD Inactive 
to Next 
Address 
Active 
TCLCL-45 
ns 


TCLHAV 
HLDA 
Valid 
Delay 
10 
160 
ns 


TRLRH 
RD Width 
- 
-', 
2TCLCL-75 
ns 


TWLWH 
WR Width 
2TCLCL-60 
ns 


TAVAL 
Address 
Valid 
to ALE 
Low 
TCLCH-60 
ns 


TOLOH 
Output 
Rise Time 
• 
20 
ns 
From 
0.8V to 2.0V 


TOHOL 
Output 
Fall Time 
12 
ns 
From 
2.0V to 0.8V 


A C TESTING· 
INPUTS ARE DRIVEN AT 2 4V FOR A LOGIC 
l' 
AND 0 45V FDA 


A LOGIC 
0 ' THE CLOCK 
IS DRIVEN AT 4 3V AND 02SV 
TIMING 
MEASURE- 
MENTS 
ARE MADE AT 1,sv FOR BOTH A LOGIC 
l' 
AND 
0 


inter 


READ CYCLE 


(NOTE 
1) 
(WlI, IIITA. vo.) 


inter 


INTA 
CYCLE 


NOTES 
1,3 


(RD, WR::. VOH) 


SOFTWARE 
HALT 
- 


OEN,RD,WR,INTA::. 
VOH 


DTiil INDETERMINATE 


NOTES: 
1. All 
SIGNALS 
SWITCH 
BETWEEN 
VOH AND VOL UNLESS 
OTHERWISE 
SPECIFIED. 


2. ROY IS SAMPLED 
NEAR 
THE 
END OF 12. 13. Tw TO DETERMINE 
IF Tw 
MACHINES 
STATES 
ARE TO BE INSERTED. 


3. TWO 
INfA 
CYCLES 
RUN BACK·YO·BACK. 
THE 8088 LOCAL 
ADDRIDATA 
8US 
IS FLOATING 
DURING 
BOTH 
(NfA 
CYCLES. 
CONTROL 
SIGNALS 
ARE SHOWN 
FOR THE SECOND 
INfA 
CYCLE. 


4. SIGNALS 
AT 8284 ARE SHOWN 
FOR REFERENCE 
ONLY. 


5. ALL 
TIMING 
MEASUREMENTS 
ARE MADE 
AT 1.5V UNLESS 
OTHERWISE 


NOTED 


MAX 
MODE 
SYSTEM 
(USING 
8288 
BUS CONTROLLER) 


TIMING 
REQUIREMENTS 


Symbol 
Parameter 
- 
Min. 
Max. 
Units 
Test CondItions 


TClCl 
ClK 
Cycle 
Period 
200 
500 
ns 


TClCH 
ClK 
low 
Time 
(% TClCl) 
-15 
ns 


TCHCl 
ClK 
High 
Time 
('13 TClCl)+2 
ns 


TCH1CH2 
ClK 
Rise Time 
r 
10 
ns 
From 
1.0V to 3.5V 


TCl2Cl1 
ClK 
Fall Time 
10 
ns 
From 
3.5V to 1.0V 


TDVCl 
Data In Setup 
Time 
30 
ns 


TClDX 
Data In Hold 
Time 
10 
ns 


TR1VCl 
RDY Setup 
Time 
into 
8284 (See Notes 
1,2) 
35 
ns 


TClR1X 
RDY Hold 
Time 
into 
8284 (See Notes 
1, 2) 
0 
ns 


TRYHCH 
READY 
Setup 
Time 
into 
8088 
(% TClCl)-15 
ns 


TCHRYX 
READY 
Hold 
Time 
into 
8088 
30 
ns 


TRYlCl 
READY 
Inactive 
to ClK 
(See Note 
4) 
-8 
ns 


TINVCH 
Setup 
Time 
for 
Recognition 
(INTR, 
NMI, TEST) 
30 
ns 
(See Note 
2) 


TGVCH 
RQ/GT 
Setup 
Time 
30 
ns 


TCHGX 
RQ Hold 
Time 
into 
8086 
40 
ns 


T1L1H 
Input 
Rise Time 
(Except 
ClK) 
20 
ns 
From 
0.8V to 2.0V 


T1Hll 
Input 
Fall Time 
(Except 
ClK) 
12 
ns 
From 
2.0V to 0.8V 


A.C. CHARACTERISTICS 
(Continued) 


TIMING RESPONSES 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Conditions 


TClMl 
Command 
Active 
Delay (See Note 1) 
10 
35 
ns 


TClMH 
Command 
Inactive 
Delay (See Note 1) 
10 
35 
ns 


TRYHSH 
READY Active 
to Status 
Passive 
(See Note 3) 
110 
ns 


TCHSV 
Status 
Active 
Delay 
10 
110 
ns 


TClSH 
Status 
Inactive 
Delay 
- 
10 
130 
ns 


TClAV 
Address 
Valid 
Delay 
10 
110 
ns 


TCLAX 
Address 
Hold Time 
- 
10 
ns 


TCLAZ 
Address 
Float Delay 
TClAX 
80 
ns 


TSVlH 
Status 
Valid 
to ALE High (See Note 1) 
15 
ns 


TSVMCH 
Status 
Valid 
to MCE High (See Note 1) 
15 
ns 


TCllH 
ClK 
low 
to ALE Valid 
(See Note 1) 
15 
ns 


TClMCH 
ClK 
low 
to MCE High (See Note 1) 
, 
.-" 
15 
ns 


TCHll 
ALE Inactive 
Delay (See Note 1) 
15 
ns 


TClMCl 
MCE Inactive 
Delay (See Note 1) 
15 
ns 
Cl = 20-100 pF for 


TClDV 
Data Valid 
Delay 
10 
110 
ns 
all 8088 Outputs 


TCHDX 
Data Hold Time 
10 
ns 
in addition 
to 
internal 
loads 
TCVNV 
Control 
Active 
Delay (See Note 1) 
5 
45 
ns 


TCVNX 
Control 
Inactive 
Delay (See Note 1) 
10 
45 
ns 


TAZRl 
Address 
Float to Read Active 
0 
ns 


TClRl 
RD Active 
Delay 
10 
165 
ns 


TClRH 
RD Inactive 
Delay 
10 
150 
ns 


TRHAV 
RD Inactive 
to Next Address 
Active 
TClCl-45 
ns 


TCHDTl 
Direction 
Control 
Active 
Delay (See Note 1) 
50 
ns 


TCHDTH 
Direction 
Control 
Inactive 
Delay (See Note 1) 
- 
30 
ns 


TClGl 
GT Active 
Delay 
I 
110 
ns 


TClGH 
GT Inactive 
Delay 
85 
ns 


TRlRH 
RD Width 
2TClCl-75 
ns 


TOlOH 
Output 
Rise Time 
20 
ns 
From 0.8V to 2.0V 


TOHOl 
Output 
Fall Time 
-- 
- 
12 
ns 
From 2.0V to 0.8V 


NOTES: 
1. Signal at 8284 or 8288 shown for reference 
only. 


2. Setup requirement 
for asynchronous 
signal only to guarantee 
recognition 
at next elK. 
3. Applies only to T2 state (8 ns into T3 state). 
4. Applies only to T2 state (8 ns into T3 state). 


inter 


BUS TIMING-MAXIMUM 
MODE 
SYSTEM (USING 8288) 


1 


ALE (8288 OUTPUT) 


see NOTE 
5 


ROY (8284 
INPUn 


BUS TIMING-MAXIMUM 
MODE SYSTEM 
(USING 8288) 


,------- 
\\._----- 


NOTES: 
1. ALL 
SIGNALS 
SWITCH 
BETWEEN 
VOH AND 
VOL UNLESS 
OTHERWISE 
SPECIFIED. 
2. RDY lS SAMPLED 
NEAR THE END OF 12. T3. Tw TO DETERMINE 
IF Tw 
MACHINES 
STATES ARE TO BE INSERTED. 
3.• CASCADE 
ADDRESS 
IS VALID 
BETWEEN 
FIRST 
AND 
SECOND 
INTA 
CYCLES. 
4. TWO INTA CYCLES RUN BACK·TO·BACK. 
THE 8088 LOCAL 
AODR/DATA 
BUS 
IS 
FLOATINQ 
DURING 
BOTH 
INTA 
CYCLES. 
CONTROL 
FOR 
POINTER ADDRESS 
IS SHOWN 
FOR SECOND 
INTA CYCLE. 
5. SIGNALS 
AT 8284 OR 8288 ARE SHOWN 
fOR 
REFERENCE 
ONLY. 
8. THE ISSUANCE 
OF THE 8288 COMMAND 
AND 
CONTROL 
SIGNALS 
(M"R"IR:, 
MW1'C, 
~, 
RmC. 
rowe, 
JJnWe', 
~ 
AND 
DEN) LAGS 
THE 
ACTIVE HIGH 8288 CEN. 
7. ALL TIMING 
MEASUREMENTS 
ARE MADE AT 1.5V UNLESS OTHERWISE 
NOTED. 
8. STATUS INACTIVE 
IN STATE JUST PRIOR TO T4. 
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ASYNCHRONOUS 
SIGNAL 
RECOGNITION 


BUS LOCK SIGNAL 
TIMING 


(MAXIMUM 
MODE ONLY) 


Pr.viOUlvr.n' 
A''''';:l~/-' 
-< 


ADr-ADo 
IOU 


:~ 
~I----------------< 


~ 
, 


~ 


-lClKC'CLE- 


C" l 
'HVCH 
'SEE NOYE 1) 


HOlD~ 


HOLD/HOLD 
ACKNOWLEDGE 
TIMING 
(MINIMUM 
MODE ONLY) 


~(C:·_'" L 


{ 


'ClHAY 


-----\'~, ----_. 


inter 


IAPX 86/10, 88/10 
INSTRUCTION 
SET SUMMARY 


Jisono 
71'43"0 
11543110 
1I54Stli 


~rnodOO"'''' 
~I, I 1 10' 
, w 
11IlOO011 
" ••• I 


1 0 0 0 I 04 .•. _. 
'" ". 
, 1 0 0 0'1 
•• _000 
". 
(l.l. 
I 
lI." 
,1.1 


101 
I •• '" 
4'" 
dll.,I. 
I I 


1010000 
W 
.lIllI·low 
11)(I,·1l'01l1 


10 
I 0 0 01. 
'deI,lo. 
.(\4 •. 11'9" 
1 


10001110 
mod 
0 Ie; ". 
10001'00 
modO,t9 ". 


Ae9,slelimeftlO'ylo,homrf9,Slt' 


IlIIlIIICIoJle10 
If\l'iltrlmtmory 


Immtclo.'t10 
'tg'Sltr 


Wtmoly 
10 <lCC\lm",I,lor 


AUllmul"0110mtmOly 


IItg,srerIlTlflOlOl·,tougmtnt'fll,sltr 


Stomenl',g,s'" 
IOffQ'Slt.1mtmOf'l' 


AtO,Sltrfmtmo,y 


Atll'"t. 


.ECiCII.nges19" 


00' 
I lOCI 
'*' _. 
'" ". 
1 0 0 0 0 0 s •• 
mOll' 
, 1 ". 
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8089 
8 & 16-BIT HMOS I/O PROCESSOR 


• High Speed DMA Capabilities 
Including 
I/O to Memory, Memory to I/O, Memory 
to Memory, and I/O to I/O 


• iAPX 86, 88 Compatible: Removes I/O 
Overhead from CPU in iAPX 86/11 or 
88/11 Configuration 


• Allows Mixed Interface of 8- & 16-Bit 
Peripherals, to 8· & 16-Bit Processor 
Busses 


• 1 Mbyte Addressability 


• Memory Based Communication with 
CPU 


• Supports LOCAL or REMOTE I/O 
Processing 


• Flexible, Intelligent DMA Functions 
Including Translation, Search, Word 
Assembly/Disassembly 


• MULTIBUS™ Compatible System 
Interface 


The IntelQl)8089 is a revolutionary 
concept 
in microprocessor 
input/output 
processing. 
Packaged 
in a 40-pin 
DIP package, 


the 8089 is a high performance 
processor 
implemented 
in N-channel, 
depletion 
load silicon 
gate technology 
(HMOS). The 
8089's 
instruction 
set and capabilities 
are optimized 
for high 
speed, 
flexible 
and efficient 
I/O handling. 
It allows 
easy 


interface 
of Intel's 
16-bit 
iAPX 86 and 
8-bit 
iAPX 88 microprocessors 
with 
8- and 
16-bit 
peripherals. 
In the 
REMOTE 
configuration, 
the 8089 bus is user definable 
allowing 
itto 
be compatible 
with any 8/16-bit 
Intel microprocessor, 
interfacing 


easily to the Intel multiprocessor 
system 
bus standard 
MULTI BUST•. 


The 8089 performs 
the function 
of an intelligent 
DMA controller 
for the Intel iAPX 86, 88 family 
and with 
its processing 
power, can remove 
I/O overhead 
from 
the iAPX 86 or iAPX 88. It may operate 
completely 
in parallel 
with 
a CPU, giving 
dramatically 
improved 
performance 
in I/O intensive 
applications. 
The 8089 provides 
two I/O channels, 
each supporting 
a 


transfer 
rate up to 1.25 mbyte/sec 
at the standard 
clock 
frequency 
of 5 MHz. Memory 
based communication 
between 
the 
lOP and CPU enhances 
system 
flexibility 
and encourages 
software 
modularity, 
yielding 
more reliable, 
easier 
to develop 
systems. 
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Figure 2. 
8089 Pin Configuration 


I/O 
Multiplexed 
Address 
and Data Bus: The 
function 
of these lines are defined by the 
state of SO, 51 and S2 lines. The pins are 
floated after reset and when the bus is not 
acquired. A8-A15 are stable on transfers to a 
physical 8-bit data bus (same bus as 8088), 
and are multiplexed with data on transfers to 
a 16-bit physical bus. 
o 
Address 
and 
Status: 
Multiplexed 
most 


significant 
address 
lines and status 
in- 
formation. The address lines are active only 
when addressing 
memory. Otherwise, 
the 
status lines are active and are encoded as 
shown below. The pins are floated after reset 
and when the bus is not acquired. 
56555453 
1 
1 
0 
0 DMA cycle on CH1 


1 
1 
0 
1 DMA cycle on CH2 
1 
1 
1 
0 Non-DMA cycle on CH1 


1 
1 
1 
1 Non-DMA cycle on CH2 
o 
Bus High Enable: The Bus High Enable is 
used to enable data operations on the most 
significant half of the data bus (D8-D15). The 
signal is active low when a byte is to be 
transferred on the upper half of the data bus. 
The pin is floated after reset and when the 
bus is not acquired. BHE does not have to be 
latched. 


o 
Status: These are the status pins that define 
the lOP activity during any given cycle. They 
are encoded as shown below: 
S2S1SO 
o 
0 
0 Instruction fetch; I/O space 
o 
0 
1 Data fetch; I/O space 
o 
1 
0 Data store; I/O space 
o 
1 
1 Not used 
1 
0 
0 Instruction fetch; System Memory 


1 
0 
1 Data fetch; System Memory 
1 
1 
0 Data store; System Memroy 


1 
1 
1 Passive 
The status 
lines are utilized 
by the bus 
controller 
and bus arbiter 
to generate all 
memory and I/O control signals. The signals 
change during T4 if a new cycle is to be 
entered while the return to passive state in T3 
or Tw indicates the end of a cycle. The pins 
are floated after system reset and when the 
bus is not acquired. 


I 
Ready: The ready signal received from the 
addressed device indicates that the device is 
ready for data transfer. The signal is active 
high and is synchronized by the 8284 clocl< 
generator. 


Symbol 
Type 
Name and Function 


lOCK 
0 
Lock: The lock output signal indicates to the 
bus controller that the bus is needed for more 
than one contiguous cycle. It is set via the 
channel control register, and during the TSl 
instruction. 
The pin floats after reset and 


when the bus is not acquired. This output is 
active low. 


RESET 
I 
ReSElt: The receipt of a reset signal causes 
the lOP to suspend all its activities and enter 
an idle state until a channel 
attention 
is 


received. The signal must be active for at 
least four clock cycles. 


ClK 
I 
Clock: 
Clock provides all timing needed for 
internal lOP operation. 


CA 
I 
Channel Attention: 
Gets the attention of the 


lOP.Upon the falling edge of this signal, the 


1, 


SEl 
input 
pin is examined 
to determine 


Master/Slave or CH1/CH2 information. This 
input is active high. 


SEl 
I 
Select: 
The first CA received after system 


reset informs the lOP via the SEl line, whe- 
ther it is a Master or Slave (0/1 for Mas- 
ter/Slave 
respectively) 
and starts 
the in- 


itialization sequence. During any other CA 
the 
SEl 
line 
signifies 
the 
selection 
of 


CH1/CH2. (0/1 respectively.) 


DRQl-2 
I 
Data Request: 
DMA request inputs which 


signal the lOP that a peripheral is ready to 
transfer/receive data using channels 1 or 2 
respectively. The signals must be held active 
high until the appropriate 
fetch/stroke 
is 


initiated. 


RQ/GT 
I/O 
Request Grant: 
Request Grant implements 


the communication dialogue required to ar- 
bitrate the use of the system bus (between 
lOP and CPU, lOCAL mode) or I/O bus when 
two 
lOPs share-!!'e 
same bus (REMOTE 


mode). The RQ/GT signal..!! ~ive 
low. An 


internal 
pUll-up permits 
RQ/GT to be left 


floating if not used. 


SINTR1-2 
0 
Signal 
Interrupt: 
Signal 
Interrupt 
outputs 


from channels 
1 and 2 respectively. 
The 
interrupts may be sent directly to the CPU or 
through the 8295A interrupt controller. They 
are used to indicate 
to the system 
the 


occurrence of user defined events. 


EXTl-2 
I 
External 
Terminate: 
External 
terminate 


inputs for channels 1 and 2 respectively. The 
EXTsignals will cause the termination of the 
current DMA transfer operation if the chan- 
nel is so programmed by the channel control 
register. The signal must be held active high 
until termination is complete. 


Vcc 
Voltage: 
+5 volt power input. 


Vss 
Ground. 


The 8089 lOP has been designed to remove I/O proces· 
sing, control and high speed transfers from the central 
processing unit. Its major capabilities include that of in· 
itializing 
and maintaining 
peripheral components and 
supporting 
versatile DMA. This DMA function 
boasts 
flexible termination conditions (such as external termi· 
nate, mask compare, single transfer and byte count ex· 
pired). The DMA function of the 8089 lOP uses a two cy- 
cle 
approach 
where the 
information 
actually 
flows 
through the 8089 lOP. This approach to DMA vastly sim- 
plifies the bus timings and enhances compatibility 
with 
memory and peripherals, in addition to allowing opera- 
tions to be performed on the data as it is transferred. 
Operations can include such constructs 
as translate, 
where the 8089 automatically vectors through a lookup 
table and mask compare, both on the "fly". 


The 8089is functionally compatible with Intel's iAPX86,88 
family. It supports any combination of 8/16-bit busses. In 
the REMOTE mode it can be used to complement other 
Intel processor families. Hardware and communication 
architecture are designed to provide simple mechanisms 
for system upgrade. 


The only direct communication 
between the lOP and 
CPU is handled by the Channel Attention and Interrupt 
lines. Status 
information, 
parameters and task 
pro- 
grams are passed via blocks of shared memory, simpli- 
fying hardware interface and encouraging structured 
programming. 


The 8089 can be used in applications such as file and 
buffer management in hard disk or floppy disk control. It 
can also provide for soft error recovery routines and scan 
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control. CRT control, such as cursor control and auto 
scrolling. is simplified with the 8089. Keyboard control. 
communication control and general I/O are just a few of 
the typical applications for the 8089. 


Shown in Figure 3 is the 8089 in a LOCAL configuration. 
The iAPX86(or iAPX88)is used in its maximum mode.The 
8089and iAPX86 reside on the same local bus. sharing the 
same set of system buffers. Peripherals located on the 
system bus can be addressed by either the iAPX 86 or the 
8089. The 8089 requests the use of the LOCAL bus by 
means of the RQ/GTline. This performs a similar function 
to that of HOLD and HLDA on the Intel 808SA.8080A and 
iAPX 86 minimum 
mode, but is implemented 
on one 
physical line. When the iAPX 86 relinquishes the system 
bus. the 8089 uses the same bus control. latches and 
transceiver components to generate the system address. 
control 
and data 
lines. 
This 
mode allows 
a more 
economical 
system configuration 
at the expense of 
reduced CPUthruput due to lOP bus utilization. 


A typical REMOTEconfiguration is shown in Figure 4. In 
this mode, the lOP's bus is physically separated from 
the system bus by means of system buffers/latches. The 
lOP maintains its own local bus and can operate out of 
local or system memory. The system bus interface con- 
tains the following components: 


• Up to three 8282 buffer/latches to latch the address to 
the system bus. 


• Up to two 8286 devices bidirectionally 
buffer the 
system data bus. 


inter 


• An 8288 bus controller supplies the control signals 
necessary for buffer 
operation 
as well 
as MRDC 
(Memory Read)and MWTC (Memory Write) signals. 


• An 
8289 bus 
arbiter 
performs 
all 
the 
functions 
necessary to arbitrate the use of the system bus. This 
is used in place of the RQ/GT logic in the LOCAL 
mode. This arbiter decodes type of cycle information 
from the 8089 status lines to determine if the lOP 
desires to perform a transfer over the "common" 
or 
system bus. 


The peripheral devices PERl and PER2are supported on 
their own data and address bus. the 8089 communicates 
with the peripherals without affecting system bus opera- 
tion. Optional buffers may be used on the local bus when 
capacitive loading conditions so dictate. I/Oprograms and 
RAM buffers may also reside on the local bus to further 
reduce system bus utilization. 


Fundamentally, communication 
between the CPU and 
lOP is performed through messages prepared in shared 
memory. The CPU can cause the 8089 to execute a pro- 
gram by placing it in the 8089's memory space and/or 
directing the 8089's attention to it by asserting a hard- 
ware Channel Attention 
(CA) signal 
to the lOP, ac- 
tivating the proper I/O channel. The SEL Pin indicates to 
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the lOP which channel is being addressed. Communica- 
tion from the lOP to the processor can be performed in a 
similar manner via a system interrupt (SINTR 1,2),if the 
CPU has enabled interrupts for this purpose. Addition- 
ally, the 8089 can store messages in memory regarding 
its status and the status of any peripherals. This com- 
munication 
mechanism is supported by a hierarchial 
data structure to provide a maximum amount of flexi- 
bility of memory use with the added capability of handl- 
ing mUltiple lOP's. 


Illustrated in Figure 5 is an overview of the communica· 
tion data structure hierarchy that exists for the 8089 I/O 
processor. Upon the first CA from RESET, if the lOP is 
initialized asthe BUS MASTER,5 bytes of information are 
read into the 8089 starting at location FFFF6 (FFFF6, 
FFFF8-FFFFB)where the type of system bus (16-bit or 8- 
bill and pointers to the system configuration 
block are 
obtained. This isthe only fixed location the 8089accesses. 
The remaining addresses are obtained via the data struc- 
ture hierarchy. The 8089 determines addresses in the 
same manner as does the iAPX86; i.e., a l6-bit relocation 
pointer is offset left 4 bits and added to the l6-bit address 
offset, obtaining a 20-bit address. Once these 20-bit ad- 
dressesare formed, they are stored assuch, as all the 8089 
address registers are 20 bits long. After the system con- 
figuration pointer address is formed, the 8089 lOP ac- 
cesses the system configuration block. 
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The System Configuration 
Block (SCB), used only dur- 
ing startup, points to the Control Block (CB)and provides 
lOP system configuration 
data via the SOC byte. The 
SOC byte initializes 
lOP 1/0 
bus width to 8/16, 
and 
defines one of two lOP RQ/GT 
ope"rating modes. For 
RQ/GT mode 0, the lOP is typically initialized as SLAVE 
and has its RQ/GT line tied to a MASTER CPU (typical 
LOCAL configuration). In this mode, the CPU normally 
has control of the bus, grants control to the lOP as need- 
ed, and has the bus restored to it upon lOP task comple- 
tion (lOP request-CPU 
grant-lOP 
done). For RQ/GT 
mode 1, useful only in remote mode between two lOPs, 
MASTERISLAVE designation 
is used only to initialize 
bus control: from then on, each lOP requests and grants 
as the bus is needed (IOP1 request-IOP2 
grant-IOP2 
request-IOP1 
grant). Thus, each lOP retains bus con- 
trol until the other requests it. The completion of in- 
itialization 
is signalled by the lOP clearing the BUSY 
flag in the CB. This type of startup allows the user to 
have the startup pointers in ROM with the SCB in RAM. 
Allowing the SCB to be in RAM gives the user the flex- 
ibility of being able to initialize multiple lOPs. 


The Control Block furnishes bus control Initialization for 
the lOP operation (CCW or Channel Control Word) and 
provides pointers to the Parameter Block or "data" 
memory for both channels 1 and 2. The CCW is retrieved 
and analyzed upon all CA's other than the first after a 
reset. The CCW byte is decoded to determine channel 
operation. 


The Parameter Block contains the address of the Task 
Block and acts as a messge center between the lOP and 
CPU. Parameters or variable information is passed from 
the CPU to its lOP in this block to customize the soft- 
ware interface to the peripheral device. It is also used 
for transferring data and status information between the 
lOP and CPU. 


The Task Block contains the instructions for the respec- 
tive channel. This block can reside on the local bus of 
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the CPU, or reside in system memory. 


The advantage of this type of communication 
between 
the processor, lOP and peripheral, is that it allows for a 
very clean method for the operating system to handle 
I/O routines. Canned programs or "Task Blocks" allow 
for execution of general purpose I/O routines with the 
status 
and 
peripheral 
command 
information 
being 
passed via the Parameter Block ("data" memory). Task 
Blocks (or "program" 
memory) can be terminated or 


restarted by the CPU, if need be. Clearly, the flexibility 
of this communication lends itself to modularity and ap- 
plicability to a large number of peripheral devices and 
upward compatibility 
to future end user systems and 
microprocessor families. 


The 8089 maintains separate registers for its two I/O 
channels as well as some common registers (see Figure 
6).There are sufficient registers for each channel to sus- 
tain its own DMA transfers, and process its own instruc- 
tion stream. The basic DMA pointer registers (GA, GB - 
20 bits each), can point to either the system bus or local 
bus, DMA source or destination, and can be autoincre- 
mented. A third register set (GC) can be used to allow 
translation during the DMA process through a lookup 
table it points to. Additionally, registers are provided for a 
masked compare during the data transfer and can be set 
up to act as one of the termination conditions. Other 
registers arealso provided. Manyof these registers can be 
used as general purpose registers during program execu- 
tion, when the lOP is not performing DMA cycles. 


TAG19 
0 


G.P. ADDRESS A (GA) 


O.P. ADDRESS B (GB) 


O.P. ADDRESS C (GC) 


TASK POINTER (TP) 


__ 
1·BIT POINTER TO EITHER 110OR SYSTEM MEMORY SPACE 
~15.c====;;:;';;;::::====~1 
I 
INDEX 
(IX) 


BYTE COUNT (BC) 


I 
MASK 
I 
COMPARE (MC) 


CHANNEL CONTROL (CC) 


NON USER PROGRAMMABLE 
(ALWAYS POINTS TO SYSTEM MEMORY) 
191 
I 
PARAMETER POINTER (PP) 


Bus Operation 


The 8089 utilizes 
the 
same bus structure 
as the 


iAPX 86, 88 in their maximum mode configurations (see 
Figure 7). The address is time multiplexed with the data 
on the first 
16/8 
lines. A16 through A19 are time multi- 
plexed with four status lines S3-S6.For 8089 cycles, S4 
and S3 determine what type of cycle (DMA versus non- 
DMA) is being performed on channels 1 or 2. S5 and S6 


the user to detect which processor is performing a bus 
cycle in a multiprocessing 
environment. 


The first three status lines, SO·S2,are used with an 8288 
bus controller 
to determine if an instruction 
fetch or 


data transfer 
is being 
performed 
in 
110 or system 


memory space. 


DMA transfers require at least two bus cycles with each 
bus cycle requiring a minimum of four clock cycles. Ad- 
ditional 
clock cycles are added if wait states are reo 


quired. This two cycle approach simplifies considerably 
the bus timings in burst DMA. The 8089 optimizes the 
transfer 
between two different 
bus widths 
by using 


three bus cycles versus four to transfer 1 word. More 
than one read (write) is performed when mapping an 
8-bit bus onto a 16·bit bus (vice versa). For example, a 
data transfer from an 8·bit peripheral to a 16·bit physical 
location 
in memory is performed by first 
doing two 


reads, with word assembly within 
the lOP assembly 


register file and then one write. 


As can be expected, the data bandwidth of the lOP is a 
function of the physical bus width of the system and 1/0 
busses. Table 2 gives the bandwidth, latency and bus 
utilization of the 8089. The system bus is assumed to be 
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column) or 16·bit peripheral (word column) being shown. 


The latency refers to the worst case response time bv 
the lOP to a DMA request, without the bus arbitration 
times. Notice that the word transfer allows 50% more 
bandwidth. This occurs since three bus cycles are reo 
quired to map 8·bit data into a 16-bit location, versus two 
for a 16·bit to 16·blt transfer. Note that it is possible to 
fully 
saturate 
the system 
bus in the 
LOCAL mode 


whereas in the REMOTE mode this is reduced to a max· 
imum of 50%. 


Local 
Remote 


Byte 
Word 
Byte 
Word 


Bandwidth 
830 KBiS 
1250 KBiS 
830KBlS 
1250 KBiS 


Latency 
1.0/2.4 ~sec' 
1.0/2.4 ~sec' 
1.0/2.4 ~sec' 
1.0/2.4 ~sec' 


System 
Bus 
2.4lJsec 
1.6 ~sec 
0.8~ec 
0.8 ~sec 


PER 
PER 
PER 
PER 


Utilization 
TRANSFER 
TRANSFER 
TRANSFER 
TRANSFER 


GOES INACTIVE IN THE STATE 
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'NOTlCE: Stresses above those listed under "Absolute 
Maximum Ratings" may cause permanent damage to the 
device. This is a stress rating only and functional opera- 
tion of the device at these or any other conditions above 
those indicated 
in the operational 
sections 
of this 
specification is not implied. Exposure to absolute maxi- 
mum rating conditions for extended periods may affect 
device reliability. 


Ambient 
Temperature 
Under 
Bias 
O·C to lO·C 
Storage 
Temperature 
- 65·C 
to 
+ 150·C 
Voltage 
on Any 
Pin with 
Respect 
to Ground 
- 
1.0 to + 7V 


Power 
Dissipation 
.............•.......... 
2.5 Watt 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test 
Conditions 


Vil 
Input 
Low 
Voltage 
-0.5 
+0.8 
V 


V1H 
Input 
High 
Voltage 
2.0 
Vcc+ 
1.0 
V 


VOL 
Output 
Low 
Voltage 
0.45 
V 
IOl=2.0 
mA 


VOH 
Output 
High 
Voltage 
2.4 
V 
10H = 
-400,..A 


Icc 
Power 
Supply 
Current 
350 
mA 
TA=25·C 


III 
Input 
Leakage 
Current(1) 
± 10 
~ 
OV < VIN < VCC 


ILO 
Output 
Leakage 
Current 
± 10 
~ 
0.45V 
.;; VOUT .;; VCC 


VCl 
Clock 
Input 
Low 
Voltage 
-0.5 
+0.6 
V 


VCH 
Clock 
Input 
High 
Voltage 
3.9 
Vcc + 1.0 
V 


Capacitance 
of Input 
Buffer 
CIN 
(All 
input 
except 
15 
pF 
fc 
= 
1 MHz 
ADo- 
AD15• RQ/Gfj 


CIO 
Capacitance 
of I/O Buffer 
15 
pF 
fc 
= 
1 MHz 
(ADo- 
AD15• RQ/GT) 


A.C. CHARACTERISTICS 
(TA = o·C to lO·C. Vcc 
= 5V ±10%) 


8089/8086 
MAX 
MODE 
SYSTEM 
(USING 
8288 
BUS CONTROLLER) 
TIMING 
REQUIREMENTS 


Symbol 
Perameter 
Min. 
Max. 
Units 
Tost Condition. 


TClCl 
ClK Cycle Period 
200 
500 
ns 


TClCH 
ClK low Time 
('hTClCl) - 15 
ns 


TCHCl 
ClK High Time 
('13 TClCl) + 2 
ns 


TCH1CH2 
eLK 
Rise Time 
': 
. 
• 
1\ 
10 
ns 
From 1.0Vt03.5V 


TCL2Cl1 
ClK Fall Time 
10 
ns 
From3.5Vto 1.0V 


TDVCl 
Data In Setup 
Time 
30 
ns 


TClOX 
Data In Hold Time 
10 
ns 


TR1VCl 
ROYSetup Time into 8284(See Notes 1,2) 
35 
ns 


TClR1X 
ROYHold Time into 8284(See Notes 1, 2) 
0 
ns 


TRYHCH 
READYSetup Time into 8089 
('hTClCl)- 
15 
ns 


TCHRYX 
READYHold Time into 8089 
30 
ns 


TRYlCl 
READYInaclive to ClK (SeeNole 4) 
-8 
ns 


TINVCH 
Setup Time Recognition (ORO1.2 RESET.Ext 1,2)(See Nole 2) 
30 
ns 


TGVCH 
RO/GT Setup Time 
30 
ns 


TCAHCAl 
CA Width 
95 
ns 


TSlVCAl 
SEl Setup Time 
75 
ns 


TCAlSLX 
SEL Hold Time 
0 
ns 


TCHGX 
GT Hold Time into 8089 
40 
ns 


TlllH 
Input RiseTime (Except ClK) 
20 
ns 
From0.8Vto 2.0V 


TIHll 
Input FallTime (ExceptClK) 
12 
ns 
From2.0Vto 0.8V 


inter 


Symbol 
Parameter 
. 
Min. 
Max. 
Units 
Test Conditions 


TCLML 
Command 
Active Delay (See Note 1) 
10 
35 
ns 
CL=80 
pF 


TCLMH 
Command 
Inactive Delay (See Note 1) 
10 
35 
ns 


TRYHSH 
READY Active 
to Status 
Passive 
(See Note 3) 
110 
ns 


TCHSV 
Status Active Delay 
10 
110 
ns 


TCLSH 
Status Inactive Delay 
10 
130 
ns 


TCLAV 
Address Valid Delay 
, 
10 
110 
ns 


TCLAX 
Address Hold Time 
10 
ns 


TCLAZ 
Address Float Delay 
TCLAX 
80 
ns 


TSVLH 
Status 
Valid 
to ALE High (See Note 
1) 
15 
ns 


TCLLH 
CLK Low to ALE Valid (See Note 
1) 
15 
ns 
CL= 
150 pF 


TCHLL 
ALE Inactive 
Delay (See Note 1) 
15 
ns 


TCLDV 
Data Valid Delay 
10 
110 
ns 


TCHDX 
Data Hold Time 
10 
ns 


TCVNV 
Control Active Delay (See Note 1) 
5 
45 
ns 


TCVNX 
Control Inactive Delay (See Note 1) 
10 
45 
ns 


TCHDTL 
Direction 
Control Active Delay (See Note 1) 
50 
ns 


TCHDTH 
Direction 
Control Inactive Delay (See Note 1) 
30 
ns 


TCLGL 
AQ Active Delay 
0 
85 
ns 
CL= 
100 pF 


TCLGH 
RQ Inactive Delay 
85 
ns 
Note 5: CL ~ 30 pF 


TCLSRV 
SINTR Valid 
Delay 
150 
ns 
CL= 
100 pF 


TOLOH 
Output Rise Time 
20 
ns 
From 0.8V to 2.0V 


TOHOL 
Output 
Fall Time 
12 
ns 
From 2.0V to 0.8V 


NOTES: 
1. Signal 
at 8284 or 8288 
shown 
tor reference 
only. 
2. Setup 
requirement 
for asynchronous 
signal 
only to guarantee 
recognition 
at next elK. 


3. Aplies 
only to T3 and TW states. 
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(8288 OUT PUn 


see NOTE 
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ROY (8284 INPUT) 


SEE NOTE 7 J 
AND ABOVE 
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WRITE - 
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SEE NOTE 7 ( 
AND ABOVE 
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(BIll) 
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WAVEFORMS (Continued) 


EXTERNAL 
TERMINATE 
SETUP 


EXT1,2: 
------' 


inter 


POINTER INSTRUCTIONS 
OPCODE 


7 
o 7 
0 


LPD 
P,M 
Load Pointer PPP from Addressed 
Location 
P P P 0 
o A A 1 1 000 
1 OMM 


LPDI 
P,I 
Load Pointer PPP Immediate 4 Bytes 
P P P 1 
000 
1 o 0 0 0 
1 000 


MOVP M,P 
Store Contents 
of Pointer PPP in Addressed 
Location 
P P P 0 
o A A 1 1 0 0 1 
1 OMM 


MOVP P,M 
Restore Pointer 
P P P 0 
o A A 1 1 0 0 0 
1 1 MM 


MOVE DATA 
OPCODE 


MOV 
M,M 
Move from Source to Destination 
Source- 
000 
0 
OAAW 
1 o 0 1 
OOMM 
Destination- 
000 
0 
OAAW 
1 100 
11M 
M 


MOV 
R,M 
Load Register RRR from Addressed 
Location 
R R R 0 
OAAW 
1 000 
OOMM 


MOV 
M,R 
Store Contents of Register RRR in Addressed Location 
R R R 0 
OAAW 
1 000 
o 1 M M 


MOVI 
R 
Load Register RRR Immediate (Byte) Sign Extend 
R R R 
wb 
OOW 
o 0 1 1 
o 0 0 0 


MOVI 
M 
Move Immediate to Addressed 
Location 
000 
wb 
AAW 
o 1 0 0 
11M 
M 


CALLS 
OPCODE 


7 
07 
0 


"CALL 
Call Unconditional 
11 0 0 
dd 
AAwI100 
1 
1 1 M M I 


JUMP 
OPCODE 


JMP 
Unconditional 
100 
dd 
OOW 
o 0 1 0 
o 000 


JZ 
M 
Jump on Zero Memory 
000 
dd 
AAW 
1 1 1 0 
o 1 M M 
JZ 
R 
Jump on Zero Register 
R R R 
dd 
000 
o 1 o 0 
o 1 0 0 
JNZ 
M 
Jump on Non-Zero Memory 
000 
dd 
AAW 
1 1 1 0 
OOMM 
JNZ 
R 
Jump on Non-Zero Register 
R R R 
dd 
000 
o 1 o 0 
o 0 0 0 
JBT 
Test Bit and Jump if True 
B B B 
dd 
A A 0 
1 0 1 1 
1 1 MM 
JNBT 
Test Bit and Jump if Not True 
B. B B 
dd 
A A 0 
1 0 1 1 
10M 
M 
JMCE 
Mask/Compare and Jump on Equal 
000 
dd 
A A 0 
1 0 1 1 
OOMM 
JMCNE 
Mask/Compare and Jump on Non-Equal 
000 
dd 
A A 0 
1 0 1 1 
o 1 M M 


INCREMENT, DECREMENT 
OPCODE 
7 
07 
0 


"ADDI 
M,I 
ADD Immediate to Memory 
o 0 0 0 
OAAW 
1 1 1 0 
1 OMM 
"ADDI 
R,I 
ADD Immediate to Register 
R R R 0 
000 
0 o 0 1 1 
1 000 
tADD 
M,R 
ADD Register to Memory 
o 0 0 0 
o A AW 
1 1 1 0 
1 1MM 
tADD 
R,M 
ADD Memory to Register 
R R R 0 
o 0 0 0 o 0 1 1 
1 100 


inter 


ADD 


c" 
OPCODE 
,~ 
r:, 


7 
o 7 
0 
ADDI 
M,I 
ADD Immediate to Memory 
o 0 0 
wb 
AAW1 
100 
OOMM 
ADD! 
R,I 
ADD Immediate to Register 
R R R 
wb 
OOW 
001 
0 
0, 0 0 0 
ADD 
M,R 
ADD Register to Memory 
R R R 0 
OAAW1 
101 
OOMM 
ADD 
R,M 
ADD Memory to Register 
R R R 0 
o A AW 
1 0 1 0 
OOMM 


AND 
' 
i 
I" 
OPCODE 


ANDI 
M,I 
AND Memory with Immediate 
000 
wb 
AAW 
1 100 
1 OMM 
ANDI 
R,I 
AND Register with Immediate 
R R R 
wb 
OOW 
001 
0 
1 000 
AND 
M,R 
AND Memory with Register 
R R R 0 
OAAW 
1 1 0 1 
1 OMM 
AND 
R,M 
AND Register with Memory 
R R R 0 
OAAW 
101 
0 
1 OMM 


OR 
OPCODE 


ORI 
M,I 
OR Memory with Immediate 
000 
wb 
AAW1 
100 
0 1MM 
ORI 
R,I 
OR Register with Immediate 
R R R 
wb 
AAW 
001 
0 
0 1 0 0 
OR 
M,R 
OR Memory with Register 
R R R 0 
OAAW 
~ 1 0 1 
0 1MM 
OR 
R,M 
OR Register with Memory 
R R R 0 
OAAW 
101 
0 
0 1MM 


NOT 
. 
- 
OPCODE 
--" 
. 


NOT 
R 
Complement 
Register 
R R R 0 
o 0 0 0 
001 
0 
1 100 
NOT 
M 
Complement 
Memory 
o 0 0 0 
OAAW 
1 101 
1 1 MM 
NOT 
R,M 
Complement 
Memory, Place in Register 
R R R 0 
OAAW 
1" 0 1 0 
1 1 MM 


":,1 


BitManipulationand Test Instructions 


BIT MANIPULATION 
OPCODE 


7 
07 
0 


SET 
Set the Selected Bit 
B B B 0 
OAAO 
1 
o 1 M M 
CLR 
Clear the Selected Bit 
B B B 0 
o A A 0 
1 
10M 
M 


TEST 
OPCODE 


TSL 
Test and Set Lock 
10 0 0 
1 A A 0 11 o 0 1 
0 1MMI 


Control 
OPCODE 
'} 


7 
07 . 
0 
HLT 
Halt Channel Execution 
o 0 1 0 
o 0 0 0 
o 1 0 0 
1 000 
SINTR 
Set Interrupt 
Service Flip Flop 
o 1 o 0 
o 000 
o 0 0 0 
o 0 0 0 
NOP 
No Operation 
o 0 o 0 
o 000 
o 0 0 0 
o 0 0 0 
XFER 
Enter DMA Transfer 
o 1 1 0 
o 0 0 0 
o 0 0 0 
o 0 0 0 
WID 
Set Source, Destination 
Bus Width; S,D 0 = 8, 1= 16 
1 S 0 0 
o 0 0 0 
o 0 0 0 
000 
0 
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"n field in call instruction 
can be 00, 01,10 only. 


··OPCODE 
is second byte fetched. 


All Instructions 
consist 
of at least 2 bytes, while some 


Instructions 
may use up to 3 additional 
bytes to specify 


literals 
and displacement 
data. The definition 
of the 


various fields within each Instruction 
is given below: 


7 
0 
7 


~I--o-PC-O-OE--B 
PPPBBB 


00 
GA 
01 GB 
10 
GC 
11 
PP 


RRR Register Field 


The RRR field specifies 
a 16·bit register 
to be used in 


the instruction. 
if GA, GB, GC or TP, are referenced 
by 


the RRR field, the upper 4 bits of the registers are load· 
ed with the sign bit (Bit 15). PPP registers 
are used as 


20·blt address pointers. 


RRR 
. 


000 
rO 
GA 
001 
r1 
GB 
010 
r2 
GC 
011 
r3 
BC 
; byte count 
100 
r4 
TP 
; task block 
101 
r5 
IX 
; index register 


110 
r6 
CC 
; channel control 
(mode) 


111 
r7 
MC 
; mask/compare 


ppp 


000 
pO GA 
001 
p1 GB 
010 
p2 GC 


100 
p4 
TP 
; task block pointer 


BBB Bit Select Field 


The bit select field replaces the RRR field In bit manlpu· 
lation instructions 
and is used to select a bit to be oper· 


ated on by those instructions. 
Bit 0 is the least slgnlfi· 


cant bit. 


wb 


01 
1 byte literal 
10 2 byte (word) literal 
dd 


01 1 byte displacement 
10 2 byte (word) displacement. 


AA Field 


00 The selected pointer contains 
the operand address. 


01 The operand address is formed 
by adding an a·blt, 


unsigned, 
offset 
contained 
in the instruction 
to the 


selected pointer. The contents 
of the pointer are un· 


changed. 


10 The operand address is formed by adding the con· 


tents of the Index register 
to the selected 
pointer. 


Both registers 
remain unchanged. 


11 Same as 10 except the Index register 
is post auto· 
incremented 
(by 1 for a·bit transfer, 
by 2 for 16·blt 
transfer). 


W Width Field 
o 
The selected operand Is 1 byte long. 


1 
The selected operand is 2 bytes long. 


Additional 
Bytes 


OFFSET: 
a·bit unsigned offset. 


SDISP 
: a/16·bit signed displacement. 


LITERAL: 
8/16-bit literal. (32 bits for LDPIl. 


The order in which the above optional bytes appear in lOP 
instructions 
is given below: 


Offsets 
are treated as unsigned 
numbers. 
Literals 
and 


displacements 
are sign extended (2's complement). 
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8259A/ 8259A-2/ 8259A-8 
PROGRAMMABLE 
INTERRUPT 
CONTROLLER 


• IAPX 86, IAPX 88 Compatible 


• MCS-80®, MCS-85® Compatible 


• Eight-Level Priority Controller 


• Expandable to 64 Levels 


• Programmable Interrupt Modes 


• Individual Request Mask Capability 


• Single + 5V Supply (No Clocks) 


• 28-Pin Dual-In-Line Package 


The Intel'" 8259A Programmable Interrupt Controller handles up to eight vectored priority interrupts for the CPU. It is 
cascadable for up to 64 vectored priority interrupts without additional circuitry. It is packaged in a 28-pin DIP, uses 
NMOS technology and requires a single + 5V supply. Circuitry is static, requiring no clock input. 


The 8259A is designed to minimize the software and real time overhead in handling multi-level priority interrupts. It has 
several modes, permitting optimization for a variety of system requirements. 


The 8259A is fully upward compatible with the Inte'''' 8259. Software originally written for the 8259 will operate the 
8259A in all 8259 equivalent modes (MCS-80/85,Non-Buffered, Edge Triggered). 


DATA 
CONTROL 
lOGIC 
0,-00 
BUS 


BUFFER 


Cs 


WR 


AD 
INTA 


0, 
IR7 


RO 
0. 
IR6 


WR 
C\ 
0, 


OJ 
0, 
IR2 
cs 
,.7 
0, 
IR1 


°0 
CASO 


CASO 
CAS 1 


GNO 


CAS 1 


CAS 2 


SP/EN 
~INTERNAl 
BUS 


Symbol 
Pin No. 
Type 
Name and Function 
-, 
l 


VCC 
26 
I 
Supply: 
+5V Supply. 
. . 


GND 
14 
I 
Ground. 


CS 
1 
I 
Chip Select: 
A Iowan this pin enables RD and WR communication 
between the CPU and the 6259A. 


INTA functions are independent of CS. 


WR 
2 
0 
Write: A Iowan this pin when CSis low enables the 6259Ato accept command words from the CPU. 


RD 
3 
I 
Read: A Iowan this pin when CS is low enables the 6259Ato release status onto the data bus for the 
CPU. 


Dr-Do 
4-11 
I/O 
Bidirectional 
Data Bu.: 
Control, status and interrupt-vector 
information is transferred via this bus. 


CASo-CAS2 
12,13,15 
I/O 
Cascade Line.: 
The CAS lines form a private 6259A bus to control a multiple 6259Astructure. These 


pins are outputs for a master 6259A and inputs for a slave 6259A. 


SP/EN 
16 
I/O 
Slave Program/Enable Buffer: 
This is a dual function pin. When in the Buffered Mode it can be used 


as an output to control buffer transceivers (EN). When not in the buffered mode it is used as an input 
to designate a master (SP = 1) or slave (SP = 0). 


INT 
17 
0 
Interrupt: 
This pin goes high whenever a valid interrupt request is asserted. It is used to interrupt the 


CPU, thus it is connected to the CPU's interrupt pin. 


1Ro-IR7 
16-25 
I 
Interrupt Reque.t.: 
Asynchronous inputs. An interrupt request is executed by raising an IR input 


(low to high), and holding it high until it is acknowledged (Edge Triggered Mode), or just by a high 
level on an IR input (Level Triggered Mode). 


INTA 
26 
I 
Interrupt Acknowledge: 
This pin is used to enable 6259A interrupt-vector data onto the data bus by 


a sequence of interrupt acknowledge pulses issued by the CPU. 


Ao 
27 
I 
AO Addre •• Line: This pin acts in conjunction with the CS,WR, and RD pins. It is used by the 6259A 
to decipher various Command Words the CPUwrites and status the CPUwishes to read. It is typically 
connected to the CPU AOaddress line (A1 for iAPX 66, 68). 


Interrupts in Microcomputer 
Systems 


Microcomputer system design requires that I/O devices 
such as keyboards, displays, sensors and other com· 
ponents receive servicing In an efficient manner so that 
large amounts of the total system tasks can be assumed 
by the microcomputer with little or no effect on through- 
put. 


The most common method of servicing such devices I!" 
the Polled approach. This is where the processor mus. 
test each device in sequence and in effect "ask" each 
one if it needs servicing. It is easy to see that a large por- 
tion of the main program Is looping through this con· 
tinuous 
polling cycle and that such a method would 
have a serious, detrimental effect on system through- 
put, thus limiting the tasks that could be assumed by 
the microcomputer and reducing the cost effectiveness 
of using such devices. 


A more desirable method would be one that would allow 
the microprocessor to be executing its main program 
and only stop to service peripheral devices when it Is 
told to do so by the device itself. In effect, the method 
would 
provide an external 
asynchronous 
input that 
would inform the processor that it should complete 
whatever instruction 
that is currently being executed 
and fetch a new routine that will service the requesting 
device. Once this servicing is complete, however, the 
processor would resume exactly where it left off. 


This me~hod is called Interrupt. It is easy to see that 
system throughput would drastically increase, and thus 
more tasks could be assumed by the microcomputer to 
further enhance its cost effectiveness. 


The Programmable Interrupt Controller (PIC) functions 
as an overall manager in an Interrupt·Driven system 
environment. It accepts requests from the peripheral 
equipment, determines which of the incoming requests 
is 
of 
the 
highest 
importance 
(priority), 
ascertains 
whether the incoming request has a higher priority value 
than the level currently being serviced, and issues an 
interrupt to the CPU based on this determination. 
Each peripheral device or structure usually has a special 
program or "routine" that is associated with its specific 
functional or operational requirements; this is referred 
to as a "service routine". The PIC,after issuing an Inter· 
rupt to the CPU, must somehow input information into 
the CPU that can "point" 
the Program Counter to the 
service routine associated with the requesting device. 
This "pointer" 
is an address in a vectoring table and will 
often be referred to, in this document, as vectoring data. 


The 8259A 


The 8259A is a device specifically designed for use in 
real time, interrupt driven microcomputer 
systems. It 
manages eight levels or requests and has bullt·ln fea· 
tures for expandability to other 8259A's (up to 64 levels). 
It is programmed by the system's software as an I/O 
peripheral. A selection of priority modes is available to 
the programmer so that the manner in which the reo 
quests are processed by the 8259A can be configured to 


be changed or reconfigured dynamically at any time dur- 
ing the main program. This means that the complete 
interrupt structure can be defined as required, based on 
the total system environment. 


CP\J·DRIVEN 
MULTIPLEXOR 
CPU 
---- --y 
(0 


A 
RAM 
1/011) 
- 


. 
ROM 
1/0(2) - 
--y 


r---, 
"I 
I 
A 
1/0lNI 
...- 
~I 
I 
L___ ...J 


v 
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INTERRUPT 
REQUEST 
REGISTER 
(IRR) AND 
IN·SERVICE 
REGISTER 
(ISR) 


The interrupts at the IR input lines are handled by two 
registers 
in cascade, the Interrupt 
Request Register 
(IRR) and the In-Service Register (ISR).The IRR is used 
to store all the interrupt levels which are requesting ser- 
vice; and the ISR is used to store all the interrupt levels 
which are being serviced. 


PRIORITY 
RESOLVER 


This logic block determines the priorities of the bits set 
in the IRR. The highest priority is selected and strobed 
into the corresponding bit of the ISRduring INTA pulse. 


INTERRUPT 
MASK 
REGISTER 
(IMR) 


The IMR stores the bits which mask the interrupt lines 
to be masked. The IMR operates on the IRR. Masking of 
a higher 
priority 
input 
will 
not affect 
the interrupt 
request lines of lower priority. 


INT (INTERRUPT) 


This output goes directly to the CPU interrupt input. The 
VOH level on this line is designed to be fully compatible 
with the 8080A, 8OS5Aand 8086 input levels. 


INTA 
(INTERRUPT 
ACKNOWLEDGE) 


INTA pulses will cause the 8259A to release vectoring 
information onto the data bus. The format of this data 
depends on the system mode (I'PM) of the 8259A. 


DATA 
BUS 
BUFFER 


This 3-state, bidirectional 8-bit buffer is used to inter- 
face the 8259A to the system Data Bus. Control words 
and status information are transferred through the Data 
Bus Buffer. 


READIWRITE 
CONTROL 
LOGIC 


The function 
of this block is to accept OUTput com- 
mands from the CPU. It contains the Initialization Com- 
mand Word (ICW) registers and Operation Command 
Word (OCW) registers which store the various control 
formats for device operation. This function block also 
allows the status of the 8259A to be transferred onto the 
Data Bus. 


CS (CHIP 
SELECT) 


A LOW on this input enables the 8259A. No reading or 
writing 
of the chip will 
occur 
unless the device is 
selected. 


WR (WRITE) 


'A LOW on this input enables the CPU to write control 
words (ICWs and OCWs) to the 8259A. 


RD (READ) 


A LOW on this input enables the 8259A to send the 
status of the Interrupt Request Register (IRR),In Service 
Register (ISR),the Interrupt Mask Register (IMR), or the 
Interrupt level onto the Data Bus. 


:~ 
... 
••'n 
'QG'" 
'o- 


ff 


Ao 
This input signal is used in conjunction with WR and RD 
signals to write commands into the various command 
registers, as well as reading It,,') various status registers 
of the chip. This line can be tied directly to one of the ad- 
dress lines. 


THE 
CASCADE 
BUFFER/COMPARATOR 


This function block stores and compares the IDs of all 
8259A's used in the system. The associated three I/O 
pins (CASO-2)are outputs when the 8259A is used as a 
master and are inputs when the 8259A is used as a 
slave. As a master, the 8259A sends the 10 of the inter· 
rupting slave device onto the CASO-2 lines. The slave 
thus selected will send its preprogrammed subroutine 
address onto the Data Bus during the next one or two 
consecutive INTA pulses. (See section "Cascading the 
8259A".) 


INTERRUPT 
SEQUENCE 


The powerful features of the 8259A in a microcomputer 
system are its programmability and the interrupt routine 
addressing capability. The latter allows direct or indirect 
jumping 
to the specific 
interrupt 
routine 
requested 
without any polling of the interrupting devices. The nor· 
mal sequence of events during an interrupt depends on 
the type of CPU being used. 


The events occur as follows in an MCS·80/85system: 


1. One or more of the 
INTERRUPT REQUEST lines 


(IR7-0) are raised high, setting the corresponding IRR 
bit(s). 


2. The 8259A evaluates these requests, and sends an 
INT to the CPU, if appropriate. 
3. The CPU acknowledges the INT and responds with an 
INTA pulse. 


4. Upon receiving an INTA from the CPU group. the 
highest priority ISR bit is set, and the corresponding 
IRR bit is reset. The 8259A will also release a CALL in· 
struction 
code (11001101) onto the 8·bit Data Bus 
through its 07-0 pins. 
5. This CALL instruction 
will initiate 
two more INTA 
pulses to be sent to the 8259A from the CPU group. 
6. These two INTA pulses allow the 8259A to release its 
preprogrammed 
subroutine 
address onto the Data 
Bus. The lower 8·bit address is released at the first 
INTA pulse and and the higher 8·bit address is reo 
leased at the second INTA pulse. 


7. This completes the 3·byte CALL instruction 
released 
by the 8259A. In the AEOI mode the ISR bit is reset at 
the end of the third INTA pulse. Otherwise, the ISR bit 
remains set until an appropriate 
EOI command is 
issued at the end of the Interrupt sequence. 


The events occurring in an iAPX 86 system are the same 
until step 4. 


4. Upon receiving an INTA from the CPUgroup, the high· 
est priority ISR bit is set and the corresponding IRR 
bit is reset. The 8259A does not drive the Data Bus 
during this cycle. 


5. The iAPX 86/10 will initiate 
a second INTA pulse. 


During this pulse, the 8259A releases an 8·bit pointer 
onto the Data Bus where it is read by the CPU. 


6. This completes the interrupt cycle. In the AEOI mode 


the ISR bit is reset at the end of the second INTA 
pulse. Otherwise, the ISR bit remains set until an 
appropriate EOI command is issued at the end of the 
interrupt subroutine. 


If no interrupt 
request is present at step 4 of either 
sequence (i.e., the request was too short in duration) the 
8259A will issue an interrupt level 7. Both the vectoring 
bytes and the CAS lines will look like an interrupt level 7 
was requested. 


I 
INTERRUPT 
REOUESTS 


Figure 5. 8259A Interface 
to Standard 
System 
Bus 
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INTERRUPTSEQUENCE OU~PUTS 


MCS-80illl, 
MCS-85i11l 


This sequence is timed by three INTA pulses. During the 
first iNfA pulse the CALL opcode is enabled onto the 
data bus. 


Content of First Interrupt 
Vector Byte 


07 
D6 
OS 
04 
03 
02 
01 
DO 


CALL CODE I 
1 
0 
0 
0 
1 I 


During the second INTA pulse the lower address of the 
appropriate service routine is enabled onto the data bus. 
When Interval = 4 bits As-A7 'are programmed, while Ao- 
A4 are automatically inserted by the 8259A. When Inter- 
val = 8 only A6 and A7 are programmed, while Ao-As are 
automatically inserted. 


Content of Second Interrupt 
Vector Byte 


IR 
Intenal.4 


07 
D6 
OS 
04 
03 
02 
01 
DO 


7 
A7 
A6 
AS 
1 
1 
1 
0 
0 


6 
A7 
A6 
AS 
1 
1 
0 
0 
0 


S 
A7 
A6 
AS 
1 
0 
1 
0 
0 


4 
A7 
A6 
AS 
1 
0 
0 
0 
0 


3 
A7 
A6 
AS 
0 
1 
1 
0 
0 
- 


2 
A7 
A6 
AS 
0 
1 
0 
0 
0 


1 
A7 
A6 
AS 
0 
0 
1 
0 
0 


0 
A7 
A6 
AS 
0 
0 
0 
0 
0 


IR 
Intervll =8 


07 
08 
OS 
04 
03 
02 
01 
DO 


7 
A7 
A6 
1 
1 
1 
0 
0 
0 


6 
A7 
A6 
1 
1 
0 
0 
0 
0 


S 
A7 
A6 
1 
0 
1 
0 
0 
0 
~ 
A7 
A6 
1 
0 
0 
0 
0 
0 
r-- 


3 
A7 
A6 
0 
1 
1 
0 
0 
0 
--- 


2 
A7 
A6 
0 
1 
0 
0 
0 
0 
------- 


1 
A7 
A6 
0 
0 
1 
0 
0 
0 


0 
A7 
A6 
0 
0 
0 
0 
0 
0 
------ 


During the third INTA pulse the higher address of the 
appropriate service routine, which was programmed as 
byte 
2 of 
the 
initialization 
sequence (A8 - A1s!, is 
enabled onto the bus. 


Content of Third Interrupt 
Vector Byte 


D6 
os 
04 
03 
02 


A14 
A13 
A12 
A11 
A10 


iAPX 86, iAPX 88 
iAPX86 mode is similar to MCS-80mode except that only 
two Interrupt Acknowledge cycles are issued by the pro- 
cessor and no CALL opcode is sent to the processor. The 
first interrupt 
acknowledge 
cycle is similar to that of 
MCS-80.85 systems in that the 8259A uses it to internally 
freezethe state of the interrupts for priority resolution and 
as a master it issues the interrupt code on the cascade 
lines at the end of the INTApulse. Onthis first cycle it does 


not issueany data to the processor and leavesits data bus 
buffers disabled. On the second interrupt acknowledge 
cycle in iAPX 86 mode the master (or slave if so pro- 
grammed) will send a byte of data to the processor with 
the acknowledged interrupt code composed as follows 
(note the state of the ADI mode control is ignored and 
As-A11 are unused in iAPX 86 mode): 


Content of Interrupt Vector Byte 
for IAPX 86 System Mode 


07 
06 
05 
04 
03 
02 
01 
00 


IR7 
T7 
T6 
T5 
T4 
T3 
1 
1 
1 


IR6 
T7 
T6 
T5 
T4 
T3 
1 
1 
0 


IR5 
T7 
T6 
T5 
T4 
T3 
1 
0 
1 


IR4 
T7 
T6 
T5 
T4 
T3 
1 
a 
0 


IR3 
T7 
T6 
T5 
T4 
T3 
a 
1 
1 


IR2 
T7 
T6 
T5 
T4 
T3 
a 
1 
a 


IR! 
T7 
T6 
T5 
T4 
T3 
0 
a 
1 


IRa 
T7 
T6 
T5 
T4 
T3 
a 
a 
a 


PROGRAMMING 
THE 8259A 
The 8259Aaccepts two types of command words gener- 
ated by the CPU: 


1. Initialization 
Command Words (lCWs): Before normal 
operation can begin. each 8259A in the system must 
be brought to a starting point - 
by a sequence of 2 to 
4 bytes timed 
by WR pulses. 


2. Opera/ion 
Command 
Words (OCWs): These are the 
command words which command the 8259A to oper- 
ate in various Interrupt modes. These modes are: 
a. Fully nested mode 
b. Rotating priority mode 
c. Special mask mode 
d. Polled mode 


The OCWs can be written into the 8259A anytime after 
initialization. 
' 


INITIALIZATION 
COMMAND WORDS 
(ICWS) 


GENERAL 


Whenever a command is issued with AO= 0 and D4= 1, 
this is interpreted as Initialization 
Command Word 1 
(ICW1). ICW1 starts the initialization 
sequence during 
which the following automatically occur. 


a. The edge sense circuit is reset. which means that fol- 
lowing initialization, an interrupt 
request (IR) input 
must make a low-to-high transition to generate an 
interrupt. 
b. The Interrupt Mask Register is cleared. 
c. IR7 input is assigned priority 7. 
d. The slave mode address is set to 7. 
e. Special MaskMode is cleared and Status Readis set to 
IRR. 


f. If IC4=O,then all functions selected in ICW4 are set to 
zero. (Non-Buffered mode', no Auto-EOI. MCS-80, 85 
system). 


inter 


INITIALIZATION COMMAND WORDS 1 AND 2 
(ICW1,ICW2) 


As-A,s: Page starting 
address of service routines. 
In an 
MCS 80/85 system, the 8 request levels will generate 
'CAlls 
to 8 locations equally spaced in memory. These 
can be programmed to be spaced at intervals of 4 or 8 
memory locations, 
thus the 8 routines will occupy a 
page of 32 or 64 bytes, respectively. 


The address format is 2 bytes long (Ao-A,s>. When the 
routine interval is 4, Ao-A4 are automatically inserted by 
the 8259A, while As-A,s are programmed externally. 
When the routine interval is 8, Ao-As are automatically 
inserted by the 8259A, while A6-A,s are programmed 
externally. 


The 8·byte interval will maintain compatibility 
with cur· 
rent software, while the 4·byte interval is best for a com· 
pact jump table. 


In an iAPX 86 system A15-A11 are inserted in the five most 
significant bits of the vectoring byte and the 8259A sets 
the three least significant bits according to the interrupt 
level. A10-A5 are ignored and ADI (Address interval) has 
no effect. 


lTlM: 
If lTIM = 1, then the 8259A will operate in the 
level interrupt mode. Edge detect logic on the 
interrupt inputs will be disabled. 


ADI: 
CAll 
address interval. ADI = 1 then Interval = 4; 
ADI = 0 then interval = 8. 


SNGl: 
Single. Means that this Is the only 8259A in the 
system. If SNGl= 
1 no ICW3 will be issued. 


IC4: 
If this bit Is set - 
ICW4 has to be read. If ICW4 
is not needed, set IC4= O. 


INITIALIZATION COMMAND WORD 3 (ICW3) 


This word is read only when there is more than one 
8259A in the system and cascading is used, in which 
case SNGl = O. It will load the 8·bit slave register. The 
functions of this register are: 


a. In the master mode (either when SP= 1,or in buffered 
mode when MIS= 1 in ICW4) a "1" is set for each 
slave in the system. The master then will release byte 
1 of the call sequence (for MCS·80/85 system) and 
will enable the corresponding slave to release bytes 2 
and 3 (for iAPX 86 only byte 2) through the cascade 
lines. 


b. In the slave mode (either when SP= 0, or if BUF = 1 
and MIS = 0 in ICW4) bits 2-0 identify the slave. The 
slave compares its cascade input with these bits and, 
if they are equal, bytes 2 and 3 of the call sequence (or 
just byte 2 for iAPX 86 are released by it on the Data 
Bus. 
INITIALIZATION COMMAND WORD 4 (ICW4) 


SFNM: If SFNM = 1 the special fully nested mode is 
programmed. 
BUF: 
If BUF = 1 the buffered mode is programmed. In 
buffered mode SP/EN becomes an enable outpul 
and the masterlslave determination Is by MIS. 


MIS: 
If buffered mode is selected: MIS = 1 means the 
8259A is programmed to be a master, MIS = 0 
means the 8259A is programmed to be a slave. If 
BUF = 0, MIS has no function. 
AEOI: If AEOI = 1 the automatic end of interrupt mode 
is programmed. 
"PM: 
Microprocessor mode: JLPM= 0 sets the 8259Afor 
MCS-80, 85 system operation, JLPM= 1 sets the 
8259A for iAPX 86 system operation. 


Figure 6. Initialization 
Sequence 
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1 
0 
- 
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- 
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o 
1 
J" 
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o 
1 
1 
0 
1 
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1 


• 
1 
• 
o 
0 
1 
1 
1 
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Y NESTED 
MODE 


o ""NOT SPECIAL FULL Y 


NESTED MODE 
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OPERATION 
COMMAND 
WORDS (OCWs) 


After the Initialization 
Command Words (ICWs) are pro· 


grammed 
Into the 8259A, the chip Is ready to accept 


Interrupt requests at Its Input lines. However, during the 
8259A operation, 
a selection 
of algorithms 
can com· 
mand the 8259A to operate In various modes through 
the Operation Command Words (OCWs). 


OCW1 


AO 
07 
De 
05 
D4 
03 
02 


~ 
I M7 
Me 
M5 
M4 
M3 
M2 


OCW2 
I 
R 
SL 
EOI 
0 


OCW3 
o 
ESMM 
SMM 
0 


OPERATION CONTROL WORD 1 (OCW1) 


OCW1 sets and clears the mask bits 
in the interrupt 


Mask Register (IMR). M7- 
Mo represent the eight mask 


bits. 
M = 1 
indicates 
the 
channel 
is 
masked 


(inhibited), 
M = 0 indicates the channel is enabled. 


OPERATIO,.. CONTROL WORD 2 (OCW2) 


R, SL, EOI - 
These three bits control 
the Rotate and 


End of Interrupt 
modes and combinations 
of the two. A 


chart of these combinations 
can be found on the Opera· 


tion Command Word Format. 


L2, L1, Lo- These bits determine the interrupt level acted 
upon when the SL bit is active. 


OPERATION C~NTROL \yORD 3 (OCW3) 


ESMM - 
Enable Special Mask Mode. When this bit is 


set to 1 it enables the SMM bit to set or reset the Special 
,Mask Mode. When ESMM"; 0 the SMM bit becomes a 
"don't care". 


SMM - 
Special Mask Mode. If ESMM = 1 and SMM = 1 


the 8259A will enter Special 
Mask Mode. If ESMM = 1 


and SMM = 0 the 8259A will revert to normal mask mode. 
When ESMM = 0, SMM. has no effect. 


"'LEVEL 
TO BE 
ACm>UOON 
o 
1 
2 
J 
•• 
~ 
6 
7 


o 
1 
0 
1 
0 
1 
0 
, 


o 
0 
1 
1 
0 
0 
••• 


NON-WIQptC 
IOICOMMAHD 


••• 
CM'tC8)t~ 


IlIOTATION~IOt~ 


N)1JU'I 
IN AUTOIIAT1C 
IOf 
MOOI: (SI1"I 
o 
0 
fIIOTAYI 
•• 
AU1'OIIIATlC 
IOf 
MOOI: (CUAR) 


1 
1 
"MmO'I 
ON INClAC 
lot COMMANO 


WT...uoItlTY 
COMMAND 
NO_ 


READ 
READ 
IR REG 
IS REG 
ON NE'I(T 
ON NEXT 


RO 
PULSE 
AD PULSE 
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FULLY NESTED MODE 
This mode is entered after initialization 
unless another 
mode 
is 
programmed. 
The 
interrupt 
requests 
are 
ordered in priority form a through 7 (0 highest). When an 
interrupt is acknowledged the highest priority request is 
determined and its vector placed on the bus. Additional· 
Iy, a bit of the Interrupt Service register (ISO·7) is set. 
This bit remains set until the microprocessor issues an 
End of Interrupt 
(EOI) command immediately 
before 
returning from the service routine, or if AEOI (Automatic 
End of Interrupt) bit is set, until the trailing edge of the 
last INTA. While the IS bit is set, all further interrupts of 
the same or lower priority are inhibited, while higher 
levels 
will 
generate 
an 
interrupt 
(which 
will 
be 
acknowledged only if the microprocessor internal Inter· 
rupt enable flip·flop has been re·enabled through soft· 
ware). 


After the Initialization 
sequence, IRO has the hignesl 
priority and IR7the lowest. Priorities can be changed, as 
will be explained, in the rotating priority mode. 


END OF INTERRUPT(EOI) 


The In Service (IS) bit can be reset either automatically 
following the trailing edge of the last in sequence INTA 
pulse (when P.EOIbit in ICW1 is set) or by a command 
word that must be issued to the 8259A before returning 
from a service routine (EOIcommand). An EOIcommand 
must be issued twice if in the Cascade mode, once for the 
master and once for the corresponding slave. 


There are two forms of EOIcommand: Specific and Non· 
Specific. When the 8259A is operated in modes which 
preserve the fully nested structure, 
it can determine 
which IS bit to reset on EOI. When a Non·Specific EOI 
command is issued the 8259A will automatically reset 
the highest IS bit of those that are set, since in the 
fully nested mode the highest IS level was necessarily the 
last level acknowledged and serviced. A non-specific EOI 
can be issued with OCW2 (EOI = 1, SL = 0, R = 0). 


When a mode is used which may disturb the fully nested 
structure, the 8259A may no longer be able to determine 
the last level acknowledged. In this case a Specific End of 
Interrupt must be issued which includes as part of the 
command the IS level to be reset. A specific EOIcan be is- 
sued with OCW2(EOI = 1,SL = 1, R = 0, and LO-L2 is the 
binary level of the IS bit to be reset). 


It should be noted that an IS bit that is masked by an 
IMR bit will not be cleared by a non·specific EOI if the 
8259A is in the Special Mask Mode. 


AUTOMATIC END OF INTERRUPT(AEOI) MODE 


If AEOI= 1 in ICW4,then the 8259A will operate in AEOI 
mode continuously until reprogrammed by ICW4. In this 
mode the 8259A will 
automatically 
perform a non· 
specific EOI operation at the trailing edge of the last 
interrupt acknowledge pulse (third pulse in MCS·80/85, 
second in iAPX 86). Note that from a system standpoint, 
this mode should be used only when a nested multilevel 
interrupt structure is not required within a single 8259A. 


The AEOI mode can only be used in a master 8259A and 
not a slave. 


AUTOMATIC 
ROTATION 
(Equal Priority 
Devices) 


In some applications there are a number of Interrupting 
devices of equal priority. In this mode a device, after 
being serviced, receives the lowest priority, so a device 
requesting an interrupt will have to wait, in the worst 
case until each of 7 other devices are serviced at most 
once. For example, if the priority and "in service" status 
is: 


157 
158 
lsa 
154 
153 
152 
IS1 
ISO 
1011101,101010101 


l_••t Prtorlty 
HI9M.'\.rtorIty 
cd 6 I 5 I 4 1 3 1 2 I 11'0 
1 


Aft.r 
Rotet. (IR4 was serviced, all other priorities 
rotated correspondingly) 


157 
158 
lsa 
154 
153 
IS2 
IS1 
ISO 


"IS" Status 
1 0 1 ' I 0 I 0 I 0 1 0 I 0 I 0 1 


High••• Prtority 
low •• t Prtority 


~o 
1 7FI1] 
4 
I 
3 
1 


There are two ways to accomplish Automatic Rotation 
using OCW2,the Rotation on Non-Specific EOICommand 
(R = 1, SL = 0, EOI = 1) and the Rotate in Automatic EOI 
Mode which is set by (R = 1, SL = 0, EOI = 0) and cleared 
by (R = 0, SL = 0, EOI = 0). 


SPECIFIC 
ROTATION 
(Specific 
Priority) 


The programmer can change priorities by programming 
the bottom priority and thus fixing all other priorities; 
i.e., if IR5 is programmed as the bottom priority device, 
then IR6 will have the highest one. 


The Set Priority 
command 
is issued in OCW2 where: 


R = 1,SL = 1; LO-L2 isthe binary priority level code of the 
bottom priority device. 


Observe thatin 
this mode internal status is updated by 
software control during OCW2.However,it is independent 
of the End of Interrupt (EOI) command (also executed by 
OCW2).Priority changes can be executed during an EOI 
command by using the Rotate on Specific EOI command 
in OCW2(R = 1, SL = 1, EOI = 1 and LO-L2 = IR level to 
receive bottom priority). 


INTERRUPTMASKS 


Each Interrupt Request input can be masked Indivldu· 
ally by the Interrupt Mask Register (IMR) programmed 
through OCW1. Each bit in the IMR masks one interrupt 
channel if It is set (1). Bit 0 masks IRO,Bit 1 masks IR1 
and so forth. Masking an IR channel does not affect the 
other channels operation. 


intel 


Some applications 
I)'lay require an interrupt 
service 
routine to dynamically alter the system priority struc· 
ture during its execution under software control. For 
example, the routine may wish to inhibit lower priority 
requests for a portion of its execution but enable some 
of them fo~ another portion. 


The difficulty 
here is that if an Interrupt Request is 
acknowledged and an End of Interrupt command did not 
reset its IS bit (i.e., while executing a service routine), 
the 
8259A would 
have inhibited 
all 
lower 
priority 
requests with no easy way for the routine to enable 
them 


That is where the Special Mask Mode comes In. In the 
special Mask Mode, when a mask bit Is set in OeW1, it 
inhibits further interrupts at that level and enables inter· 
rupts from a/l other levels (lower as well as higher) that 
are not masked. 


Thus, any interrupts 
may be selectively 
enabled by 
loading the mask register. 


The 
special 
Mask 
Mode 
is 
set 
by 
OeW3 
where: 


SSMM=1, 
SMM=1, 
and cleared 
where 
SSMM=1, 
SMM=O. 


lTlM liT 
o -=- EDGE 
1:: lEVel 


In this mode the INT output is not used or the micropro- 
cessor internal Interrupt Enable flip-flop is reset, disabling 
its interrupt 
input. Service to devices is achieved by 
software using a Poll command. 


The Poll command is issued by setting P= "1" in OeW3. 
The 8259A treats the next iffi pulse to the 8259A (I.e., 
RD= 0, CS = 0) as an interrupt acknowledge, sets the 
appropriate IS bit if there is a reque~ 
and~ads 
the 
priority level. Interrupt is frozen from WR to RD. 


The word enabled onto the data bus during ~ 
is: 


07 
De 
05 
04 
I 
I 


03 
02 
01 
00 


W2 
WI 
WO I 


WO-W2: Binary code of the highest 
priority 
level 
requesting service. 
I: Equal to a "1" if there is an interrupt. 


This mode is useful if there is a routine command com· 
mon to several levels so that the rnTA sequence is not 
needed (saves ROM space). Another application 
is to· 


use the poll mode to expand the number of priority 
levels to more than 64. 
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8-78 


READING THE 8259A STATUS 


The input status of several internal registers can be read to 
update the user information 
on the system. The following 
registers 
can be read via OCW3 (IRR and ISR or OCW1 
[IMR)). 


Interrupt Request Register (IRR): 8-bit registerwhich 
con- 
tains the levels requesting 
an interrupt 
to be acknowl- 
edged. The highest 
request level is reset from the IRR 
when an interrupt is acknowledged. 
(Not affected by IMR.) 


In-Service Register (ISR): 8-bit register which contains the 
priority levels that are being serviced. The ISR is updated 
when an End of Interrupt 
Command is issued. 


Interrupt Mask Register: 8-bit register which contains the 
interrupt 
request lines which are masked. 


The IRR can be read when, prior to the RD pulse, a Read 
Register Command is issued with OCW3 (RR = 1, RIS = 0.) 


The ISR can be read when, prior to the RD pulse, a Read 
Register Command is issued with OCW3 (RR = 1, RIS = 1). 


There is no need to write an OCW3 before every status 
read operation, 
as long as the status read corresponds 
with the previous 
one; i.e., the 8259A "remembers" 
whether the IRR or ISR has been previously 
selected by 
the OCW3. This is not true when poll is used. 


After initialization 
the 8259A is set to IRR. 


For reading the IMR, no OCW3 is needed. The output data 
bus will contain the IMR whenever RD is active and AO =1 
(OCW1). 


Polling overrides status read when P = 1, RR = 1 in OCW3. 


This mode is programmed 
using bit 3 in ICW1. 


If LTIM = '0', an interrupt request will be recognized by a 
low to high transition 
on an IR input. The IR input can re- 
main high without generating 
another interrupt. 


If LTIM = '1', an interrupt request will be recognized by a 
'high' level on IR Input. and there is no need for an edge 
detection. The interrupt 
request must be removed before 
the EOI command is issued or the CPU interrupt is enabled 
to prevent a second interrupt 
from occurring. 


The priority cell diagram shows a conceptual circuit of the 
level sensitive 
and edge sensitive 
input circuitry 
of the 
8259A. Be sure to note that the request latch is a transpar- 
ent D type latch. 


In both the edge and level triggered 
modes the IR inputs 
must remain high until after the falling 
edge of the first 
INTA. If the IR input goes low before this time a DEFAULT 
IR7 will occur when the CPU acknowledges the interrupt. 
This can be a useful safeguard 
for detecting 
interrupts 
caused by spurious noise glitches on the IR inputs. To im- 
plement this feature the IR7 routine is used for "clean up" 
simply executing 
a return instruction, 
thus ignoring 
the 
interrupt. If IR7 is needed for other purposes a default IR7 
can still be detected 
by reading 
the ISR. A normal 
IR7 
interrupt will set the corresponding 
ISR bit, a default IR7 
won't. If a default IR7 routine occurs during a normallR7 
routine, however, the ISR will remain set. In this case it is 
necessary to keep track of whether or not the IR7 routine 
was previously 
entered. 
If another 
IR7 occurs 
it is a 
default. 


---, 


cs 
AO 
00·7 
INTA 
INI 


CASO 
CASO 


8259A 
CAS 1 
82509A 
CAS 1 
SLAVE 
A 
SLAVE 8 


CAS 2 
CAS 2 


$PIEH7 
6 
5 • 
J 
2 
1 
0 l 


~IENl 
6 
5 


GND I I I I I I I I 


1 • 
5 • 
J 
2 
1 
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This mode will be used in the case of a I)ig system 
where cascading is used, and the priority has to be con- 
served within each slave. In this 'case the fully nested 
mode will be programmed to the master (using ICW4). 
This mode is similar to the normal nested mode with the 
following exceptions: 


a. When an interrupt request from a certain slave is in 
service this slave is not locked out from the master's 
priority 
logic 
and further 
interrupt 
requests 
from 
higher priority IR's within the slave will be recognized 
by the master and will initiate interrupts to the proc- 
essor. (In the normal nested mode a slave is masked 
out when Its request is In service and no higher 
requests from the same slave can be serviced.) 


b. When exiting the Interrupt Service routine the soft- 
ware has to check whether the interrupt serviced was 
the only one from that slave. This is done by sending 
a non-specific End of Interrupt (EOI)command to the 
slave and then reading its In-Service register and 
checking for zero. If it is empty, a non-specific EOI 
can be sent to the master too. If not, no EOI should be 
sent. 


BUFFERED MODE 


When the 8259A Is used in a large system where bus 
driving buffers are required on the data bus and the cas- 
cading mode Is used, there exists the problem of enabl- 
ing buffers. 
The buffered mode will structure the 8259A to send an 
enable signal on SP/EN to enable the buffers. In this 


mode, whenever the 8259A's data bus outputs are ena- 
bled, the SP/EN ou~put becomes active. 


This modificalion 
forces the use of software program- 


ming to determine whether the 8259A is a master or a 
slave. Bit 3 in ICW4 programs the buffered mode, and bit 
2 in ICW4 determines whether it is a master or a slave. 


The 8259Acan be easily interconnected in a system of one 
master with up to eight slaves to handle up to 64 priority 
levels. 


The master controls the slavesthrough the 3 line cascade 
bus. The cascade bus acts like chip selects to the slaves 
during the INTA sequence. 


In a cascade configuration, the slave interrupt outputs are 
connected to the master interrupt request inputs. When a 
slave request line is activated and afterwards acknowl- 
edged, the master will enable the corresponding slave to 
release the device routine address during bytes 2 and 3 of 
INTA. (Byte 2 only for 8086/8088). 


The cascade bus lines are normally low and will contain 
the slave address code from the trailing edge of the first 
INTA pulse to the trailing edge of the third pulse. Each 
8259A in the system must follow a separate initialization 
sequence and can be programmed to work in a different 
mode. An EOI command must be issued twice: once for 
the master and once for the corresponding 
slave. An 


address decoder is required to activate the Chip Select 
(CS) input of each 8259A. 


The cascade lines of the Master 8259A are activated only 
for slave inputs, non slave inputs leave the cascade line 
inactive (low). 


inter 


ABSOLUTE MAXIMUM 
RATINGS· 


Ambient Temperature Under Bias 
-40·C 
t085·C 
Storage Temperature 
-65·C 
to + 150·C 
Voltage on Any Pin 
with Respect to Ground 
-0.5V 
to + 7V 
Power Dissipation 
1Watt 


"NOnCE: 
Stresses above those listed 
under 
"Absolute 


Maximum 
Ratings" 
may cause permanent 
damage 
to the 


device. 
This is a stress rating 
only and functional 
opera- 
tion of the device 
at these or any other 
conditions 
above 


those indicated 
in the operational 
sections 
of this specifi- 
cation 
is not implied. 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Conditions 


VIL 
Input Low yoltage 
-0.5 
0.8 
V 


VIH 
Input High Voltage 
2.0 
Vee +0.5V 
V 


VOL 
Output High Voltage 
0.45 
V 
IOL = 2.2mA 


VOH 
Output High Voltage 
2.4 
V 
IOH - 
-400/LA 


Interrupt Output High 
3.5 
V 
IOH = -100/LA 


VOH(lNT) 
Voltage 
2.4 
V 
IOH - 
-400/LA 


Iu 
Input Load Current 
10 
/LA 
OV ,,;VIN ,,;Vee 


ILOL 
Output Leakage Current 
-10 
/LA 
0.45V ,,;VOUT";Vee 


lee 
Vee Supply Current 
85 
mA 


-300 
/LA 
VIN = 0 
IUR 
IR Input Load Current 
10 
/LA 
VIN = Vee 


Symbol 
Parameter 
Min. 
Typ. 
Max. 
Unit 
Teat 
Condltlona 


C'N 
Input Capacitance 
- 
10 
pF 
Ie = 1 MHz 


GIIO 
110 Capacitance 
20 
pF 
Unmeasured 
pins returned 
to Vss 


Symbol 
Parameter 
8259A·8 
8259A 
8259A-2 
Unlta 
Teat 
Condition 
a 


Min. 
Max. 
Min. 
Max. 
Min. 
Max. 


TAHRL 
AO/CS 
Setup to RD/INTAJ 
50 
0 
0 
ns 


TRHAX 
AO/CS 
Hold after 
RDIINTAj 
5 
0 
0 
ns 


TRLRH 
RD Pulse Width 
, 
420 
235 
160 
ns 


TAHWL 
AO/CS 
Setup to WRJ 
50 
_ 0 
0 
ns 


TWHAX 
AO/CS 
Hold after 
WRj 
20 
0 
0 
ns 


TWLWH 
WR Pulse Width 
400 
290 
190 
ns 


TDVWH 
Deta Setup to WRj 
300 
240 
160 
ns 
..:, 


TWHDX 
Data Hold after WRj 
40 
0 
0 
ns 


TJLJH 
Interrupt 
Request 
Width 
(Low) 
100 
100 
100 
ns 
See Note 1 


TCVIAL 
Cascade 
Setup to Second 
or Third 
55 
55 
40 
ns 
INTAj (Slave 
Only) 


TRHRL 
End of RD to Next Command 
160 
160 
160 
ns 


TWHRL 
End 01 WR to Next Command 
190 
190 
190 
ns 


inter 


Symbol 
Paramatar 
8259A·8 
8259A 
8259A·2 
Unll. 
T •• t Condition. 


Min. 
Mu. 
Min. 
Mu. 
Min. 
Max. 


TRLDV 
Data 
Valid 
from 
RDIINTAj 
300 
200 
120 
ns 
C 
of 
Data 
Bus 
- 


l00pF 


TRHDZ 
Data 
Float 
after 
RD/INTAI 
10 
200 
100 
85 
na 
C of Data 
Bua 


TJHIH 
Interrupt 
Output 
Delay 
400 
350 
300 
na 
Max text 
C - 
100 pF 
Min. teat 
C - 
15 pF 


TIALCV 
Cascade 
Valid 
from 
Firat 
INTAj 
565 
565 
360 
na 
C,NT = 100 pF 


(Maater 
Only) 


TRLEL 
Enable 
Active 
from 
RD j or INTAI 
160 
125 
100 
na 
CCASCAOE= 100 pF 


TRHEH 
Enable 
Inactive 
from 
RDI or INTAI 
325 
150 
150 
ns 


TAHDV 
Data 
Valid 
from 
Stable 
Address 
350 
200 
200 
ns 


TCVDV 
Cascade 
Valid 
to Valid 
Data 
300 
300 
200 
ns 


2. 


0.45=x::>TEST.oINTS <:.:;C 


Ci 
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inter 
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NOTES: 
Interrupt 
output 
must remain HIGH at least until leading edge of firstlNTA. 


1. Cycle 1 in iAPX 86, iAPX 88 systems, the Data Bus is not active. 


8282/8283 
OCTAL LATCH 


• 
Address Latch for iAPX 86, 88, 
MCS-80®, MCS-85®, MCS-48® Families 


• 
High Output Drive Capability for 
Driving System Data Bus 


• 
Fully Parallel 8·Bit Data Register and 
Buffer 


• 
Transparent during Active Strobe 
• 
No Output Low Noise when Entering 
or Leaving High Impedance State 


The 8282 and 8283 are 8-bit bipolar latches with 3-state output buffers. They can be used to implement latches, buffers 
or multiplexers. The 8283 inverts the input data at its outputs while the 8282 does not. Thus, all of the principal periph 
eral and input/output 
functions of a microcomputer 
system can be implemented with these devices_ 


~F 
00, 


~ 
l_______ 
-EJ 


Vcc 


0°0 
00, 


002 


0°3 
00. 


14 
005 


DOS 


12 
007 


5TB 


Vcc 


000 
00, 


0°2 


0°3 
00. 


14 
005 


13 
006 


DOT 


5TB 


Pin 
Description 


STB 
STROBE (Input). 
STB is an input 
control 
pulse used to strobe data at the data input 
pins 
(Ao-A7) 
into 
the 
data 
latches. 
This 
signal 
is active HIGH to admit 
input data. 
The data 
is latched 
at the HfGH to LOW 
transition 
of STB. 


OE 
OUTPUT ENABLE 
(Input). OE is an input 
control 
signal 
which 
when 
active 
LOW 
enables 
the 
contents 
of the data 
latches 
onto the data output 
pin (Bo-B7). OE being 
inactive 
HIGH forces the output 
buffers 
to 
their high impedance 
state. 
010-017 
DATA INPUT PINS (Input). Data presented 
at 
these 
pins 
satisfying 
setup 
time 
re- 
quirements 
when 
STB 
is 
strobed 
and 
latched 
into the data input latches. 


000-007 
DATA OUTPUT PINS (Output). When OE is 
(8282) 
true, 
the data 
in the data 
latches 
is pre· 
000'-007 
sented 
as inverted 
(8283) or non·inverted 
(8283) 
(8282) data onto the data output 
pins. 


The 8282 and 8283 octal 
latches 
are 8·bit 
latches 
with 
3-state 
output 
buffers. 
Data having 
satisfied 
the setup 
time 
requirements 
is latched 
into 
the 
data 
latches 
by 
strobing 
the STB line 
HIGH 
to LOW. 
Holding 
the STB 
line 
in its active 
HIGH state 
makes 
the latches 
appear 
transparent. 
Data is presented 
to the data output 
pins by 
activating 
the OE input 
line. When OE is inactive 
HIGH 
the 
output 
buffers 
are in their 
high 
impedance 
state. 
Enabling 
or disabling 
the output 
buffers 
will 
not cause 
negative·going 
transients 
to appear 
on the data output 
bus. 


"NOTICE: Stresses 
above those listed under "Absolute 
Maximum Ratings" may cause permanent damage to the 
device. This is a stress rating only and functional 
opera- 
tion of the device at these or any other conditions 
above 
those indicated in the operational sections of this specifi- 
cation is not implied. 
Exposure 
to absolute 
maximum 
rating conditions 
for extended periods may affect device 
reliability. 


Temperature Under Bias 
O·C to 70·C 
Storage Temperature 
-65·C 
to + 150·C 
All Output and Supply Voltages 
- 0.5V to + 7V 
All Input Voltages 
-1.0V 
to + 5.5V 
Power Dissipation 
1 Watt 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Conditions 


Ve 
Input Clamp 
Voltage 
-1 
V 
Ie = 
-5 
mA 


Ice 
Power Supply Current 
160 
mA 


IF 
Forward Input Current 
-0.2 
mA 
VF = 0.45V 


IA 
Reverse Input Current 
50 
fAA 
VA = 5.25V 


VOL 
Output Low Voltage 
.45 
V 
10L = 32 mA 


VOH 
Output High Voltage 
2.4 
V 
10H = 
-5 
mA 


10FF 
Output Otf Current 
± 50 
fAA 
VOFF= 0.45 to 5.25V 


V1L 
Input Low Voltage 
0.8 
V 
.Vee= 5.0V 
SeeNote1 


V1H 
Input High Voltage 
2.0 
V 
Vee= 5.0V 
See Note 1 


F = 1 MHz 
C1N 
Input Capacitance 
12 
pF 
VB1AS=2.5V, Vee= 5V 
TA= 25·C 


NOTE: 


1. OutputLoading10L= 32mA,10H = -SmA, eL= JOOpF. 


A.C. CHARACTERISTICS 
(Vee = 5V ±10%, TA = O°Cto 700C 
Loading: 
Outputs - 
IOl = 32 mA, IOH = -5 mA, Cl = 300 pF) 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Conditions 


TIVOV 
Input to Output Delay 
(See Note 1) 


-Inverting 
5 
22 
ns 
- Non-I nvert ing 
5 
30 
ns 


TSHOV 
STB to Output Delay 
-Inverting 
10 
40 
ns 
-Non-Inverting 
10 
45 
ns 


TEHOZ 
Output 
Disable Time 
5 
18 
ns 


TELOV 
Output 
Enable Time 
10 
30 
ns 


TIVSL 
Input to STB Setup Time 
0 
ns 


TSLIX 
Input to STB Hold Time 
25 
ns 


TSHSL 
STB High Time 
15 
ns 


TILlH, TOLOH 
Input, Output Rise Time 
20 
ns 
From O.SVtq 2.0V 


TIHIL, TOHOL 
Input. Output Fall Time 
12 
ns 
From 2.0V to O.SV 


NOTE: 
1. Seewaveformsand test load circuit on following page. 


I300 pF 


2.' ----v.S_ 
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18282/8283 
OCTAL LATCH 


INDUSTRIAL 


• 
Fully Parallel 8-Bit Data Register 
and 
Buffer 


• 
Transparent 
during 
Active 
Strobe 


• 
Address 
Latch for iAPX 86, 88, 
MCS-8Q®, MCS-85®, MCS-48® Families 


• High Output 
Drive Capability 
for 
Driving System 
Data Bus 


• 
3·State Outputs 


• 
20-Pin Package with 0.3" Center 


• 
No Output 
Low Noise when Entering 
or Leaving 
High Impedance 
State 


• 
Industrial 
Temperature 
Range: 
-40° to +85°C 


The 18282and 18283are 8-bit bipolar latches with 3·state output buffers. They can be used to implement 
latches, buf- 


fers, or multiplexers. 
The 18283inverts the input data at its outputs while the 18282does not. Thus, all of the principal 


peripheral and input/output 
functions 
of a microcomputer 
system can be implemented 
with these devices. 


I_ 
r-------., 
I 
I 


00 
+-8 
I 
I 
I 


I 
I 
l 
_ 


~ 


l------- 


0" 
L 
_ 


~ 


Figure 2. Pin 
Configurations 
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8284A 
CLOCK GENERATOR AND DRIVER FOR 
iAPX 86, 88 PROCESSORS 


• 
Generates 
the System clock 
for the 


IAPX 86,88 
Processors 


• 
Uses a Crystal or a TTL Signal for 
Frequency 
Source 


• 
Provides 
Local READY and Multlbus™ 
READY Synchronization 


• 
18·Pln Package 


• 
Single +SV Power Supply 


• 
Generates System 
Reset Output 
from 


Schmitt 
Trigger Input 


• 
Capable of Clock Synchronization 
with 


Other 8284As 


RES 
0 
D 


RESET 


Xl 
XTAl 


OSCillATOR 


X2 
OSC 


CSYNC 
1 
F/C 
PClK 
Xl 
+3 
PClK 


AENl 
X2 


EFI 
SYNC 
RDYl 
ASYNC 


CSYNC 
READY 
EFI 


RDY2 
F/C 
RDYl 
Am 
OSC 


ClK 
ClK 
RES 
AENl 


GND 
RESET 


RDY2 


CKt 


AEN2 
D 
Q 
READY 


FFl 


ASYNC 


Figure 2. 


8284A Pin Configuration 
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Symbol 
Type 
Name and Function 


AEN1, 
I 
Address 
Enable: 
AEN is an active 
lOW 
AEN2 
signal. AEN serves to quality its respective 
Bus Ready Signal 
(RDYl 
or RDY2). AENl 


",. 
validates RDYl while AEN2 validates RDY2. 


" 


Two AEN signal inputs are useful in system 
configurations 
which permit the processor to 
access two Multi-Master 
System Busses. In 
non Multi-Master 
configurations 
the AEN 
signal i~puts are tied true (lOW). 


RDY1, 
I 
Bus Ready: (Transfer Complete). ROY is an 
RDY2 
active HIGHsignal which is an indication from 
a device located on the system data bus that 
data has been received, or is available. RDYl 
is qualified by AENl while RDY2 is qualified 
by AEN2. 


ASYNC 
I 
Ready Synchronization 
Select: 
ASYNC is an 
ihput 
which 
defines 
the synchronization 
mode of the READY logic. When ASYNC is 


'. 
low, two stages of READYsynchronization are 


.' 
provided. When ASYNC is left open or HIGH a 
single 
stage of READY synchronization 
is 
provided. 


READY 
0 
Ready: 
READY is an active 
HIGH signal 
which is the synchronized 
ROY signal input. 


READY is cleared alter the guaranteed hold 
time to the processor has been met. 


Xl, X2 
I 
Crystal In: Xl and X2 are the pins to which a 
crystal is attached. The crystal frequency is 3 
times the desired processor clock frequency. 


F/C 
I 
Frequency/Crystal 
Select: 
F/C is a strapping 
option. When strapped .LOW,F/G permits the 
processor's clock to be generated by the crys- 
tal. When F/Gis strapped HIGH, ClK is gener- 
ated from the EFI input. 


EFI 
I 
External 
Frequency: 
When F/C is strapped 
HIGH, ClK 
is generated from the input fre- 
quency 
appearing 
on this 
pin. The input 
signal is a square wave 3 times the frequency 
of the desired ClK output. 


FUNCTIONAL 
DESCRIPTION 


General 


The 8284A is a single 
chip 
clock 
generator/driver 
for the 
iAPX 
86, 88 processors. 
The 
chip 
contains 
a crystal- 
controlled 
oscillator, 
a divide-by-three 
counter, 
com- 


plete 
MULTIBUSTM 
"Ready" 
synchronization 
and 
reset 
logic. 
Refer to Figure 
1 for Block 
Diagram 
and Figure 
2 
for Pin Configuration. 


The oscillator 
circuit 
of the 8284A is designed 
primarily 
for 
use with 
an external 
series 
resonant, 
fundamental 
mode, 
crystal 
from 
which 
the basic 
operating 
frequency 


is derived. 


Symbol 
Type 
Name anI! Function 


ClK 
0 
Processor 
Clock: 
ClK 
is the clock 
output 


, 
used by the processor and all devices which 


I' 


directly connect to the processor's local bus 
(I.e., the bipolar support chips and other MOS 


"' 
devices). ClK has an output frequency which 
is V3 olthe crystal or EFIinput frequency and a 


Va duty cycle. An output 
HIGH of 4.5 volts 
(Vcc= 5V) is provided on this pin to drive MOS 
devices. 


PClK 
0 
Perlpharal 
Clock: 
PClK.is 
a TIl 
level pe- 


ripherill clock signal whose output frequency 
,is V2 that of ClK and has a 50% duty cyde. 


OSC 
0 
Oscillator Output: 
OSC is the TIl 
level out- 
put of the internal oscillator circuitry. Its fre- 
quency is equal to that of the crystal. 


RES 
I 
Resetln: 
RES is an active lOW signal which 
is used 
to generate 
RESET. The 8284A 
provides a Schmitt trigger input so that an RC 
connection 
can be used to estilblish 
the 


,. 
power-up reset of proper duration. 


RESET 
0 
Reset: 
RESETis an active HIGHsignal which 
is used to reset the 8086 family processors. Its 
timing 
characteristics 
are determined 
by 
RES. 


CSYNC 
I 
Clock Synchronization: 
CSYNC is an active 
HIGH signal which allows multiple 8284As to 
be synchronized to provide clocks that are in 
phase. When CSYNC is HIGH the internal 
counters are reset. When CSYNC goes lOW 
the internal counters are allowed to resume 
counting. CSYNC needs to be externally syn- 
chronized to EFI. When using the internal os- 
cillator 
CSYNC should 
be hardwired 
to 
ground. 


GND 
Ground. 
Ii 
" 
' 


Vcc 
Power: 
+5V supply. 
c, 


The crystal 
frequency 
should 
be selected 
at three 
times 
the 
required 
CPU clock. 
Xl 
and 
X2 are the two 
crystal 
input 
crystal 
connections. 
For the most stable 
operation 
of the oscillator 
(OSC) output 
circuit, 
two series 
resistors 
(R1 = R2 = 5100) 
as shown 
in the waveform 
figures 
are 
recommended. 
The output 
of the oscillator 
is buffered 
and 
brought'out 
on OSC so that 
other 
system 
timing 
signals 
can be derived 
from this stable, crystal-controlled 
source. 


For systems 
which 
have a VCC ramp time"" 
lV/ms 
and/or 
have inherent 
board 
capacitance 
between 
Xl 
or X2, ex- 
ceeding 
10pF (not including 
8284A pin capacitance), 
the 
configuration 
in Figures 
4 and 6 is recommended. 
This 
circuit 
provides 
optimum 
stability 
forthe 
oscillator 
in such 
extreme 
conditions. 
It 
is advisable 
to 
limit 
stray 
ca- 
pacitances 
bless 
than 
10pF on Xl 
and X2 to minimize 
deviation 
from 
operating 
at the fundamental 
frequency. 
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Clock Generator 


The clock generator consists of a synchronous divide- 
by-three counter with a special clear input that inhibits 
the counting. This clear input (CSYNC) allows the out- 
put clock to be synchronized with an external event 
(such as another 8284A clock). It is necessary to syn- 
chronize the CSYNC input to the EFI clock external to 
the 8284A. This is accomplished with two Schottky flip- 
flops. The counter output is a 33% duty cycle clock at 
one-third the input frequency. 


The FIG input is a strapping pin that selects either the 
crystal oscillator or the EFI input as the clock for the .•.3 
counter. If the EFI input is selected as the clock source, 
the oscillator 
section can be used independently for 


another clock source. Output is taken from OSC. 


Clock Outputs 


The CLK output is a 33% duty cycle MaS clock driver 
designed to drive the iAPX 86, 88 processors directly. 
PCLK is a TTL level peripheral clock signal whose out- 
put frequency is V2 that of CLK. PCLK has a 50% duty 
cycle. 


Reset Logic 


The reset logic provides a Schmitt trigger input (RES) 
and a synchronizing 
flip-flop 
to generate the reset 


timing. The reset signal is synchronized to the falling 
edge of CLK. A simple RC network can be used to 
provide power-on reset by utilizing this function of the 
8284A. 


READY Synchronization 


Two READYinputs (RDY1,RDY2)are provided to accom- 
modate two Multi-Master system busses. Each input 
has a qualifier (AEN1 and AEN2, respectively). The AEN 
signals validate their respective ROYsignals. If a Multl- 


Master system is not being used the AEN pin should be 
tied LOW. 


Synchronization is required for all asynchronous active- 
going edges of either ROY input to guarantee that the 
ROYsetup and hold times are met. Inactive-going edges 
of ROY in normally ready systems do not require syn- 
chronization but must satisfy ROYsetup and hold as a 
matter ot" proper system design. 


The ASYNC input defines two modes of READY syn- 
chronization operation. 


When ASYNC is LOW, two stages of synchronization 
are prOVidedfor active READY input signals. Positive- 
going asynchronous READY inputs will first 
be syn- 


chronized to flip-flop 
one at the rising edge of CLK 


and then synchronized to flip-flop two at the next falling 
edge of CLK, after which time the READYoutput will go 
active (HIGH). Negative-going asynchronous READY in- 
puts will be synchronized directly to flip-flop two at the 
falling edge of CLK, after which time the READYoutput 
will go inactive.ihis 
mode of operation is intended for use 


by asynchronous (normally not ready) devices in the sys- 
tem which cannot be guaranteed by design to meet the 
required RDY setup timing, TR1VCL, 
on each bus cycle. 


When ASYNC is high or left open, the first READYflip- 
flop is bypassed in the READY synchronization 
logic. 


READY inputs are synchronized by flip-flop two on the 
falling edge of CLK before they are presented to the 
processor. 
This mode is available for synchronous 


devices that can be guaranteed to meet the required 
ROYsetup time. 


ASYNC can be changed on every bus cycle to select the 
appropriate mode of synchronization for each device in 
the system. 
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°NOTlCE: Stresses above those listed under "Absolute 
Maximum Ratings" may cause permanent damage to the 
device. This is a stress rating only and functional opera- 
tion of the device at these or any other conditions above 
those indicated in the operational sections of this specifi- 
cation is not implied. Exposure to absolute· maximum 
rating conditions for extended periods may affect device 
reliability. 


Temperature Under Bias 
O·C to 70·C 
Storage Temperature 
-65·C 
to + 150·C 
All Output and Supply Voltages 
-0.5V 
to + 7V 


All Input Voltages 
-1.0V to +5.5V 
Power Dissipation 
1 Watt 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Conditions 


IF 
Forward Input Current (ASYNC) 
-1.3 
mA 
VF= 0.45V 
Other Inputs 
-0.5 
mA 
VF= 0.45V 


IR 
Reverse Input Current (ASYNC) 
50 
!JA 
VR=VCC 
Other Inputs 
50 
!JA 
VR= 5.25V 


Vc 
Input Forward Clamp Voltage 
-1.0 
V 
Ic= -5mA 


Icc 
Power Supply Current 
162 
mA 


V1L 
Input lOW Voltage 
0.8 
V 


V1H 
Input HIGH Voltage 
2.0 
V 


V1HR 
Reset Input HIGH Voltage 
2.6 
V 


VOL 
Output lOW Voltage 
0.45 
V 
5mA 


VOH 
Output 
HIGH Voltage ClK 
4 
V 
-1mA 
Other Outputs 
2.4 
V 
-1mA 


V1HR- V1LR 
RES Input Hysteresis 
0.25 
V 


A.C. CHARACTERISTICS 
(TA=O·C to 70·C, Vcc=5V± 
10%) 


TIMING REQUIREMENTS 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Conditions 


tEHEL 
External Frequency HIGH Time 
13 
ns 
90%-90% 
V1N 


tELEH 
External Frequency lOW Time 
13 
ns 
10%-10% 
V1N 


tELEL 
EFI Period 
tEHEL+ tELEH+ d 
ns 
(Note 1) 


XTAl 
Frequency 
12 
30 
MHz 


tR1VCL 
RDY1, RDY2 Active Setup to ClK 
35 
ns 
ASYNC= HIGH 


tR1VCH 
RDY1, RDY2 Active Setup to ClK 
35 
ns 
ASYNC=lOW 


tR1VCL 
RDY1, RDY2 Inactive Setup to ClK 
35 
ns 


tCLR1X 
RDY1, RDY2 Hold to ClK 
0 
ns 


tAYVCL 
ASYNC Setup to ClK 
50 
ns 


tCLAYX 
ASYNC Hold to ClK 
0 
ns 


tA1VR1V 
AEN1, AEN2 Setup to RDY1, RDY2 
15 
ns 


tCLA1X 
AEN1, AEN2 Hold to ClK 
0 
ns 


tYHEH 
CSYNC Setup to EFI 
20 
ns 


tEHYL 
CSYNC Hold to EFI 
20 
ns 


tYHYL 
CSYNC Width 
2· tELEL 
ns 


tl1HCL 
RES Setup to ClK 
65 
ns 
(Note 2) 


tCLl1H 
RES Hold to ClK 
20 
ns 
(Note 2) 


tlLlH 
Input Rise Time 
20 
ns 
From 0.8V to 2.0V 


tlLlL 
Input Fall Time 
12 
ns 
From 2.0V to 0.8V 
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A.C. CHARACTERISTICS 
(Continued) 
TIMING 
RESPONSES 


Symbol 
Para mater 
Min. 
Mall. 
Unit. 
Te.t Condition. 


ICLCL 
ClK 
Cycle Period 
100 
ns 


ICHCL 
ClK 
HIGH Time 
('I3lcLcJ+2 
for ClK 
Freq. <;;8 MHz 
ns 
Fig. 7 & Fig. 8 


('I3lcLcJ+6 
for ClK 
Freq.=10 
MHz 


ICLCH 
ClK 
lOW 
Time 
(% ICLCJ-15 
for ClK 
Freq.<;;8 MHz 
ns 
Fig. 7 & Fig. 8 


(% ICLCJ-14 
for ClK 
Freq.=10 
MHz 


tCH,CH2 
ClK 
Rise or Fall Time 
10 
ns 
1.0V 10 3.5V 


ICL2CL1 
1.'_ 


tPHPL 
PClK 
HIGH Time 
tCLCL-20 
ns 


tPLPH 
PClK 
lOW Time 
ICLcL-20 
ns 


tRYLCL 
Ready Inaclive 10ClK (See Nole 4) 
-8 
ns 
Fig. 9 & Fig. 10 


tRYHCH 
Ready Active 10 ClK (See Nole 3) 
(% tCLcJ-15 
for CLK Freq.<;;8 MHz 
ns 
Fig. 9 & Fig. 10 


(% ICLCJ-14 
for ClK Freq.=10 
MHz 


ICLIL 
ClK 
10 Resel Delay 
40 
ns 


ICLPH 
ClK 
10 PClK 
HIGH DELAY 
22 
ns 


ICLPL 
ClK 
10 PClK 
lOW 
Delay 
22 
ns 


10LC" 
OsC 10 ClK 
HIGH Delay 
-5 
22 
ns 


IoLcL 
OsC 10 ClK 
lOW 
Delay 
2 
35 
ns 


IoLoH 
Oulpul 
Rise Time (excepl 
ClK) 
20 
ns 
From 0.8V to 2.0V 


10HOL 
Oulpul 
Fall Time (excepl 
ClK) 
, 
12 
ns 
From 2.0V 10 0.8V 


NOTES: 


1. 0= EFI rise (5ns max)+ EFI fall (5ns max). 


2. selup 
and hold necessary 
only 10 guaranlee 
recognition 
al nexl clock. 


3. Applies 
only 10 T3 and TW slales. 


4. Applies 
only 10 T2 slales. 


2.0 ~.5-TESTPOINTS_1.5~ 
M5---.l"\ 
~~ 


t 
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RDY2 
OSC 


Fie 


AEN2 


CSYNC 


FI~ 
AEN1 


RDY2 
AE/iI2 


CSYNC READY 


NOTES, 


1,CL=100pF 
2. CL=30pF 
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M8284 
CLOCK GENERATOR 
AND DRIVER 
FOR MILITARY 
iAPX 86 
MILITARY 


• Military 
Temperature 
Range: 
- 55°C to +125°C 


• Generates 
the System Clock for the 
M8086 


• Generates 
System Reset Output 
from Schmitt 
Trigger Input 


• Uses a Crystal or TTL Signal for 
Frequency 
Source 


• Provides 
Local Ready and 
MULTIBUS™ Ready Synchronization 


• Capable of Clock Synchronization 


with other M8284's 


The M8284 is a bipolar 
clock 
generator/driver 
designed 
to provide 
clock 
signals 
for the Military 
iAPX 86 and peripherals. 
It 
also contains 
READY logic 
for operation 
with 
two 
MUL TIBUST• 
systems 
and provides 
the processors 
required 
READY 


synchronization 
and timing. 
Reset logic 
with 
hysteresis 
and synchronization 
is also provided. 


CSYNC 
VCC 


X2 
osc 
PClK 
X1 


CLK 
AEN1 
X2 


RDY1 
N.C. 
,;e 


.F1 
READY 
EFI 


PCLK 
RDY2 
Fie 


CSYNC 
AEN2 
OSC 


ROYl 
c, 
ClK 
RES 


•••EN1 


READY 


AEN2 
GND 
RESET 
RDY2 


Figure 1. Block Diagram 
Figure 2. Pin Configuration 


Xli 
X21 
FIe 
EFI 


CSYNC 


ROYlj 
RDY2 ! 
A'EN11 
AEN21 


CONNECTIONS 
FOR 
CRYSTAL 


CLOCK 
SOURCE 
SELECT 


EXTERNAL 
CLOCK 
INPUT 


CLOCK 
SYNCHRONIZATION 
INPUT 


READY 
SIGNAL 
FROM 
TWO 
MUlTIBUSI~ 
SYSTEMS 


RES 


RESET 
ose 
eLK 


PClK 
READY 
Vee 
ONO 


RESET 
INPUT 


SYNCHRONIZED 
REseT 
OUTPUT 


OSCIllATOR 
OUTPUT 


MOS 
CLOCK 
108086 


TTL 
CLOCK 
FOR 
PERIPHERALS 


SYNCHRONIZED 
READY 
OUTPUT 


+5 
VOLTS 


o VOLTS 


CLOCK GENERATOR AND DRIVER 
FOR iAPX 86, 88 PROCESSORS 
INDUSTRIAL 


• Industrial 
Temperature 
Range: 
• 18·Pln Package 
- 40°C to +85°C 
• Generates 
System Reset Output 
from 
• Generates 
the System Clock for the 
Schmitt 
Trigger Input 
Industrial 
iAPX 86/10 
• Provides Local Ready and MULTIBUS™ 
• Uses a Crystal or a TIL 
Signal for 
Ready Synchronization 
Frequency 
Source 
• Capable of Clock Synchronization 
with 
• Single 
+5V 
Power Supply 
other 8284's 


The 18284 is a bipolar clock generator/drive:r designed to provide clock signals for the iAPX 86, 88 and peripherals. 
It 
also contains READY logic for operation with two MULTIBUS™ systems and provides the processors required READY 
synchronization 
and timing. Reset logic with hysteresis 
and synchronization 
is also provided. 


CSYNC 
VCC 


x, 


X2 
PClK 
X1 


eLK 
AEN1 
X2 


RDY1 
N.C. 


FIe 


EF! 
READY 
EFI 


RDY2 
FIe 


CSYNC 
AEN2 
OSC 


ROY1 
eK 
ClK 
RES 


AEN1 
Am 


READY 


RDY2 
GND 
RESET 


Figure 1. Block Diagram 
Figure 2. Pin Configuration 


X'I 
x21 
Fie 


EFI 


CSYNC 
RDYl 
I 


RDY2 
I 


A"E"N11 


AEN21 


RES 


RESET 
ose 
ClK 
PClK 


READY 
vce 
GNO 


RESET 
INPUT 


SYNCHRqNIZEO 
RESET 
OUTPUT 


OSCILLATOR 
O~TPUT 


MOS 
CLOCK 
108086 
Tn 
CLOCK 
FOR PERIPHERALS 


SYNCHRONIZED 
READY 
OUTPUT 


+SVOLT$ 
o VOLTS 


CLOCK 
SOURCE 
SELECT 


EXTERNAL 
CLOCK 
INPUT 


CLOCK 
SYNCHRONIZATION 
INPUT 


READY 
SIGNAL 
FROM 
TWO 
MUl 
TIBUSflll 
SYSTEMS 


18286/8287 
OCTAL BUS TRANSCEIVER 


INDUSTRIAL 


• 
Data Bus Buffer 
Driver for iAPX 86,88, 
MCS·8Q®, MCS·85®, and MCS·48® 
Families 


• 
High Output 
Drive Capability 
for 
Driving 
System 
Data Bus 


• 
Fully Parallel 8·Bit Transceivers 


• 
3·State Outputs 


• 
20·Pin Package with 0.3" Center 


• 
No Output 
Low Noise when Entering 
or Leaving 
High Impedance 
State 


• 
Industrial 
Temperature 
Range: 
- 40°C to +85°C 


The 18286and 18287are 8·bit bipolar transceivers 
with 3-state outputs. The 18287 inverts the input data at its outputs 
while the 18286 does not. Thus, a wide variety of applications 
for buffering 
in microcomputer 
systems can be met. 


11211 
r-------, 
I 
I 
I 
I 


11287 
r-------, 
I 
I 


1 
I 


8288 
BUS CONTROLLER 
FOR iAPX 86, 88 PROCESSORS 


• 
Provides Wide Flexibility in System 
Configurations 
• 
Facilitates 
Interface to One or Two 
Multi·Master 
Busses 


The Intel~ 8288 Bus Controller is a 20-pin bipolar component for use with medium-to-Iarge iAPX 86, 88 processing 
systems. The bus controller provides command and control timing generation aswell as bipolar bus drive capability while 
optimizing system performance. 


{~- 


MRDC 


8086 
STATUS 
-. 
MWTC 
lOB 
VCC 


STATUS 
~-- 
DECODER 
S2-- 
COM· 
AMWC 
ClK 
so 
MAND 
MULT1BUS™ 


SIGNAL 
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Pin Configuration 
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Symbol 
Type 
Name and Function 
" 


Vcc 
Power: 
+5V supply. 
. 


GND 
Ground. 
50.5,.52 
I 
Statu. 
Input 
Plna: 
These 
pins 
are the 
status Input pins from the 8086, 8088 or 


. ~ 
". 
8089 processors. 
The 8288 decodes these 
inputs to generate command 
and control 
signals 
at the 
appropriate 
time. 
When 


\' 
'" 
these pins are not in use (passive) they are 
all HIGH. (See chart under Command and 
Control 
Logic.) 


CLK 
I 
Clock: 
This 
is' a clock 
signal 
from 
the 
8284 clock generator 
and serves to estab- 


lish when command 
and control 
signals 
are generated. 


A~E 
0 
Addre •• 
Latch 
Enable: 
This 
signal 


serves 
to 
strobe 
an address 
into 
the 


'.~ 
address latches. This signal is active HIGH 
and latching 
occurs 
on the falling 
(HIGH 


to LOW) transition. 
ALE is intended 
for 


use with transparent 
0 type latches. 


DEN 
0 
Data 
Enable: 
This signal 
serves to en- 
able 
data 
transceivers 
onto 
either 
the 
local or system data bus. This signal 
is 


active HIGH. 


DT/R 
0 
Data nan.mll/Recelve: 
This signal 
es- 


tablishes 
the 
direction 
of 
data 
flow 


through 
the transceivers. 
A HIGH on this 
line 
Indicates 
Transmit 
(write 
to I/O or 


memory) 
and a LOW indicates 
Receive 


(Read). 


AEN 
I 
Addre •• Enable: 
AEN enables command 
outputs of the 8288 Bus Controller 
at least 


115 ns after it becomes active (LOW). AEN 
going 
Inactive 
immediately 
3-states 
the 


.~ 
command 
output 
drivers. 
AEN does not 
affect the I/O command 
lines if the 8288 is 


in the I/O Bus mode (lOB tied HIGH). 


CEN 
I 
Command 
Enable: 
When this 
signal 
is 
LOW all 8288 command 
outputs 
and the 
DEN and PDEN control outputs are forced 
. 
to their inactive state. When this signal is 
HIGH, these same outputs 
are enabled. 


lOB 
I 
Inpul/Output 
Bu. Mode: 
When the lOB is 
strapped 
HIGH the 8288 functions 
in the 


i 


I/O Bus mode. When it is strapped 
LOW, 


the 8288 
functions 
in the 
System 
Bus 
mode. (See sections 
on I/O Bus·and Sys- 
tem Bus modes). 


Symbol 
Type 
Name and Function 


AIOWC 
0 
Advanced 
I/O 
Write 
Command: 
The 


AIOWC 
issues 
an 
I/O Write 
Command 


earlier 
in the machine 
cycle to give I/O 


devices an early indication 
of a write 
in- 


struction. 
Its timing 
is the same as a read 


command 
signal. AIOWC is active LOW. 


10WC 
0 
I/O Write Command: 
This command 
line 
instructs an I/O device to read the data on 
the data bus. This signal 
is active LOW. 


10RC 
0 
I/O Read Command: 
This command 
line 
instructs 
an I/O device 
to drive 
its data 


onto 
the data bus. This signal 
is active 


LOW. 


AMWC 
0 
Advanced 
Memory 
Write 
Com- 


mand: 
The AMWC issues a memory write 


command 
earlier in the machine 
cycle to 


give memory devices an early indication 
of a write 
instruction. 
Its timing 
is the 


same as a read command signal. AMWC is 
active LOW. 


MWTC 
0 
Memory 
Write 
Command: 
This 
com- 


mand line instructs the memory to record 
the data present 
on the data bus. This 


signal is active LOW. 


MRDC 
0 
Memory 
Read 
Command: 
This 
com- 


mand line instructs 
the memory to drive 


its data onto the data bus. This signal is 
active LOW. 


INTA 
0 
Interrupt 
Acknowledge: 
This command 


line tells an interrupting 
device 
that 
its 


interrupt 
has been 
acknowledged 
and 


that it should drive vectoring 
information 


onto the data bus. This signal 
is active 


LOW. 


MCE/PDEN 
0 
This is a dual function 
pin. 


'. 


MCE (lOB I. tied LOW): 
Master Cascade 


Enable 
occurs 
during 
an interrupt 
se- 
, 
quence 
and serves 
to 
read a Cascade 


Address from a master PIC (Priority 
Inter- 


rupt Controller) 
onto 
the data bus. The 


MCE signal is active HIGH. 
POEN 
(lOB 
Is tied 
HIGH): 
Peripheral 


Data Enable enables the data bus trans- 
ceiver for the I/O bus that DEN performs 
for the system bus. PDEN is active LOW. 


inter 


Command and Control Logic 


The command logic decodes the three 8086,8088or 8089 
CPU status lines (So,51, 521to determine what command 
is to be issued. 
This chart shows the meaning of each status "word". 


s; s, So 
ProcessorState 
8288Command 


0 
0 
0 
InterruotAcknowledae 
INTA 
0 
0 
1 
Read110Port 
10RC 
0 
1 
0 
Write110Port 
10WC,AIOWC 
0 
1 
1 
Halt 
None 


1 
0 
0 
CodeAccess 
MRDC 
1 
0 
1 
ReadMemorv 
MRDC 
1 
1 
0 
WriteMemorv 
MWTC,AMWC 
1 
1 
1 
Passive 
None 


The command is issued in one of two ways dependent 
on the mode of the 8288 Bus Controller. 
I/O Bus Mode - 
The 8288 is in the I/O Bus mode if the 
lOB pin is strapped HIGH. In the I/O Bus mode all I/O 
command lines (IORC, 10WC, AIOWC, INTA) are always 
enabled (I.e., not dependent on AEN). When an I/O com- 
mand is initiated by the processor, the 8288 immediately 
activates the command lines using PDEN and DT/R to 
control the I/O bus transceiver. The I/O command lines 
should not be used to control the system bus in this 
configuration 
because no arbitration 
is present. This 
mode allows one 8288 Bus Controller to handle two ex- 
ternal busses. No waiting 
is involved when the CPU 


wants to gain access to the I/O bus. Normal memory ac- 
cess requires a "Bus Ready" signal (AEN LOW) before it 
will proceed. It is advantageous to use the lOB mode if 
I/O or peripherals dedicated to one processor exist in a 
multi-processor system. 


System Bus Mode - 
The 8288 is in the System Bus mode 
if the lOB pin is strapped LOW. In this mode no command 
is issued until 115 ns after the AEN Line is activated 
(LOW).This mode assumes bus arbitration logic will in· 
form the bus controller (on the AEN line) when the bus is 
free for use. Both memory and I/Ocommands wait for bus 
arbitration. This mode is used when only one bus exists. 
Here, both I/O and memory are shared by more than one 
processor. 


COMMAND 
OUTPUTS 
The advanced write commands are made available to in· 
itiate write procedures early in the machine cycle. This 
signal can be used to prevent the processor from enter- 
ing an unnecessary wait state. 


The command outputs are: 


MRDC - 
Memory Read Command 
MWTC - 
Memory Write Command 
10RC 
- 
I/O Read Command 
10WC 
- 
I/O Write Command 
AMWC - 
Advanced Memory Write Command 
AIOWC - 
Advanced I/O Write Command 


INTA 
- 
Interrupt Acknowledge 


INTA (Interrupt Acknowledge) acts as an I/O read during 
an interrupt 
cycle. Its purpose is to inform an inter· 


rupting device that its interrupt is being acknowledged 
and that it should place vectoring information onto the 
data bus. 


CONTROL 
OUTPUTS 
The control outputs of the 8288 are Data Enable (DEN), 
Data Transmit/Receive 
(DT/Fl) and 
Master 
Cascade 
Enable/Peripheral Data Enable (MCE/PDEN). The DEN 
signal determines 
when the external 
bus should 
be 
enabled onto the local bus and the DT/R determines the 
direction of data transfer. These two signals usually go 
to the chip select and direction pins of a transceiver. 


The MCE/PDEN pin changes function 
with 
the two 
modes of the 8288. When the 8288 is in the lOB mode 
(lOB HIGH) the PDEN signal serves as a dedicated data 
enable signal for the I/O or Peripheral System bus. 


INTERRUPT 
ACKNOWLEDGE 
AND MCE 
The MCE signal is used during an interrupt acknowl· 
edge cycle if the 8288 is in the System Bus mode (lOB 
LOW). During any interrupt sequence there are two inter- 
rupt acknowledge cycles that occur back to back. Dur· 
ing the first interrupt cycle no data or address transfers 
take place. Logic should be provided to mask off MCE 
during this cycle. Just before the second cycle begins 
the MCE signal gates a master Priority Interrupt Con- 
troller's 
(PIC) cascade address onto the processor's 
local bus where ALE (Address Latch Enable) strobes it 
into the address latches. On the leading edge of the 
second interrupt cycle the addressed slave PIC gates an 
interrupt vector onto the system data bus where it is 
read by the processor. 


If the system contains only one PIC, the MCE signal is 
not used. In this case the second Interrupt Acknowledge 
signal gates the interrupt vector onto the processor bus. 


ADDRESS 
LATCH ENABLE AND HALT 
Address Latch Enable (ALE)occurs during each machine 
cycle and serves to strobe the current address into the 
~d~ss 
latches. ALE also serves to strobe the status (SQ, 


51, 521 into a latch for halt state decoding. 


COMMAND 
ENABLE 
The Command Enable (CEN) input acts as a command 
qualifier for the 8288. If the CEN pin is high the 8288 
functions 
normally. If the CEN pin is pulled LOW, all 
command 
lines are held in their 
inactive 
state (not 
3-state). This featore can be used to implement memory 
partitioning and to eliminate address conflicts 
between 
system bus devices and resident bus devices. 


Temperature Under Bias 
O°Cto 70°C 


Storage Temperature 
-65°C to +150°C 


All Output and Supply Voltages 
-0.5V to + 7V 


All Input Voltages 
-1.0V to +5.5V 


Power Dissipation 
1.5 watt 


'NOTICE: Stresses above those listed under "Absolute 
Maximum Ratings" may cause permanent damage to the 
device. This is a stress rating only and functional opera- 
tion of the device at these or any other conditions above 
those indicated in the operational sections of this specifi- 
cation is not implied. Exposure to absolute maximum 
rating conditions for extended periods may affect device 
reliability. 


Symbol 
Parameter 
Min. 
Max. 
Unit 
Test Conditions 


V" 
Input Clamp Voltage 
-1 
V 
I" = -5 mA 


Ice 
Power Supply Current 
230 
mA 


IF 
Forward Input Current 
-0.7 
mA 
VF = 0.45V 


IR 
Reverse Input Current 
50 
JJ.A 
VR= Vrr 


VOL 
Output low Voltage 
Command Outputs 
0.5 
.V 
loL = 32 mA 


Control Outputs 
0.5 
V 
InL = 16 mA 


VOH 
Output High Voltage 
Command Outputs 
2.4 
V 
IOH= -5 mA 


Control Outputs 
2.4 
V 
I"H = -1 mA 


V1L 
.. 
Input low Voltaae 
0.8 
V 


V1H 
Input High Voltage 
. 
2.0 
V 


loFF 
Output Off Current 
, 
100 
JJ.A 
V"FF = 0.4 to 5.25V 


A.C. CHARACTERISTICS 
(Vee = 5V ± 10%, TA = O°Cto 70°C) 


TIMING 
REQUIREMENTS 


Symbol 
Parameter 
Min. 
Max. 
Unit 
Test Conditions 


TClCl 
ClK Cycle Period 
100 
ns 


TClCH 
ClK low Time 
50 
ns 
,.. 
, 


TCHCl 
ClK Hiah Time 
30 
ns 


TSVCH 
Status Active Setup Time 
35 
ns 


TCHSV 
Status Active Hold Time 
10 
ns 


TSHCl 
Status Inactive Setup Time 
35 
ns 


.1 


TClSH 
Status Inactive Hold Time 
10 
ns 


TILIH 
Input, Rise Time 
20 
ns 
From 0.8V to 2.0V 


TIHll 
Input, Fall Time 
12 
ns 
From 2.0V to 0.8V 


inter 


A.C. CHARACTERISTICS 
(Continued) 
TIMING RESPONSES 


Symbol 
Parameter 
Min. 
Max. 
Unit 
Test Conditions 


TCVNV 
Control Active Delay 
5 
45 
ns 


TCVNX 
Control Inactive Delay 
10 
45 
ns 


TCLLH, 
ALE MCE Active Delay (from CLK) 
20 
ns 


TCLMCH 


TSVLH, 
ALE MCE Active Delay (from 
20 
ns 


TSVMCH 
Status) 


TCHLL 
ALE Inactive Delay 
4 
15 
ns 
MRDC 


TCLML 
Command Active Delay 
10 
35 
ns 
IORC 


TCLMH 
Command Inactive Delay 
10 
35 
ns 
MWTC 
IOL= 32 mA 


TCHDTL 
Direction Control Active Delay 
50 
ns 
IOWC 
IOH= -5 mA 


TCHDTH 
Direction Control Inactive Delay 
30 
ns 
INTA 
CL = 300 pF 


TAELCH 
Command Enable Time 
40 
ns 
AMWC 


TAEHCZ 
Command Disable Time 
- - 
40 
ns 
AfOWC 


TAELCV 
Enable Delay Time 
115 
200 
ns 
, 


IOL= 16 mA 


TAEVNV 
AEN to DEN 
20 
ns 
Other 
IOH= -1 mA 
TCEVNV 
CEN to DEN, PDEN 
25 
ns 
CL = 80 pF 


TCELRH 
CEN to Command 
. 
TCLML 
ns 


TOLOH 
Output, Rise Time 
20 
ns 
From 0.8V to 2.0V 


TOHOL 
Output, Fall Time 
12 
ns 
From 2.0V to 0.8V 


2.4 -----y5_TEST 
POINTS_,v--- 
O.45---.A---------~ 


A.C. TESTING: 
INPUTS ARE DRIVEN 
AT 2.4V FOR A LOGIC "'" 
AND OASV 
FOR 


A LOGIC "0." 
THE CLOCK 
1$ DRIVEN 
AT 4.3V AND O.2SV. TIMING 
MEASURE· 
MENTS 
ARE 
MADE 
AT 1.5V FOR BOTH 
A LOGIC 
"'" 
AND "0." 


1.SV 


J,.OQ 


OUT~ 
1 


300PF 


1.5V 
J 33Q 


OUT~ 
1 


300PF 


2.14V 
J 527Q 


OUT~ 


1300 
pF 


2.28V 


J,'4Q 


OUT~ ISOpF 
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BUS IS SHOWN 
ONLY 
FOfl 
FlEFERENCE 
PUFlPOSES. 


2. 
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OTHERWISE. 
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18288 
BUS CONTROLLER 
FOR iAPX 86, 88 PROCESSORS 
INDUSTRIAL 


• 
Bipolar 
Drive Capability 
• 
Configurable 
for Use with an 1/0 Bus 


• 
Provides 
Advanced 
Commands 
• 
Facilitates 
Interface 
to One or Two 


Multi-Master 
Busses 
• 
Provides 
Wide Flexibility 
in System 
Configurations 
• 
Industrial 
Temperature 
Range: 
- 40°C to 85°C 
• 
3-State Command 
Output 
Drivers 


The Intel"' 18288 Bus Controller 
is a 20-pin bipolar component for use with medium-to-Iarge iAPX 86 processing 
systems. The bus controller provides command and control timing generation as well as bipolar bus drive capability 
while optimizing system performance. 


A strapping option on the bus controller configures it for use with a multi-master system bus and separate I/O bus. 
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Figure 2. Pin Configuration 
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8289 
BUS ARBITER 


• Provides Multi-Master 
System Bus 
Protocol 


• Synchronizes iAPX 86, 88 Processors 
with Multi-Master 
Bus 


• Provides Simple Interface with 8288 
Bus Controller 


• Four Operating Modes for Flexible 
System Configuration 


• Compatible with Intel Bus Standard 
MULTIBUS™ 


• Provides System Bus Arbitration for 
8089 lOP in Remote Mode 


The Intel 8289 Bus Arbiter is a 20-pin, 5-volt-only bipolar component for use with medium to large iAPX 86, 88 multi- 
master/multiprocessing systems. The 8289 provides system bus arbitration for systems with multiple bus masters, such as 
an 8086 CPU with 8089 lOP in its REMOTE mode, while providing bipolar buffering and drive capability. 
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Symbol 
Type 
Name and Function 


Vcc 
Power: 
+SV supply 
±10%. 


GND 
Ground. 


SO,S1,S2 
I 
Status 
Input 
Pins: 
The status 
input pins 


from an 8086, 8088 or 8089 processor. The 
8289 decodes these pins to initiate bus re- 
quest and surrender 
actions. (See Table 2.) 


CLK 
I 
Clock: 
From 
the 
8284 
clock 
chip 
and 
serves to establish 
when 
bus arbiter 
ac- 


tions are initiated. 


LOCK 
I 
Lock: 
A processor generated signal which 
when activated 
(low) prevents the arbiter 
from surrendering 
the multi-mastersystem 
bus to any other bus artiter, regardless 
of 


its priority. 


CRQLCK 
I 
Common 
Request 
Lock: 
An active 
low 
signal which prevents the arbiter from sur- 
rendering 
the multi-master 
system bus to 
any other 
bus arbiter 
requesting 
the bus 
through 
the CBRQ input pin. 


RESB 
I 
Resident 
Bus: 
A strapping 
option to con- 
figure the arbiter to operate in systems hav- 
ing both a multi-master 
system bus and a 
Resident 
Bus. Strapped 
high, 
the multi- 


master system bus is requested or s~ 
dered 
as a function 
of the SYSB/RESB 


input 
pin. Strapped 
low, the SYSB/RESB 
input is ignored. 


ANYRQST 
I 
Any 
Request: 
A strapping 
option 
which 
permits the multi-master 
system bus to be 
surrendered 
to a lower priority 
arbiter as if 
it were an arbiter 
of higher 
priority 
(i.e., 


when a lower priority 
arbiter 
requests the 
use of the multi-master 
system bUs, the bus 
is surrendered 
as soon as it is possible). 


When ANYRQST is strapped low, the bus is 
surrendered 
according 
to Ta~f 
ANY· 


RQST is strapped 
high and CBRQ is ac- 
tivated, the bus is surrendered 
at the end of 
the present bus cycle. Strapping 
CBRQ low 
and ANYRQST high forces the 8289 arbiter 
to surrender 
the multi-master 
system bus 
after each transfer 
cycle. Note that when 
surrender 
occurs 
BREQ 
is driven 
false 


(high). 


lOB 
I 
10 Bus: 
A strapping 
option 
which 
confi- 
gures the 8289 Arbiter 
to operate 
in sys- 


tems having 
both 
an 10 Bus (Peripheral 
Bus) and a multi-master 
system bus. The 
arbiter requests and surrenders 
the use of 
,. 
the multi-master 
system bus as a function 
of the status line,~. 
The multi-master 
sys- 
tem 
bus is permitted 
to be surrendered 
while the processor 
is performing 
10 com- 


I' 
mands and is requested whenever the pro- 
cessor 
performs 
a memory 
command. 


Interrupt 
cycles 
are assumed 
as coming 
from. the peripheral 
bus and are treated as 
an 10 command. 


Symbol 
Type 
Name and Function 


AEN 
0 
Address 
Enable: 
The output 
of the 8289 


Arbiter to the processor's 
address latches, 


to the 
8288 
Bus Controller 
and 
8284A 


Clock Generator. AEN serves to instruct the 
Bus Controller 
and address 
latches when 


to tri-state 
their output 
drivers. 


SYSB/ 
I 
System 
Bus/Resident 
Bus: 
An 
input 


RESB 
signal when the arbiter is configured 
in the 


S.R. Mode (RESB is strapped 
high) which 


determines 
when the multi-master 
system 


bus is requested 
and multi-master 
system 


bus surrendering 
is permitted. 
The signal 


is intended 
to originate 
from 
a form 
of 


address-mapping 
circuitry, as a decoder or 


PROM attached 
to the 
resident 
address 


bus. Signal 
transitions 
and glitches 
are 


permitted on this pin from ¢1 ofT4to 
¢ 1 of 


T2 of the 
processor 
cycle. 
During 
the 


period from ¢1 ofT2to 
¢ 1 ofT4, only clean 


transitions 
are permitted 
on this 
pin (no 


glitches). 
If a glitch occurs, the arbiter may 


capture 
or miss it, and the multi-master 


system bus may be requested 
or surren- 


dered, 
depending 
upon 
the state of the 


glitch. 
The 
arbiter 
requests 
the 
multi- 


master system bus in the S.R. Mode when 
the state of the SYSB~ 
pin is high and 


permits 
the bus to be surrendered 
when 


this pin is low. 


CBRQ 
I/O 
Common 
Bus 
Request: 
An input 
signal 


which 
instructs 
the arbiter 
if there are any 


other arbiters 
of lower priority 
requesting 


the use of the multi-master 
system bus. 


The CBRQ pins (open-collector 
output) 
of 


all the 8289 Bus Arbiters 
which 
surrender 


to the multi-master 
system 
bus upon 
re- 


quest are connected 
together. 


The Bus Arbiter running 
the current trans- 


fer cycle will not itself pull the CBRQ line 
low. Any other 
arbiter 
connected 
to the 


CBRQ line can request 
the multi-master 


system bus. The arbiter 
presently 
running 


the current 
transfer 
cycle drops its BREQ 


signal 
and surrenders 
the bus whenever 


the 
proper 
surrender 
conditions 
exist. 


Strapping 
CBRQ low and ANYRQST high 


allows the multi-master 
system bus to be 


surrendered 
after each transfer 
cycle. See 


the pin definition 
of ANYRQST. 


INIT 
I 
Initialize: 
An active low multi-master 
sys- 


tem bus input signal used to reset all the 
bus arbiters 
on the multi-master 
system 


bus. After 
initialization, 
no arbiters 
have 


the use of the multi-master 
system bus. 


inter 


Symbol 
Type 
Name and Function 


BClK 
I 
Bus Clock: 
The multi-master 
system bus 
clock to which all multi-master system bus 
interface signals are synchronized. 


BREa 
0 
Bus Request: 
An active low output signal 
in the parallel Priority Resolving Scheme 
which the arbiter activates to request the 
use of the multi-master 
system bus. 


BPRN 
I 
Bus Priority 
In: The active low signal re- 
turned to the arbiter to instruct ilthat it may 
acquire the multi-master system bus on the 
next falling edge of BClK. BPRN indicates 
to the arbiter that it is the highest priority 
requesting 
arbiter 
presently 
on the bus. 
The loss of BPRN instructs the arbiter that 
it has lost priority 
to a higher 
priority 
arbiter. 


The 
8289 
Bus 
Arbiter 
operates 
in conjunction 
with 
the 
8288 Bus Controller 
to interface 
iAPX 86, 88 processors 
to 
a multi-master 
system 
bus (both 
the iAPX 86 and iAPX 88 
are configured 
in their 
max mode). 
The processor 
is un- 
aware 
of the arbiter's 
existence 
and issues 
commands 
as 
though 
it has exclusive 
use of the system 
bus. If the pro- 
cessor 
does 
not have the use of the multi-master 
system 
bus, the 
arbiter 
prevents 
the 
Bus 
Controller 
(8288), 
the 
data transceivers 
and the address 
latches 
from 
accessing 
the system 
bus (e.g. all bus driver 
outputs 
are forced 
into 
the high impedance 
state). Since the command 
sequence 
was not issued 
by the 8288, the system 
bus will appear 
as 


"Not 
Ready" 
and the processor 
will enter wait states. 
The 
processor 
will 
remain 
in Wait 
until 
the 
Bus 
Arbiter 
ac- 
quires 
the use of the multi-master 
system 
bus whereupon 
the arbiter 
will allow the bus controller, 
the data transceiv- 
ers, and the address 
latches 
to access 
the system. 
Typi- 
ca!ly, 
once 
the 
command 
has 
been 
issued 
and 
a data 
transfer 
has taken 
place, 
a transfer 
acknowledge 
(XACK) 
is returned 
to the processor 
to indicate 
"READY" 
from the 
accessed 
slave device. 
The processor 
then 
completes 
its 
transfer 
cycle. 
Thus the arbiter 
serves to multiplex 
a pro- 
cessor(or 
bus master) 
onto a multi-master 
system 
bus and 
avoid 
contention 
problems 
between 
bus masters. 


In general, 
higher 
priority 
masters 
obtain 
the bus when a 
lower 
priority 
master 
completes 
its 
present 
transfer 
cycle. 
Lower 
priority 
bus masters 
obtain 
the bus when 
a 
higher 
priority 
master 
is not accessing 
the system 
bus. 


A strapping 
option 
(ANYRaSTj 
is provided 
to allow 
the 
arbiter 
to surrender 
the bus to a lower 
priority 
master 
as 
though 
it were a master 
of higher 
priority. 
If there 
are no 
other 
bus masters 
requesting 
the bus, the arbiter 
main- 
tains 
the 
bus 
so long 
as its processor 
has not 
entered 


Symbol 
Type 
Name and Function 


BPRO 
0 
Bus Priority 
Out: An active 
low output 
signal used in the serial priority resolving 
scheme where BPRO is daisy-chained 
to 
BPRN of the next lower priority arbiter. 


BUSY 
1/0 
Busy: 
An active 
low 
open 
collector 
multi-master 
system bus interface signal 
used to instruct all the arbiters on the bus 
when the multi-master system bus is avail- 
able. When the multi-master system bus is 
available 
the highest 
requesting 
arbiter 
(determined by BPRN) seizes the bus and 
pulls BUSY low to keep other arbiters off of 
the bus. When the arbiter is done with the 
bus, it releases the BUSY signal, permitting 
it to go high and thereby allowing another 
arbiter to acquire the multi-master system 
bus. 


the HALT State. The arbiter 
will not voluntarily 
surrender 
the 
system 
bus 
and 
has 
to 
be forced 
off 
by another 
master's 
bus request, 
the HALT State 
being 
the only ex- 
ception. 
Additional 
strapping 
options 
permit 
other 
modes 
of 
operation 
wherein 
the 
multi-master 
system 
bus is surrendered 
or requested 
under 
different 
sets 
of 
conditions. 


Since 
there 
can be many 
bus masters 
on a multi-master 
system 
bus, some 
means 
of resolving 
priority 
between 
bus masters 
simultaneously 
requesting 
the bus must 
be 
provided. 
The 8289 Bus Arbiter 
provides 
several 
resolv- 
ing techniques. 
All the techniques 
are based 
on a priori· 
ty concept 
that 
at a given 
time 
one bus master 
will 
have 
priority 
above 
all the rest. There are provisions 
for using 
parallel 
priority 
resolving 
techniques, 
serial 
priority 
resolving 
techniques, 
and rotating 
priority 
techniques. 


PARALLEL 
PRIORITY 
RESOLVING 


The parallel 
priority 
resolving 
technique 
uses a separate 
bus 
request 
line (trnrn) 
for 
each 
arbiter 
on the 
multi- 
master 
system 
bus, see Figure 
4. Each BREa 
line enters 
into 
a priority 
encoder 
which 
generates 
the 
binary 
ad· 
dress 
of the 
highest 
priority 
BREa 
line which 
is active. 


The 
binary 
address 
is decoded 
by a decoder 
to select 
the 
corresponding 
BPRN 
(Bus 
Priority 
In) 
line 
to 
be 
returned 
to the 
highest 
priority 
requesting 
arbiter. 
The 
arbiter 
receiving 
priority 
(BPRN 
true) 
then 
allows 
its 
associated 
bus 
master 
onto 
the 
multi-master 
system 
bus as soon 
as it becomes 
available 
(I.e., the 
bus is no 
longer 
busy). 
When 
one 
bus 
arbiter 
gains 
priority 
over 
another 
arbiter 
it cannot 
immediately 
seize 
the 
bus, 
it 
must 
wait 
until 
the present 
bus transaction 
is complete. 


Upon completing 
its transaction the present bus occu, 


pant recognizes that it no longer has priority and sur- 
renders the bus by releasing BUSY. BUSY is an active 
low "OR" tied signal line which goes to every bus arbiter 
on the system bus. When BUSY goes inactive (high), the 
arbiter which presently has bus priority (BPRN true) then 


seizes the bus and pulls BUSY low to keep other arbiters 
off of the bus. See waveform timing diagram, Figure 5. 
Note that all'multi,master 
system bus transactions are 


synchronized to the bus clock (BClK). This allows the 
parallel priority resolving circuitry or any other priority 
resolving scheme employed to settle. 


74148 
PRIORITY 
ENCODER 


I 


HIGHER 
PRIORITY 
BUS ARBITER 
REQUESTS 
THE 
MULTI·MASTER 
SYSTEM 
BUS. 


2 
ATTAINS 
PRIORITY. 


3 
LOWER 
PRiORITY 
BUS ARBITER 
RELEASES 
BUSY. 


4 
HIGHER 
PRIORITY 
BUS ARBITER 
THEN 
ACQUIRES 
THE 
BUS 
AND 
PULLS 
BUSY 
DOWN. 


inter 


SERIAL 
PRIORITY 
RESOLVING 
The serial priority resolving technique eliminates the 
need for the priority encoder-decoder arrangement by 
daisy-chaining the bus arbiters together, connecting the 
higher priority bus arbiter's BPRO(Bus Priority Out) out- 
put to the BPRN of the next lower priority. See Figure 6. 


THE 
NUMBER 
OF 
ARBITERS 
THAT 
MAY 
BE 
DAISY·CHAINED 
TOGETHER 
IN 
THE 


SERIAL 
PRIORITY 
RESOLVING 
SCHEME 
IS A FUNCTION 
OF 
BClK 
AND 
THE 
PROPA· 


GATION 
DelAY 
FROM 
ARBITER 
TO 
ARBITER. 
NORMAllY, 
AT 
10 MH10NLY 
3 ARBI· 


TER 
MAY 
BE 
DAISY-CHAINED. 


ROTATING 
PRIORITY 
RESOLVING 
The rotating priority resolving technique is similar to 
that of the parallel priority resolving technique except 
that priority is dynamically re-assigned. The priority en- 
coder is replaced by a more complex circuit which ro- 
tates priority between requesting arbiters thus allowing 
each arbiter an equal chance to use the multi-master 
system bus, over time. 


Which Priority Resolving Technique To 
Use 


There are advantages and disadvantages for each of the 
techniques 
described 
above. 
The 
rotating 
priority 
resolving technique requires substantial external logic 
to implement while the serial technique uses no exter- 
nallogic but can accommodate only a limited number of 
bus arbiters before the daisy-chain propagation delay 
exceeds the multi-master's system bus clock (BCLK). 
The parallel priority resolving technique is in general a 
good compromise between the other two techniques. It 
allows for many arbiters to be present on the bus while 
not requiring too much logic to implement. 


·In some system configurations 
it is possible for a non-lID Processor to 
have access 
to more 
than one 
Multi-Master 
System 
Bus, see 8289 
Application 
Note. 


There aretwo types of processors in the iAPX86family. An 
Input/Output processor (the 8089lOP)and the iAPX86/10, 
88/10CPUs.Consequently, there are two basic operating 
modes in the 8289bus arbiter. One,the lOB (I/OPeripheral 
Bus) mode, permits the processor access to both an I/O 
Peripheral Bus and a multi-master system bus. The sec- 
ond, the RESB (Resident Bus mode), permits the pro- 
cessor to communicate over both a Resident Bus and a 
multi-master system bus. An I/O Peripheral Bus is a bus 
where all devices on that bus, including memory, are 
treated as I/O devices and are addressed by I/O com- 
mands. All memory commands are directed to another 
bus, the multi-master system bus. A Resident Bus can 
issue both memory and I/O commands, but it is a distinct 
and separate bus from the multi-master system bus. The 
distinction is that the Resident Bus has only one master, 
providing full availability and being dedicated to that one 
master. 


The lOB strapping option configures the 8289 Bus Ar- 
biter into the lOB mode and the strapping option RESB 
configures it into the RESB mode. It might be noted at 
this point that if both strapping options are strapped 
false, the arbiter interfaces the processor to a multi- 
master system bus only (see Figure 7). With both op- 
tions strapped true, the arbiter interfaces the processor 
to a multi-master system bus, a Resident Bus, and an I/O 
Bus. 


In the lOB mode, the processor communicates and con- 
trols a host of peripherals over the Peripheral Bus. When 
the I/O Processor needs to communicate with system 
memory, it does so over the system memory bus. Figure 
8 shows a possible I/O Processor system configuration. 


The iAPX 86 and iAPX 88 processors can communicate 
with a Resident Bus and a multi-master system bus. Two 
bus controllers and only one BusArbiter would be needed 
in such a configuration as shown in Figure 9. In such a 
system configuration the processor would haveaccessto 
memory and peripherals of both busses. Memory map- 
ping techniques are applied to select which bus is to be 
accessed.The SYSB/RESBinput on the arbiter serves to 
instruct the arbiter as to whether or not the system bus is 
to beaccessed.The signal connected to SYSB/RESBalso 
enables or disables commands from one of the bus 
controllers. 


A summary of the modes that the 8289 has, along with 
its response to its status lines inputs, is summarized in 
Table 2. 


Singlo 
~ 
StatuI lines From 
lOB Modo 
RESB (Modo) 
Only 
lOB Mode 
RESB Mode 
lOB = High 


8086 or 8088 or 8089 
Only 
lOB = High 
RESB - 
High 
lOB = Low RESB ~ High 
RESB = Low 


52 
51 
So 
lOB = Low 
SYSB/RESB 
= High 
SYSB/RESB 
= Low 
SYSB/RESB 
= High 
SYSB/RESB 
= Low 


0 
0 
0 
x 
x 
x 
x 
110 
0 
0 
1 
x 
x 
x 
COMMANDS 


x 


0 
1 
0 
x 
x 
x 
, 
x 


HALT 
0 
1 
1 
x 
x 
x 
x 
, 
x 
x 


1 
0 
0 
x 
x 
MEM 
0 
1 
x 
x 
1 
COMMANDS 
1 
1 
0 
x 
x 


IDLE 
1 
1 
1 
x 
x 
x 
x 
x 
x 


NOTES: 


1. X = Multi-Master 
System 
Bus is allowed 
to be Surrendered. 
2. 
V' = Multi-Master 
System 
Bus is Requested. 


Pin 
Multi·Master 
System 
Bus 
Mode 
Strapping 
Requested· 
• 
Surrendered· 


Single 
Bus 
10B= 
High 
Whenever the processor's 


HLT + TI· 
CBRQ+ 
HPBRQI 
Multi-Master 
Mode 
RESB= 
Low 
status 
lines 
go active 


RESB 
Mode 
Only 
10B= 
High 
SYSB/~= 
High. 
(SYSB/IITmi 
= Low 
+ TI) • 
RESB= 
High 
ACTIVE 
STATUS 
CBRQ+ 
HLT + HPBRQ 


lOB 
Mode 
Only 
10B= 
Low 
Memory 
Commands 
(I/O 
Status 
+ TI) • CBRQ 
+ 


RESB= 
Low 
HLT+HPBRQ 


10B= 
Low 
(Memory 
Command) 
• 
«I/O 
Status 
Commands)+ 


lOB 
Mode·RESB 
Mode 
SYSB/~ 
= LOW)) 
• CBRQ 
RESB= 
High 
(SYSBfRESB 
= High) 
+ HPBRQI 
+ HLT 


NOTES: 


*LOCK 
prevents 
surrender 
of Bus to any other arbiter, 
CRQlCK 
prevents 
surrender 
of Bus to any lower 
priority 
arbiter. 
"Except 
for HALT and Passive 
or IDLE Status. 


I HPBRO, 
Higher 
priority 
Bus request 
or BPRN = 1. 
1. lOB Active 
Low. 
2. RESB Active 
High. 
3. + is read as "OR" and. 
as "AND." 


4. TI = Processor 
Idle Status $2, 51, SO = 111 
5. HLT= 
Processor 
Halt Status 52,51, SO= 011 


inter 


MULnMA5TER 
SYSTEM 
COMMAND 
BUS 


MUlnMASTER 
SYSTEM 


ADDRESS 


BUS 


MULTI-MASTER 


SYSTEM 


DATA 
BUS 


AENl 


-;- 
12M" 


XACK {I/O BUS)>------ 
ROV1CLOC~0Y2 


=:) 


MULTI·MASTER 


-- 
SYSTEM 
COMMAND 
'"' 


MULTI-MASTER 
SYSTEM 
ADDRESS 
'US 


MULnMASTEA 
SYSTEM 
DATA 
'US 


inter 


AOOR 
LATCH 


828211283 
(20R31 


inter 


Temperature Under Bias 
O·C to 70·C 
Storage Temperature 
- 65·C to + 150·C 
All Output and Supply Voltages 
- 0.5V to + 7V 
All Input Voltages 
- 1.0V to + 5.5V 
Power Dissipation 
1.5 Watt 


'NOTICE: Stresses above those listed under "Absolute 
Maximum Ratings" may cause permanent damage to the 
device. This is a stress rating only and functional opera- 
tion of the device at these or any other conditions above 
those indicated in the operational sections of this specifi- 
cation is not implied. Exposure to absolute maximum 
rating conditions for extended periods may affect device 
reliability. 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Condition 


Ve 
Input Clamp Voltage 
-1.0 
V 
Vee=4.50V, 
le= 
-5 
mA 


IF 
Input Forward Current 
-0.5 
mA 
Vee=5.50V, 
VF=0.45V 


IA 
Reverse Input Leakage Current 
60 
"A 
Vee = 5.50, VA= 5.50 


VOL 
Output Low Voltage 
BUSY,CBRa 
0.45 
V 
IOL=20 mA 
AEN 
0.45 
V 
IOL= 16 mA 
BPRO,BREa 
0.45 
V 
tOL= 10 mA 


VOH 
Output High Voltage 
BUSY,CBRa 
Open Collector 


All Other Outputs 
2.4 
V 
IOH= 400 "A 


Ice 
Power Supply Current 
165 
mA 


V1L 
Input Low Voltage 
.8 
V 


V1H 
Input High Voltage 
2.0 
V 


Cin Status 
Input Capacitance 
25 
pF 


Cin (Others) 
Input Capacitance 
12 
pF 


A.C. CHARACTERISTICS 
(Vec = +5V ±10%, TA = o·C to 70°C) 


TIMING 
REQUIREMENTS 


Symbol 
Parameter 
Min. 
Max. 
Unit 
Test Condition 


TClCl 
ClK Cycle Period 
125 
ns 


TClCH 
ClK low Time 
65 
ns 


TCHCl 
ClK High Time 
35 
ns 


TSVCH 
Status Active Setup 
65 
TClCl-10 
ns 


TSHCl 
Status Inactive Setup 
50 
TClCl·10 
ns 


THVCH 
Status Active Hold 
10 
ns 


THVCl 
Status Inactive Hold 
10 
ns 


TBYSBl 
BUSYHSetup to BClK~ 
20 
ns 


TCBSBl 
CBROHSetup to BClK~ 
20 
ns 


TBlBl 
BClK Cycle Time 
100 
ns 


TBHCl 
BClK High Time 
30 
.65[TBlBl) 
ns 


TCllL1 
lOCK Inactive Hold 
10 
ns 


TCllL2 
lOCK Active Setup 
40 
ns 


TPNBl 
BPRNHto BClK Setup Time 
15 
ns 


TClSR1 
SYSB/RESB Setup 
0 
ns 


TClSR2 
SYSB/RESB Hold 
20 
ns 


TIVIH 
Initialization Pulse Width 
3 TBlBl+ 
ns 
3 TClCl 


TILIH 
Input Rise Time 
20 
ns 
From 0.8 to 2.0V 


TIHll 
Input Fall Time 
12 
ns 
From 2.0V to 0.8V 


intel 


A.C. CHARACTERISTICS 
(Continued) 


TIMING 
RESPONSES 


Symbol 
Parameter 
Min. 
Max. 
Unit 
Test Condition 


TBLBRL 
BCLK to BREQ Delay!t 
35 
ns 


TBLPOH 
BCLK to BPRO!t 
(See Note 1) 
40 
ns 


TPNPO 
BPRN!tto 
BPRO!tDelay 
25 
ns 


)' 
.' 


(See Note 1) 
, 


TBLBYL 
BCLK to BUSY Low 
60 
ns 


TBLBYH 
BCLK to BUSY Float (See Note 2) 
r 
35 
ns 


TCLAEH 
CLK to AEN High 
65 
ns 


TBLAEL 
BCLK to AEN Low 
40 
ns 


TBLCBL 
BCLK to CBRQ Low 
60 
ns 


TRLCRH 
BCLK to CBRQ Float (See Note 2) 
35 
nS 


TOLOH 
Output Rise Time 
20 
ns 
From O.BV to 2.0V 


TOHOL 
Output Fall Time 
12 
ns 
From 2.0V to O.BV 


NOTES: 
1. BCLK generates the first BPRO wherein subsequent 
BPRO changes lower in the chain are generated 
through 
BPRON. 
2. Measured at .5V above GND. 


2.4 ~5_TESTPOINTS_1X= 


0.45 


NOTES, 
,. 
LOCK 
ACTIVE 
CAN 
OCCUR 
DURING 
ANY 
STATE, 
AS 
LONG 
AS 
THE 
RELATIONSHIPS 
SHOWN 
ABOVE 
WITH 
RESPECTTO 
THE CLK ARE MAINTAINED. 


COCK 
INACTIVE 
HAS NO CRITICAL 
TIME 
AND 
CAN 
BE ASYNCHRONOUS. 


-GROLCK 
HAS NO CRITICAL 
TIMING 
AND 
IS CONSIDERED 
AN ASYNCHRONOUS 


INPUT 
SIGNAL 
2. GLiTCHING 
OF SYSBIReSB 
PIN IS PERMITTED 
DURING 
THIS TIME. 
AFTER 
~ 2 OF 


n, 
AND 
BEFORE 
~1 OF T4, SYSBIRESBSHOULD 
BE STABLE. 
3. AEN LEADING 
EDGE 
IS RELATED 
TO 
BCLK, 
TRAILING 
EDGE 
TO 
CLK. 
THE 


TRAILING 
EDGE 
OF AEN 
OCCURS 
AFTER 
BUS 
PRIORITY 
IS LOST. 


ADDITIONAL 
NOTES: 


The signals 
related 
to ClK 
are typical 
processor 
signals, 
and do not relate 
to the depicted 
sequence 
of events 
of the 


signals 
referenced 
to BClK. 
The signals 
shown 
related 
to the BClK 
represent 
a hypothetical 
sequence 
of events 
for 
illustration. 
Assume 
3 bus arbiters 
of priorities 
1, 2 and 3 configured 
in serial 
priority 
resolving 
scheme 
as shown 
in 
Figure 
6. Assume 
arbiter 
1 has the bus and is holding 
busy 
low. Arbiter 
#2 detect~rocessor 
wants 
the bus and 
pulls 
low BREQ#2. 
If BPRN#2 
is high (as shown), 
arbiter 
#2 will pUll low CBRQ line. CBRQ signals 
to the higher 
priority 


arbiter 
#1 that 
a lower 
priority 
arbiter 
wants 
the bus, [A higher 
priority 
arbiter 
would 
be granted 
BPRN when 
it makes 


the bus request 
rather 
than having 
to wait for another 
arbiter 
to release 
the bus through ClmOJ.·· Arbiter 
#1 will 
relin- 


quish 
the multi-master 
system 
bus when 
it enters 
a state 
not requiring 
it (see Table 
1), by lowering 
its EiPR5ii1 
(tied to 


Bl"RNl/2) 
and releasing 
BUSY. Arbiter 
#2 now sees that 
it has priority 
from 
BPRN#2 
being 
low and releases 
CBRQ. 
As 


soon as BUSY signifies 
the bus is available 
(high), arbiter 
#2 pulls 
BUSY low on next falling 
edge of BClK. 
Note that if 
arbiter 
#2 didn't 
want 
the bus at the time 
it received 
priority, 
it would 
pass priority 
to the next lower 
priority 
arbiter 
by 


lowering 
its BPRO #2 [TPNPOj. 


in+-.I@ 
MODEL 230 
~ 
INTELLEC SERIES II 
MICROCOMPUTER 
DEVELOPMENT SYSTEM 


Complete 
microcomputer 
development 
center for Intel MCS·86, MCS·80, MCS·85 
and MCS·48 microprocessor families 


LSI electronics 
board with CPU, RAM, 
ROM, I/O, and Interrupt circuitry 


64K bytes RAM memory 


Elght·level nested, maskable priority 
interrupt system 


Built·in interfaces for high speed paper 
tape reader/punch, printer, and universal 
PROM programmer 


Integral CRT with detachable upper/ 
lower case typewriter·style 
full ASCII 
keyboard 


Powerful ISIS· II Diskette Operating 
System software with relocating 
macroassembler, 
linker, and locater 


1 million bytes (expandable to 2.5M 
bytes) of diskette storage 


Supports PUM and FORTRAN high level 
languages 


Standard MULTIBUS with multiprocessor 
and DMA capability 


Compatible with standard Inteliec/iSBC 
expansion modules 


Software compatible with previous 
Intellec systems 


The Model 230 Intellec 
Series II Microcomputer 
Development 
System is a complete 
center for the development 
of 


microcomputer-based 
products. It includes a CPU, 64K bytes of RAM, 4K bytes of ROM memory, a 2000-character CRT, 


a detachable full ASCII keyboard, and dual double density diskette drives providing over 1 million bytes of on-line data 
storage. Powerful ISIS-II Diskette Operating System software allows the Model 230 to be used quickly and efficiently 
for assembling 
and/or compiling 
and debugging 
programs for Intel's MCS-S6, MCS-SO,MCS-S5, or MCS-4S microproc- 


essor families 
without 
the need for handling paper tape. ISIS-II performs all file handling operations, 
leaving the user 
free to concentrate 
on the details 
of his own application. 
When used in conjunction 
with an optional 
in-circuit 
emulator 
(ICE) module, the Model 230 provides all the hardware and software 
development 
tools necessary for the 
rapid development 
of a microcomputer-based 
product. 


The Intel lee Series II Model 230 is a packaged, highly 
integrated microcomputer development system consist- 
ing of a CRT chassis with a 6-slot cardcage, power sup- 
ply, fans, 
cables, 
and five 
printed 
circuit 
cards. A 
separate, full ASCII keyboard is connected with a cable. 
A second chassis contains two floppy disk drives capa- 
ble of double-density 
operation along with a separate 
power supply, fans, and cables for connection 
to the 


main chassis. 
A block diagram of the Model 230 is 


shown in Figure 1. 


CPU Cards - 
The master CPU card contains its own 
microprocessor, 
memory, I/O, interrupt and bus inter- 
face circuitry fashioned from Intel's high technology LSI 
components. Known as the integrated processor board 
(IPB), it occupies the first slot in the cardcage. A second 
slave CPU card is responsible for all remaining I/O con- 
trol including the CRT and keyboard interface. This card, 
mounted on the rear panel, also contains its own micro- 
processor, 
RAM and ROM memory, and I/O interface 


logic, thus, in effect, creating a dual processor environ- 
ment. Known as the I/O controller (IOC), the slave CPU 


card communicates 
with the IPB over an 8-bit bidirec- 


tional data bus. 


Memory and Control Cards - 
In addition, 32K bytes of 


RAM (bringing the total to 64K bytes) is located on a 
separate card in the main cardcage. Fabricated 
from 


Intel's 16K RAMs, the board also contains all necessary 
address 
decoding 
and refresh 
logic. 
Two additional 


boards in the cardcage are used to control 
the two 


double-density floppy disk drives. 


Expansion - 
Two remaining slots in the cardcage are 


available for system expansion. Additional expansion of 
4 slots can be achieved through the addition of an Intel- 
lee Series II expansion chassis. 


System Components 


The heart of the IPB is an Intel NMOS 8-bit microproces- 
sor, the 8080A-2, running at 2.6 MHz. 32K bytes of RAM 
memory are provided on the board using 
Intel 
16K 


RAMs. 4K of ROM is provided, preprogrammed with sys- 
tem bootstrap "self-test" 
diagnostics 
and the Intellec 


Series II System Monitor. The eight-level vectored prior- 
ity interrupt system allows interrupts to be individually 
masked. Using Intel's 
versatile 
8259A interrupt 
con- 


troller, the interrupt system may be user programmed to 
respond to individual needs. 


Input/Output 


IPB Serial Channels - 
The I/O sUbsystem In the Model 


230 consists of two parts: the 10C card and two serial 
channels on the IPB itself. Each serial channel is RS232 
compatible and is capable of running asynchronously 
from 110to 9600 baud or synchronously from 150to 56K 
baud. Both may be connected to a user defined data set 
or 
terminal. 
One 
channel 
contains 
current 
loop 


adapters. Both channels are implemented using Intel's 
8251A USART. They can be programmatically selected 
to perform a variety of I/O functions. Baud rate selection 
is accomplished programmatically through an Intel 8253 
interval timer. The 8253 also serves as a real·time clock 
for the entire system. I/O activity through both serial 
channels is signaled to the system through a second 
8259 interrupt 
controller, 
operating in a polled mode 


nested to the primary 8259. 


IOC Interface - 
The remainder of system I/O activity 


takes place in the 10C. The 10C provides interface for 
the CRT, keyboard, and standard Intellec peripherals 
including printer, high speed paper tape reader/punch, 
and universal PROM programmer. The 10C contains its 
own independent microprocessor, also an 8080A-2.The 
CPU controls all I/O operations as well as supervising 
communications 
with the IPB. 8K bytes of ROM contain 


all I/O control firmware. 8K bytes of RAM are used for 
CRT screen refresh storage. These do not occupy space 
in Intellec Series II main memory since the 10C is a 
totally independent microcomputer subsystem. 


Integral 
CRT 


Display - 
The CRT is a 12-inch raster scan type monitor 


with a 50/60 Hz vertical scan rate and 15.5kHz horizontal 
scan rate. Controls are provided for brightness and con· 
trast adjustments. The interface to the CRT is provided 
through an Intel 8275 single chip programmable CRT 
controller. The master processor on the IPB transfers a 
character for display to the 10C, where it is stored in 
RAM. The CRT controller reads a line at a time into its 
line buffer through an Intel 8257 DMA controller 
and 


then feeds one character at a time to the character gen- 
erator to produce the video signal. Timing for the CRT 
control is provided by an Intel 8253 interval timer. The 
screen display is formatted as 25 rows of 80 characters. 
The full set of ASCII characters are displayed, including 
lower case alphas. 


Keyboard - 
The keyboard interfaces directly to the 10C 


processor via an 8-bit data bus. The keyboard contains 
an Intel UPI·41 Universal Peripheral Interface, which 
scans the keyboard, encodes the characters, and buf· 
fers the characters to provide N·key rollover. The key· 
board itself is a high quality typewriter style keyboard 
containing the full ASCII character set. An upperllower 
case switch allows the system to be used for document 
preparation. 
Cursor 
control 
keys are also 
provided. 


Peripheral 
Interface 


A UPI-41Universal Peripheral Interface on the 10Cboard 
performs 
similar 
functions 
to the UPI·41 on the PIO 


board in the Model 210. It provides interface for other 
standard Intel lee peripherals including a printer, high 
speed paper tape reader, high speed paper tape punch, 


and 
universal 
PROM 
programmer. 
Communication 


between the IPB and 10C is maintained over a separate 
8·bit bidirectional 
data bus. Connectors 
for the four 


devices named above, as well as the two serial chan· 
nels, are mounted directly on the 10C itself. 


Control 


User control Is maintained through a front panel, con· 
sisting 
of a power switch 
and indicator, 
reset/boot 


switch, run/halt light, and eight interrupt switches and 
indicators. 
The front 
panel circuit 
board is attached 


directly to the IPB, allowing the eight interrupt switches 
to connect to the primary 8259A,as well as to the Intellec 
Series II bus. 


Diskette 
System 


The Intellec Series II double density diskette 
system 


provides direct access bulk storage, intelligent control· 
ler, and two diskette drives. Each drive provides V2 mil· 
lion bytes of storage with a data transfer rate of 500,000 
bits/second. The controller is implemented with Intel's 
powerful Series 3000 Bipolar Microcomputer 
Set. The 


controller provides an interface to the Intellec Series II 
system bus, as well as supporting up to four diskette 
drives. The diskette system records all data in soft sec- 
tor format. The diskette system is capable of performing 
seven different 
operations: 
recalibrate, 
seek, format 


track, write data, write deleted data, read data, and verify 
CRC. 


Diskette Controller 
Boards - 
The diskette 
controller 


consists of two boards, the channel board and the inter- 
face board. These two PC boards reside in the Intellec 
Series II system chassis and constitute 
the diskette 


controner. The channel board receives, decodes and 
responds to channel commands from the 8080A-2 CPU 
in the Model 230. The interface 
board provides the 


diskette controller with a means of communication with 
the diskette drives and with the Intellec system bus. The 
interface 
board validates data during 
reads using a 


cyclic redundancy check (CRC) polynomial and gener- 
ates CRC data during write operations. When the disk- 
ette controller requires access to Intellec system mem- 
ory, the interface board requests and maintains DMA 
master control of the system bus, and generates the 
appropriate memory command. The interface board also 
acknowledges I/O commands as required by the Intellec 
bus. In addition to supporting a second set of double 
density 
drives, the diskette 
controller 
may co-reside 


with the Intel single density controller to allow up to 2.5 
million bytes of on·line storage. 


MUL TIBUS Capability 


All Intellec 
Series II models implement 
the industry 


standard MULTIBUS. MULTIBUS enables several bus 
masters, such as CPU and DMA devices, to share the 
bus and memory by operating at different priority levels. 
Resolution of bus exchanges is synchronized by a bus 
clock 
signal 
derived 
independently 
from 
processor 


clocks. Read/write transfers may take place at rates up 
to 5 MHz. The bus structure is suitable for use with any 
Intel microcomputer family. 


Host Processor 
(IPB) 
RAM - 
64K (system monitor occupies 62K through 64K) 
ROM - 
4K (2K in monitor, 2K in boot/diagnostic) 


Diskette 
System Capacity 
(Basic Two Drives) 


Unformatted 
Per Disk: 6.2 megabits 
Per Track: 82.0 kilobits 
Formatted 
Per Disk: 4.1 megabits 
Per Track: 53.2 kilobits 


Diskette 
Performance 
Diskette System Transfer Rate - 
500 kilobits/sec 
Diskette System Access Time 
Track·to·Track: 10 ms 
Head Settling Time: 10 ms 
Average Random Positioning Time - 
260 ms 
Rotational Speed - 
360 rpm 
Average Rotational Latency - 
83 ms 
Recording Mode - 
M2FM 


Physical Characteristics 
Width - 
17.37 in. (44.12 em) 
Height - 
15.81 in. (40.16 em) 
Depth - 
19.13 in. (48.59 em) 
Weight - 
73 Ib (33 kg) 


Keyboard 
Width - 
17.37 in. (44.12 em) 
Height - 
3.0 in. (7.62 em) 
Depth - 
9.0 in. (22.86 em) 
Weight - 
6 Ib (3 kg) 


Dual Drive Chassis 
Width - 
16.88 in. (42.88 em) 
Height - 
12.08 in. (30.68 em) 
Depth - 
19.0 in. (48.26 em) 
Weight - 
64 Ib (29 kg) 


Electrical 
Characteristics 
DC Power Supply 


Volts 
Amps 
Typical 
Suppllad 
Supplied 
System 
ReqUirements 


+ 
5±5% 
30 
14.25 
+ 12± 5% 
2.5 
0.2 
-12±5% 
0.3 
0.05 
-10± 
5% 
1.5 
15 
+ 15± 5% 
1.5 
1.3 
+ 24± 5% 
1.7 


Environmental 
Characteristics 


Operating Temperature - 
0° to 35°C (95°F) 


Equipment 
Supplied 


Model 230 chassis 
Integrated processor board (IPB) 
I/O controller board (IOC) 
32K RAM board 
CRT and keyboard 
Double density floppy disk controller (2 boards) 
Qual drive floppy disk chassis and cables 
2 floppy disk drives (512K byte capacity each) 
ROM·resident system monitor 


ISIS·II system diskette with MCS·80/MCS·85 
macroassembler 


Reference 
Manuals 


9800558 - 
A Guide to Microcomputer 
Development 


Systems (SUPPLIED) 


9800550 - 
Intellec 
Series II Installation 
and Service 


Guide (SUPPLIED) 


9800306 - 
ISIS·II System User's Guide (SUPPLIED) 


9800556 - 
Intellec Series II Hardware Reference Man· 


ual (SUPPLIED) 


9800301 - 
8080/8085 Assembly 
Language Program· 


ming Manual (SUPPLIED) 


9800292 - 
ISIS·II 8080/8085Assembler Operator's Man· 


ual (SUPPLIED) 


9800605 - 
Intellec Series II Systems Monitor Source 


Listing (SUPPLIED) 


9800554 - 
Intellec Series II Schematic Drawings 
(SUPPLIED) 


Reference manuals are shipped with each product only 
if designated SUPPLIED (see above). Manuals may be 
ordered from any Intel sales representative, distributor 
office or from Intel Literature Department, 3065 Bowers 
Avenue, Santa Clara, California 95051. 


ORDERING 
INFORMATION 
Part Number 
Description 


MDS·230 
Intellec Series II Model 230 
microcomputer 
development system 
(110V/60 Hz) 


MDS·231 
Intellec Series II Model 230 
microcomputer 
development system 
(220V/50 Hz) 


inter 
MODEL 286 
INTELLEC® SERIES III 
MICROCOMPUTER 
DEVELOPMENT SYSTEM 


• Supports Intellec 432/100 Evaluation 


and Educational System 


• Compatible with iSBC-090 Series 90 


Memory System Upgrade: 512K Byte 
to 1M Byte 


• Complete 16-bit High Performance, 
Microcomputer 
Development Solution 


for Intel iAPX 86,88Applications. Also 
Supports MCS-85™,MCS-80 and 
MCS-48 Families 


• Supports Full Range of iAPX 86,88- 


Resident, High-Level Languages: Pl/M 
86/88, PASCAL 86/88, and FORTRAN 
86/88 


• 2 Host CPUs-iAPX 
86 and 8085A-for 


Enhanced System Performance and 
Two Native Execution Environments 


• 96K Bytes of User Program RAM 


Memory Available for iAPX 86,88 
Programs 


• Series 11/80and Series 11/85 


Upgradeable to 8085/iAPX 86 Series III 
Functionality 


• Intellec Model 800 Upgradeable to 


8080/iAPX 86 Series III Functionality 


• Compatible with Intellec Distributed 


Development Systems 


• Compatible with Previous Intellec 


Systems 


• Software Applications 
Debugger for 


User iAPX 86,88 Programs 


• Upgradeable to a Complete Ethernet* 


Communications 
Development System 


Environment, Using the Model 677 
Upgrade 


The Intellec 
Series-III Microcomputer 
Development 
System is a high-performance 
system solution 
designed 
specifically 
for iAPX 86,88 microprocessor 
development. 
It contains 
two host CPUs, an iAPX 86 and an 8085, 


that provide two native execution 
environments 
for optimum 
performance 
and compatibility 
with the lntellec 


software 
packages for both CPUs. The basic system includes 
96K bytes of iAPX 86,88 user RAM memory and a 
250K byte floppy disk drive. The powerful 
Disk Operating 
System maximizes 
system processing 
by utilizing 
the 
power of both host processors. 
Standard 
software 
includes 
a full range of iAPX 86,88 resident 
software. 
The 
high-level 
languages 
PUM 
86/88, PASCAL 86/88, and FORTRAN 86/88 are also available. 
A ROM resident 
software 
debugger 
not only provides 
self-test 
diagnostic 
capability, 
but also gives the user a powerful 
iAPX 


86,88 applications 
debugger. 


The following 
are trademarks 
ollnlel 
Corporation 
and its affiliates 
and may be used only to idenlify 
Intel products: 
i, Intel,INTEL, 
INTELLEC. 
MeS. 1m.ies, ICE. UP!. exp, iSBC, iSBX. 


InSlle, 
iRMX, 
System 
2000. CREDIT. 
IRMXlBO. MUL TIBUS, 
PROMPT, 
Promware, 
Megachassis. 
Library 
Manager. 
MAIN 
MUL TIMODULE, 
and the combination 
of MeS, 
ICE. iSBC. 
tRMX or 


iCS and a numerical 
suffix. © INTEL CORPORATION, 
1981. 
AFN-01588B 


8-124 
121670-002 


The Intellec Series III is contained 
in a single pack- 
age consisting 
of a CRT chassis 
with a 6-slot card 


cage, power supply, fans, cables, single floppy disk 
drive, detachable 
upper/lower 
case full ASCII key- 


board, and four printed 
circuit 
cards. A block diag- 
ram of the system is shown 
in Figure 1. 


Two CPU cards 
reside 
on the Intellec 
MUL T1BUS 


bus, 
each 
containing 
its 
own 
microprocessor, 
memory, 
I/O, interrupt 
and bus interface 
circuitry 
implemented 
with Intel's 
high technology 
LSI com- 


ponents. 
The integrated 
processor 
card (IPC-85), 


occupies 
the first 
slot in the cardcage. 
A second 


CPU card, the resident 
processor 
board 
(RPB-86) 


contains 
Intel's 16-bit HMOS microprocessor. 
These 


CPUs provide 
the dual processor 
environment. 


1'1mlro 
I.,t"'U cara penorms 
all remaining 
I/U InClua- 


ing interface 
to the CRT, integral 
floppy 
disk, and 


keyboard. 
This card, 
mounted 
on the rear panel, 


contains 
its own microprocessors, 
RAM and ROM 


memory, 
and I/O interface 
logic. 
Known 
as the I/O 


controller 
(IOC), this slave CPU card communicates 


with the IPC-85 over an 8-bit bidirectional 
data bus.A 


64K byte 
RAM expansion 
memory 
board 
is also 


included. 


Two additional 
slots 
in the system 
card cage are 


available 
for system expansion. 
The Intel lee expan- 


sion chassis Model 201 is available 
to provide 4 ad- 


ditional 
expansion 
slots for either memory or I/O ex- 


pansion. 


THE INTELLEC 
DEVELOPMENT 
SYSTEM 
FOR 


ETHERNET 
(DS/E) 


The Intellec Series III can be expanded 
to provide the 


user with 
the tools 
necessary 
to develop 
and test 


r--;;ER~E~-'lr-- 
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communications 
software 
and applications 
that will 
use Ethernet 
as a communications 
subsystem. 
The 
power of the Intellec Series III combined 
with Model 
677 allows 
the user to develop 
either 
8- or 16-bit 
Ethernet-based 
applications. 


THE INTELLEC 
432/100 
EVALUATION 
AND 
EDUCATIONAL 
SYSTEM 
The Intellec 
Series 
III provides 
a complete 
system 
environment 
necessary 
for evaluation 
of the Intel 
iAPX 432 32-bit micromainframe. 
The iSBC 432/100 
board 
plugs 
into a Multibus 
slot in the Intellec 
Se- 
ries III, sharing 
system 
memory 
and resources. 
A 
comprehensive 
set 
of 
docu mentation, 
system 
software 
and hardware 
provides the evaluation 
and 
educational 
environment 
for the powerful 
iAPX 432. 


The Intellec 
Series 
III provides 
a complete 
system 
environment 
for evaluation 
of the iAPX 286 micro- 
processor's 
architecture 
and its instruction 
set, seg- 
mentation 
timing, 
memory 
mapping 
and protection 
features. A user can begin the development 
of com- 
plex iAPX 286 programs, 
systems and operating 
sys- 
tem nuclei 
with the Intellec 
Series III and iAPX 286 
evaluation 
package. 


IPC·85 
The heart of the IPC-85 is an Intel NMOS 8-bit micro- 
processor, 
the 8085A-2, 
running 
at 4.0 MHz. 64K 
bytes of RAM memory 
are provided 
on the board 
using 
16K dynamic 
RAMs. 4K of ROM is provided, 


preprogrammed 
with system 
bootstrap 
"self-test" 


diagnostics 
and the Intellec 
System 
Monitor. 
The 
eight-level 
vectored 
priority 
interrupt 
system allows 
interrupts 
to be individually 
masked. 
Using Intel's 
versatile 
8259A interrupt 
controller, 
the interrupt 
system may be user programmed 
to respond to indi- 
vidual 
needs. 


RPB·86 
The heart 
of the RPB-86 
is an Intel 
HMOS 
16-bit 
microprocessor, 
the iAPX 86 (8086). running 
at 5.0 
MHz. 64K bytes of RAM memory are provided 
on the 
board. 
16K of ROM is provided 
on board, 
prepro- 
grammed 
with an iAPX 88/86 applications 
debugger 
which 
provides 
features 
necessary 
to debug 
and 
execute 
application 
software 
for the iAPX 88/86 
microprocessors. 


The 8085A-2 and iAPX 86 access two independent 
memory 
spaces. This allows 
the two processors 
to 
execute 
concurrently 
when an iAPX 88/86 program 
is run. 
In this 
mode, 
the IPC-85 
becomes 
an in- 
tellegent 
I/O processor 
board to the RPB-86. 


IPC-85 
SERIAL 
CHANNELS 
The I/O subsystem 
in the Series III consists 
of two 
parts: the 10C card and two serial channels 
on the 
IPC-85 itself. 
Each serial channel 
is independently 
configurable. 
Both are RS232-compatible 
and is ca- 
pable of running 
asynchronously 
from 
110 to 9600 
baud or synchronously 
from 
150 to 56K baud. Both 
may be connected 
to a user defined 
data set or ter- 
minal. One channel 
contains 
current 
loop adapters. 


Both channels 
are implemented 
using Intel's 8251A 
USART. They can be programmed 
to perform 
a var- 
iety of I/O functions. 
Baud rate selection 
is accom- 
plished 
through 
an Intel 8253 interval 
timer. 
The 
8253 also serves as a real-time 
clock 
for the entire 
system. 
I/O activity 
through 
each serial 
channel 
is 
independently 
signaled 
to the system 
through 
a 
second 8259A (slave) interrupt 
controller, 
operating 
in a polled 
mode nested to the master 8259A. 


IOC INTERFACE 
The remainder 
of the system I/O activity 
is handled 
by the 10C. The 10C provides the interface 
and con- 
trol for the keyboard, CRT, integral floppy disk drive, 
and standard 
Intellec-compatable 
peripherals 
in- 


cluding 
printer, 
high 
speed 
paper 
tape 
reader/ 


punch, 
and universal 
PROM programmer. 
The 10C 
contains 
its own independent 
microprocessor, 
an 
8080A-2. 
This 
CPU issues 
commands, 
receives 
status, 
and controls 
all I/O operations 
as well 
as 
supervising 
communications 
with the IPC-85. The 
10C contains 
interval timers, its own 10C bus system 
controller, 
and 8K bytes of ROM for all I/O control 
firmware. 
The 8K bytes of RAM are used for CRT 
screen 
refresh 
storage. 
Neither 
the ROM nor the 
RAM occupy 
space 
in the Intellec 
Series 
III main 
memory 
address range because the 10C is a totally 
independent 
microcomputer 
subsystem. 


Integral CRT 


DISPLAY 
The CRT is a 12-inch raster scan type monitor 
with a 
50/60 Hz vertical 
scan rate and 15.5 kHz horizontal 
scan rate. Controls 
are provided 
for brightness 
and 
contrast 
adjustments. 
The interface 
to the CRT is 
provided 
through 
an Intel 
8275 single 
chip 
pro- 
grammable 
CRT controller. 
The master 
processor 
on the IPC-85 transfers 
a character 
for display to the 
10C, where it is stored 
in RAM. The CRT controller 
reads a line at a time into its line buffer through 
an 
Intel 8257 DMA Controller. 
It then feeds one charac- 
ter at a time to the character 
generator 
to produce 
the video 
signal. 
Timing 
for the CRT control 
is 
provided 
by an Intel 8253 programmable 
interval 


timer. The screen display is formatted 
as 25 rows of 
80 characters. 
The full set of ASCII characters 
are 
displayed, 
including 
lower case alphas. 


KEYBOARD 
The keyboard 
interfaces 
directly 
to the IOC pro- 
cessor via an 8-bit data bus. The keyboard 
contains 
an Intel 
UPI-41A 
Universal 
Peripheral 
Interface, 
which scans the keyboard 
and encodes the charac- 
ters to provide N-key roll over. The keyboard itself is 
a typewriter 
style keyboard containing 
the full ASCII 
character 
set. An upper/lower 
case switch allows the 
system to be used for dor.wTlent preparation. 
Cursor 
control 
keys are also provided. 


A UPI-41A Universal Peripheral 
Interface 
on the IOC 
board 
provides 
built-in 
interface 
for 
standard 
Intellec-compatable 
peripherals 
including 
a printer, 
high 
speed 
paper 
tape 
reader, 
high 
speed 
paper 
tape 
punch, 
and 
universal 
PROM 
programmer. 
Communication 
between 
the 
IPC-85 
and 
IOC is 
maintained 
over a separate 8-bit bidirectional 
data 
bus. Connectors 
for the four devices named above, 


as well as the two serial channels, 
are mounted 
di- 
rectly on the IOC itself. 


User control 
is maintained 
through 
a front 
panel, 


consisting 
of a power 
switch 
and indicator, 
reset/ 


boot 
switch, 
run/halt 
light 
and 
eight 
interrupt 
switches and LED indicators. 
The front panel circuit 
board is attached directly to the IPC-85, allowing 
the 
eight 
interrupt 
switches 
to 
connect 
the 
master 
8259A, as well as to the Intellec 
Series III bus. 


User program control 
in the iAPX 88/86 environment 
of the Intellec Series III is also directed 
through 
key- 
board control 
sequences 
to transfer 
control 
to the 
iAPX 88/86 applications 
debugger, 
abort a user pro- 
gram 
or translator 
and 
returning 
control 
to the 
IPC-85. 


The integral floppy disk is controlled 
by an Intel 8271 
single 
chip, 
programmable 
floppy 
disk controller. 
The disk provides capacity of 250K bytes. It transfers 
data via an Intel 8257 DMA Controller 
between 
an 
IOC RAM buffer and the diskette. 
The 8271 handles 
reading 
and writing 
of data, formatting 
diskettes, 


and reading status, all upon appropriate 
commands 
from the IOC microprocessor. 


The Intellec 
Series III Double 
Density Diskette 
Sys- 
tem provides direct access bulk storage, 
intelligent 
controller 
and 
two 
diskette 
drives. 
Each 
drive 
provides 
1/2 million 
bytes of storage 
with 
a data 
transfer 
of 500,000 bits/second. 
The controller 
is 
implemented 
with Intel's powerful 
Series 3000 Bipo- 
lar Microcomputer 
Set and supports 
up to four disk- 
ette drives 
to allow 
more than 
2 million 
bytes of 
on-line storage. 


The diskette 
controller 
consists 
of two boards, the 
channel 
board and the interface 
board. These two 
PC boards 
reside 
in the Intellec 
Series 
III system 
chassis. The channel 
board receives, decodes 
and 
responds 
to channel 
commands 
from the 8085A-2 
CPU on the IPC-85. The interface board provides the 
diskette 
controller 
with a means of communication 
with the disk drives and with the Intellec system bus. 
The interface 
board also validates 
data during 
disk 
transactions. 


An 
additional 
cable 
and 
connectors 
are 
also 
supplied 
to optionally 
convert 
the integral 
floppy 
disk from single density to double 
density. 


The Intellec 
Series 
III Hard Disk System 
provides 
direct access bulk storage, intelligent 
controller 
and 
a disk drive containing 
one fixed 
platter 
and one 
removable 
cartridge. 
Each provides 
approximately 
3.65 million 
bytes of storage with a data transfer rate 
of 2.5 Mbits/second. 
The controller 
is implemented 
with Intel's Series 3000 Bipolar 
Microcomputer 
Set. 


The controller 
provides 
an interface 
to the Intellec 
Series III system bus, as well as supporting 
up to 2 
disk drives. The disk system records all data in Dou- 
ble Frequency 
(FM) on 2 surfaces 
per platter. 
Each 
platter 
can 
be write 
protected 
by a front 
panel 
switch. 


HARD 
DISK CONTROLLER 
BOARDS 
The disk controller 
consists 
of two boards 
which 
reside in the Intellec 
Series III system chassis. The 
disk system is capable of performing 
six operations: 


recalibrate, 
seek, format track, write data, read data, 


and verify CRC. In addition 
to supporting 
a second 
drive, 
the 
disk 
controller 
may co-exist 
with 
the 
double-density 
diskette 
controller 
to allow up to 17 
million 
bytes of on-line 
storage. 


All models 
of the Intellec 
Series III implement 
the 
industry 
standard 
MUL TIBUS protocol. 
The MUL TI- 
BUS architecture 
allows several bus masters, such 
as CPU and 
DMA devices, 
to share the bus and 
memory 
by operating 
at different 
priority 
levels. 


Resolution 
of bus exchanges 
is synchronized 
by a 
bus clock 
signal 
derived 
independently 
from 
pro- 
cessor clocks. 
Read/write 
transfers 
may take place 
at rates up to 5 MHz. The bus structure 
is suitable for 
use with any Intel microcomputer 
family. 


System Software Features 


The Model 286 offers many key advantages for iAPX 
86,88 applications 
and Intellec 
Development 
Sys- 
tems: enhanced system performance 
through 
a dual 
host 
CPU environment, 
a full 
spectrum 
of iAPX 
86,88-resident 
high-level 
languages, expanded 
user 
program space for iAPX 86,88 programs, 
and a pow- 
erful high-level 
software 
applications 
debugger 
for 
iAPX 86,88 microprocessor 
software. 


Dual Host CPU- 
The addition 
of a 16-bit 8086 to the 
existing 
8-bit host CPU increases 
iAPX 86,88 com- 
pilation 
speeds 
and provides 
for iAPX 86,88 code 
execution. 
When the 8086 is executing 
a program, 


the 8-bit CPU off-loads 
all I/O activity 
and operates 
as an intelligent 
I/O controller 
to double 
buffer data 
to and from 
the 8086. The 8086 also provides 
an 
execution 
vehicle for 8086 and 8088 object code. An 
added 
benefit 
of two host microprocessors 
is that 


8-bit translations 
and applications 
are handled 
by 
the 8-bit CPU, and 16-bit translations 
and applica- 
tions are handled by the 8086. This feature provides 
complete 
compatibility 
for 
current 
systems 
and 
means that software 
running 
on current 
Intellec De- 
velopment 
Systems will run on the new system. 


High-Level 
Languages 
for iAPX 86,88- 
The Model 
286 allows the current 
Intellec 
system user to take 
advantage 
of a breadth 
of new resident 
iAPX 86,88 
high-level 
languages: 
PUM 
86/88, PASCAL 86/88, 


and 
FORTRAN 
86/88. 
The 
iAPX 86,88 
Resident 
Macro 
Assembler 
and these 
high-level 
language 
compilers 
execute 
on the 8086 host CPU, thereby 
increasing 
system performance. 


Expanded 
Program 
Memory-By 
adding 
a Model 


286 to an existing Intellec Development 
System, 96K 
bytes of user program 
RAM memory are made avail- 
able for iAPX 86,88 programs. 
System 
memory 
is 
expandable 
by adding 
additional 
RAM 
memory 
modules. 
This, combined 
with 
the two 
host 
CPU 
system architecture, 
dramatically 
increases the pro- 
cessing 
power of the system. 


Software 
Applications 
Debugger-The 
RPB-86 
contains 
the applications 
debugger 
which 
allows 
iAPX 86,88 programs 
to be developed, 
tested, and 
debugged 
within 
the Intellec system. The debugger 
provides a subset of In-Circuit 
Emulator 
commands 
such as symbolic 
debugging, 
control 
structures 
and 
compound 
commands 
specifically 
oriented 
toward 
software 
debug 
needs. 


INTEGRATED 
PROCESSOR CARD 
-(IPC-85) 
8085A-2 based, operating 
at 4 MHz 
-64K 
RAM, 4K ROM (2K in monitor 
and 2K in boot/ 


diagnostic) 


RESIDENT PROCESSOR BOARD 
-(RPB-86) 
8086 based, operating 
at 5 MHz, 64K 
RAM, 16K ROM (applications 
debugger) 


BUS 
-MUL 
TIBUS bus, maximum 
transfer 
rate of 5 MHz 


DIRECT MEMORY ACCESS 
-(DMA) 
Standard capability 
on the MUL TIBUS bus; 


implemented 
for 
user 
selected 
DMA 
devices 
through 
optional 
DMA module 
-Maximum 
transfer 
rate of 2 MHz 


Capacity-250K 
bytes (formatted) 
Transfer 
Rate-160K 
bits/see 
Access Time- 
Track to Track: 
10 ms max. 
Average Random 
Positioning: 
260 ns 
Rotational 
Speed: 360 rpm 
Average Rotational 
Latency: 
83 ms 
Recording 
Mode: FM 


Capacity- 
Per Disk: 4.1 megabits 
(formatted) 
Per Track: 
53.2 kilobits 
(formatted) 
Transfer 
Rate-500 
kilobits/sec 
Access Time- 
Track to Track: 
10 ms 
Head Setting 
Time: 10 ms 
Average 
Random 
Positioning 
Time-260 
ms 


Rotational 
Speed-360 
rpm 
Average Rotational 
Latency: 83 ms 
Recording 
Mode: M2 FM 


- 
... 


" 


" 


Volts 
A~ps 
Typical 
System 
Supplied 
Sl.lppli~d 
Requirements 


+ 5 ± 5% 
30.0 
17.0 


+12 ± 5% 
2.5 
1.1 


-12 ± 5% 
0.3 
0.1 


-10 ± 5% 
1.0 
0.08 


+15±5%' 
1.5 
1.5 


+24 ± 5%' 
1.7 
" 
1.7 


" 


Type-5440 
top 
loading 
cartridge 
and one .fixed 
platter 
Tracks per Inch-200 
Mechanical 
Sectors per Track-12 
Recording 
Technique-double 
frequency 
(FM) 
Tracks per Surface-400 
Density-2,200 
bits/inch 
Bits per Track-62,500 
Recording 
Surfaces perPlatter-2 
Capacity- 
Per Surface-15M 
bits 
Per Platter-29M 
bits 
Per Drive-59M 
bits 
Per Drive-7.3M 
bytes (formatted) 
Transfer Rate-2.5M 
bits/see . 


Access Time- 
Track to Track: 13 ms max 
Full Stroke: 100 !TIS 
Rotational 
Speed: 2,400 rpm 


AC Requiremen'ts ~or Mainframe 


110V, 60 Hz-5.9 
Amp 
220V, 50 Hz-3.0 
Amp 


System Operating Te'mperature-O° 
to 35°C 
(32°F to 95°F) 


Humidity-20% 
to 80% 


Width-17.37 
in. (44.12 em) 
Height-15.81 
in. (40.16 em) 
Depth-19.13 
in. (48.59 em) 
Weight-81 
lb. (37 kg) 


KEYBOARD 
Width-17.37 
in. (44.12 em) 
Height-3.0 
in. (7.6 em) 
Depth-9.0 
in. (22.86 em) 
Weight-6 
lb. (3 kg) 


Intellec 
Series 
1/1Microcomputer 
Development 
Sys- 
tem Product 
Overview, 
121575 


A Guide 
to Intellec 
Series 
1/1Microcomputer 
Devel- 
opment 
Systems, 
121632-001 


Intellec 
Series /II Microcomputer 
Development 
Sys- 
tem Console 
Operating 
Instructions, 
121609 


Intellec 
Series 
1/1Microcomputer 
Development 
Sys- 


tem Pocket 
Reference, 
121610 


Intellec 
Series 
/II Microcomputer 
Development 
Sys- 
tem Programmer's 
Reference, 
121618 


iAPX 88/86 Family 
Utilities 
User's 
Guide 
for 
8086- 
Based 
Development 
Systems, 
121616 


8086/8087/8088 
Macro 
Assembly 
Language 
Refer- 


ence Manual 
for 8086-Based 
Development 
Systems, 


121627 


8086/8087/8088 
Macro 
Assembly 
Language 
Pocket 
Reference, 
9800749 


DUAL FLOPPY 
DRIVE 
SYSTEM 
(OPTION) 
Width-16.88 
in. (42.88 em) 
Height-12.08 
in. (30.68 em) 
Depth-1.0 
in. (48.26 em) 
Weight-64 
lb. (29 kg) 


HARD 
DISK DRIVE 
SYSTEM 
(OPTION) 
Width-18.5 
in. (47.0 em) 
Height-34.0 
in. (86.4 em) 
Depth-29.75 
in. (75.6 em) 
Weight-202 
lb. (92 kg) 


8086/8087/8088 
Macro 
Assembler 
Operating 
In- 


structions 
for 
8086-Based 
Development 
Systems, 
121628 


Intellec 
Series 
III Microcomputer 
Development 
Sys- 


tem Installation 
and 
Checkout 
Manual, 
121612 


Intellec 
Series 
III Microcomputer 
Development 
Sys- 


tem Schematic 
Drawings, 
121642 


ISIS-II 
CREDIT 
(CRT-Based 
Text 
Editor) 
User's 


Guide, 
9800902 


ISIS-II 
CREDIT (CRT-Based 
Text Editor) 
Pocket 
Ref- 


erence, 
9800903 


The 8086 Family 
User's 
Manual, 
9800722 


The 8086 
Family 
User's 
Manual, 
Numeric 
Supple- 


ment, 
121586 


For Series III Plus Hard Disk Systems Only: 


Model 
740 
Hard 
Disk 
Subsystem 
Operation 
and 


Checkout, 
9800943 


Part Number 
Description 


DS286 KIT 
Intellec 
Series III Model 
286 Micro- 


computer 
Development 
System 


(110V/60Hz) 


DS287 KIT 
Intellec 
Series III Model 
287 Micro- 


computer 
Development 
System 


(220V/50Hz) 


DS286FD KIT 
Intellec 
Series III Model 
286 Micro- 
computer 
Development 
System with 
Dual 
Double 
Density 
Flexible 
Disk 
System (110V/60Hz) 


DS287FD KIT 
Intellec 
Series III Model 
287 Micro- 


computer 
Development 
System with 


Dual 
Double 
Density 
Flexible 
Disk 


System (220V/50Hz) 


DS286HD KIT 
Intellec 
Series 
III Model 
286 Micro- 


computer 
Development 
System with 


Pedestal 
Mounted 
Hard 
Disk. 


(110V/60Hz) 


DS287HD KIT 
Intellec 
Series III Model 
287 Micro- 


computer 
Development 
System with 


Pedestal 
Mounted 
Hard 
Disk. 


(220V/50Hz) 


Requires 
Software 
license 


• 
Executes on Series III iAPX 86 
Processor for Fastest Compilations 


• 
Language Is Upward Compatible from 
PL!M 80, Assuring MCS-80/85 Design 
Portability 


• 
Supports 16-Bit Signed Integer and 
32-Bit Floating Point Arithmetic in 
Accordance 
with IEEE Proposed 
Standard 


• 
Easy- To-Learn Block-Structured 
Language Encourages 
Program 
Modularity 


• 
Improved Compiler Performance 
Now 
Supports More User Symbols and 
Faster Compilation Speeds 


• 
Produces Relocatable 
Object Code 
Which Is Linkable to All Other 8086 
Object Modules 


• 
Code Optimization Assures Efficient 
Code Generation 
and Minimum 
Application Memory Utilization 


• 
Built-In Syntax Checker Doubles 
Performance for Compiling Programs 
Containing Errors 


Like its counterpart 
for MCS-80/85 program 
development, 
PL/M 86/88 is an advanced, structured 
high-level 
programming 
language. The PL/M 86/88 compiler 
was created specifically 
for performing 
software 
develop- 
ment for the Intel 8086 and 8088 Microprocessors. 


PL/M is a powerful, 
structured, 
high-level system implementation 
language in which program statements 
can 
naturally 
express the program 
algorithm. 
This frees the programmer 
to concentrate 
on the logic of the pro- 


gram without 
concern for burdensome 
details of machine or assembly language programming 
(such as regis- 
ter allocation, 
meanings 
of assembler 
mnemonics, 
etc.). 


The PL/M 86/88 compiler 
efficiently 
converts free-form 
PL/M language 
statements 
into equivalent 
80.88/8086 
machine instructions. 
Substantially 
fewer PL/M statements 
are necessary for a given application 
than if it were 
programmed 
at the assembly 
language 
or machine 
code level. 


The use of PL/M high-level 
language for system programming, 
instead of assembly language, results in a high 
degree of engineering 
productivity 
during 
project development. 
Tflis translates 
into significant 
reductions 
in 
initial 
software 
development 
and follow-on 
maintenance 
costs for the user. 


NOTE: 
The 
IntelleeS 
Microcomputer 
Development 
System 
pictured 
here 
is not included 
with 
the PWM 
86188 
Software 
Package 
but merely 
depicts 
a language 
in its operating 


environment. 


The following 
are trademarks 
or Intel Corporation 
and may be used only to identify 
Intel products: 
i, Intel, INTEL. 
INTELLEC. 
MeS. 
im, les. 
ICE. UPI. exp, 
iSBC. 
iSBX, 
iNSlTE, 
iAt.1X. 


CREDIT, 
AMXlBO,I-tScope. 
Multibus, 
PROMPT, 
Promware. 
Megachassis, 
libraryManager, 
MAIN MULTI MODULE, 
and the combination 
of MeS. ICE. sec. RMX or ieS and a numerical 
suffiX; 
•. g., iSBC-80. 


© Intel Corporation 
1980 


Major features of the Intel PUM 86/88 compiler 
and 


programming 
language 
include: 


PUM source 
code is developed 
in a series of mod- 


ules, procedures, 
and blocks. Encouraging 
program 


modularity 
in this manner 
makes programs 
more 


readable, 
and easier to maintain 
and debug. 
The 


language becomes more flexible, by clearly defining 
the scope of user variables (local to a private proce- 
dure, global to a public 
procedure, 
for example). 


The use of procedures 
to break down a large prob- 


lem is paramount 
to productive 
software 
develop- 
ment. The PL/M 86/88 implementation 
of a block 


structure 
allows the use of REENTRANT (recursive) 
procedures, 
which 
are especially 
useful in system 


design. 


PUM 86/88 object modules are compatible 
with ob- 
ject 
modules 
generated 
by all other 
86/88 trans- 
lators. 
This 
means 
that 
PUM 
programs 
may be 


linked 
to 
programs 
written 
in any other 
86/88 


language. 


Object 
modules 
are compatible 
with 
ICE-88 and 


ICE-86 units; DEBUG compiler 
control 
provides the 


In-Circuit 
Emulators 
with 
symbolic 
debugging 


capabilities. 


PL/M 86/88 Language 
is upward-compatible 
with 


PUM 80, so that application 
programs may be easily 


ported to run on the iAPX 86 or 88. 


PUM makes use of five data types for various appli- 
cations. 
These data types range from one to fou r 


bytes, and facilitate 
various 
arithmetic, 
logic, 
and 


addressing 
functions: 


- 
Byte: 8-bit unsigned 
number 


- 
Word: 
16-bit unsigned 
number 
- 
Integer: 
16-bit signed number 


- 
Real: 32-bit floating 
point number 


- 
Pointer: 
16-bit 
or 
32-bit 
memory 
address 


indicator 


Another 
powerful 
facility 
allows the use of BASED 


variables 
that map more than one variable 
to the 


same memory location. 
This is especially 
useful for 


passing parameters, 
relative and absolute 
address- 
ing, and memory allocation. 


In addition to the five data types and based variables, 
PUM supports two data structuring 
facilities. 
These 


add flexibility 
to the referencing 
of data stored 
in 


large groups. 


- 
Array: Indexed list of same type data elements 


- 
Structure: 
Named collection 
of same or different 


type data elements 
- 
Combinations 
of Each: Arrays of structures 
or 


structu res of arrays 


PUM programs 
that use 32-bit 
REAL data may be 


executed 
using the Numeric Data Processor for im- 


proved 
performance. 
All floating-point 
operations 


supported 
by PUM may be executed 
on the iAPX 


86/20 or 88/20 NDP, or the 8087 Emulator (a software 
module) 
provided 
with the package. Determination 


of use of the chip or Emulator 
takes place at Iink- 


time, 
allowing 
compilations 
to 
be 
run-time 
independent. 


The PUM 86/88 language contains built-in functions 
for string 
manipulation. 
These byte and word func- 


tions perform the followi ng operations 
on character 


strings: 
MOVE, COMPARE, TRANSLATE, SEARCH, 


SKIP, and SET. 


PUM has the facility for generating 
interrupts 
to the 


iAPX 86 or 88 via software. 
A procedure 
may be 
defined 
with 
the 
INTERRUPT 
attribute, 
and 
the 


compiler 
will 
automatically 
initialize 
an interrupt 
vector 
at the 
appropriate 
memory 
location. 
The 


compiler 
will also generate 
code to same and re- 


store 
the 
processor 
status, 
for 
execution 
of the 


user-defined 
interrupt 
handler 
routine. 
The pro- 


cedure 
SET$INTERRUPT, 
the 
function 
retuning 


an 
INTERRUPT$PTR, 
and 
the 
PUM 
statement 


CAUSE$INTERRUPT 
all 
add 
flexibility 
to 
user 


programs 
involving 
interrupt 
and handling. 


Including 
several 
that have been mentioned, 
the 
PL/M 86/88 compiler 
offers 
more than 25 controls 
that facilitate 
such features as: 


- 
Conditional 
compilation 
- 
Including 
additional 
PL/M source files from disk 
- 
Intra- and Inter-module 
cross reference 
- 
Corresponding 
assembly 
language 
code in the 
listing 
file 
- 
Setting overflow conditions 
for run-time handling 


The PL/M 86/88 compiler 
takes full advantage 
of 
program 
addressing 
with the SMALL, COMPACT, 
MEDIUM, and LARGE segmentation 
controls. 
Pro- 
grams 
with 
less than 64KB total 
code space can 
exploit 
the 
most 
efficient 
memory 
addressing 


schemes, which lowers total memory requirements. 
Larger 
programs 
can exploit 
the flexibility 
of ex- 


tended 
one-megabyte 
addressing. 


The PL/M 86/88 compiler 
offers four levels of optimi- 


zation 
for significantly 
reducing 
overall 
program 


size. 


- 
Combination 
or 
"folding" 
of 
constant 
ex- 


pressions; 
and short-circuit 
evaluation 
of Boo- 


lean expressions. 


- 
"Strength 
reductions" 
(such as a shift left rather 


than multiply 
by 2); and elimination 
of common 


sub-expressions 
'within 
the same block. 


- 
Machine 
code 
optimizations; 
elimination 
of 


superfluous 
branches; 
re-use of duplicate 
code; 


removal of unreadable 
code. 


- 
Byte comparisons 
(rather 
than 20-bit 
address 


calculations) 
for pointer 
variables; 
optimization 


of based-variable 
operations. 


The PL/M 86/88 compiler 
has a very powerful feature 


to speed up compilations. 
If a syntax or program 


error 
is detected, 
the compiler 
will skip the code 


generation 
and optimization 
passes. This usually 


yields a 2X performance 
increase for compilation 
of 


programs 
with errors. 


A fully detailed set of programming 
and compilation 


errors is provided 
by the compiler. 


Performance 
benchmarks 
may provide valuable 
in- 


formation 
in estimating 
compile 
times for various 


programs. 
It is extremely 
important 
to understand, 


however, 
the effect of varying 
conditions 
on com- 


piler 
performance. 
Storage 
media, 
coding 
style, 


program length, and the use of INCLUDE files signif- 
icantly change the compiler's 
overall performance. 


We tested typical PL/M programs of varying lengths. 
The results are listed in Table 1. 


Program 
Size 
Compile 
Time(Sec) 
Lines/Minute 


SMALL 
(71) 
20 
213 


MEDIUM 
(610) 
54 
678 


LARGE 
(1710) 
128 
802 


LARGE 
(1403) 
129 
653 


(with very dense code, 
plus include 
file) 


MOO. 
. Beginning 
of module' 
------------.l 
PUBLIC and EXTERNAL 
attributes 
promote 


SORTPROC: 
PROCEOURE 
IPTR. 
COUNT. 
RECSIZE. 
KEYINDEX)~; 
I 
program 
modularity. 


DECLARE 
PTR POINTER. 
(COUNT. 
RECSIZE. 
KEYINDEX) 
INTEGER . 


• Parameters' 


PTR IS pOinter to first record 
COUNT 
IS number 
of records 
to be sorted. 


RECSIZE 
IS number 
of bytes 
In each record-max 
IS 128. 


KEY INDEX 
IS byte position 
within 
each 
record 
of a BYTE scalar 
to be used as sort key .. 


DECLARE 
RECORD 
BASED 
PTR 
II) 
BYTE. 


CURRENT 
(128) BYTE. 


(I. J) INTEGER: 


DO J 
1 TO COUNT-I: 
CALL 
MOVB(@RECORD(J·RECSIZE). 
. RECSIZE): 


I J. 


DO WHILE I 
0 
AND RECORDI(I 
1)'RECSIZE 
- KEYINDEX) 


CURRENTIKEYINDEX): 


CALL 
MOVB(@RECORD((I 
1)'RECSIZE). 


@RECORD(I'RECSIZE). 
RECSIZE): 


"Based" 
Variables 
allow 
manipulation 
of external 
data by 


passing 
the base of the data structure 
(a pOInter). ThIS 


minimizes 
the STACK space used for parameter 
passing. 
and 
the executIon 
time to perform 
many STACK 
operations. 


The "AT"' operator 
returns 
the address 
of a 
v8nable. 
instead of Its contents. 
ThiS 1$very useful 
In passing 
pointers 
for based variables. 


PL/M 86/88 is designed 
to be an efficient, 
cost- 
effective 
solution 
to the special 
requirements 
of 


iAPX 86 or 88 Microsystem 
Software 
Development, 


as illustrated 
by the following 
benefits of PL/M use: 


PL/M 86/88 is easy to learn and to use, even for the 
novice programmer. 


Critical 
projects 
are completed 
much earlier 
than 
otherwise 
possible 
because 
PL/M 86/88, a struc- 
tured 
high-level 
language, 
increases 
programmer 
productivity. 


Increases 
in programmer 
productivity 
translate 
im- 
mediately 
into lower 
software 
development 
costs 


because less programming 
resources 
are required 
for a given programmed 
function. 


PL/M 86/88 is designed to aid in the development 
of 


reliable 
software 
(PL/M 86/88 programs 
are simple 


statements 
of the program algorithm). 
This substan- 


tially reduces the risk of costly correction 
of errors in 


systems that have already 
reached full production 
status, as the more simply stated the program 
is, the 


more likely it is to perform 
its intended 
function. 


Easier Enhancements 
and Maintenance 


Programs written 
in PL/M tend to be self-document- 
ing, thus easier to read and understand. 
This means 


it is easier to enhance and maintain 
PL/M programs 


as the system capabilities 
expand 
and future 
prod- 


ucts are developed. 


inter 


REQUIRED 
SOFTWARE: 


ISIS-II Diskette 
Operating 
System, V4.1 or later 
Series III Operating 
System 
Operating 
Environment 


REQUIRED 
HARDWARE: 
Intellecl!!l Microcomputer 
Development 
System 


- 
Series III or equivalent 
Dual Diskette 
Drives 


- 
Single- 
or Double-Density 


System Console 
- 
CRT or Hardcopy 
Interactive 
Device 


PLIM-B6 
User's 
Guide 
for BOB6-based Development 


Systems 
(121636) 


OPTIONAL 
HARDWARE: 


Universal 
PROM Programmer 


line 
Printer 


ICE-86'M 


Part Number 
Description 


MDS-313* 
PLIM 86/88 Software 
Package 


·MDS is an ordering code only and is not used as a product name or trademark. MDS~ isa registered trademark of Mohawk Data Sciences 
Corporation. 


intel~ 


FORTRAN 86/88 
SOFTWARE PACKAGE 


• Features high-level language support 
for floating-point 
calculations, 
transcendentals, 
interrupt procedures, 
and run-time exception handling 


• Meets ANS FORTRAN 77 Subset 
Language Specifications 


• Supports iAPX 86/20, 88/20 Numeric 
Data Processor for fast and efficient 
execution of numeric instructions 


• Uses REALMATH Floating-Point 
Standard for consistent and reliable 
results 


• Offers powerful extensions tailored to 
microprocessor 
appli~ations 


• Offers upward compatibility 
with 
FORTRAN 80 


• Provides FORTRAN run-time support for 
iAPX 86,88-based design 
l 


• Provides users ability to do fo{matted 
and unformatted 
I/O with sequential or 
direct access methods 


FORTRAN 
86/88 meets the ANS FORTRAN 
77 Language 
Subset 
Specification 
and includes 
many features 
of 
the full standard. 
Therefore, 
the user is assured 
of portability 
of most existing 
ANS FORTRAN 
programs 
and of 
full 
portability 
from 
other 
computer 
systems 
with 
an ANS FORTRAN 
77 Compiler. 


FORTRAN 
86/88 programs 
developed 
and debugged 
on the iAPX 86 Resident 
Intellec 
Series III Microcomputer 
Development 
System 
may be: tested 
with 
the prototype 
using 
ICE symbolic 
debugging, 
and executed 
-on an 


RMX-86 
operating 
system, 
or on a user's 
iAPX 86,88-based 
operating 
system. 


FORTRAN 
86/88 is one of a complete 
family 
of compatible 
programming 
languages 
for iAPX 86,88 develop- 


ment: 
PUM, 
Pascal, 
FORTRAN, 
and Assembler. 
Therefore, 
users may choose 
the language 
best suited 
for a 


specific 
problem 
solution. 


NOTE: 
The 
Intellec. 
Microcomputer 
Development 
System 
shown 
here 
merely 
shows 
the operating 
environment 
and is not included 
with 
the software 
package. 


The following 
are trademarks 
of Intel Corporation 
and may be used only to identify 
Intel products: 
i, tntel, INTEL, 
INTELLEC, 
MeS, im, ies, 
ICE, UP!, exp, 
iSBC. 
iSBX, iNSITE. 
iAMX, 


CREDIT, 
RMXl80, 
ItScope, 
Multibus. 
PROMPT, 
Promware. 
Megachassis, 
Library 
Manager, 
MAIN 
MULTI MODULE. 
and the combination 
of MeS, 
ICE. sac, RMX or le$ and a numerical 
suffix: 
e.g., 
iSBC-80. 
© Intel 
Corporation, 
1981. 
8-,136 
APRIL 
1981 
AFN-()1653A 


Extensive High-Level Language 
Numeric Processing Support 
FORTRAN 86/88 language 
features 
support 
of iAPX 


86/20, 88/20 Numeric 
Data Processor 


Single (32-bit), double (64-bit), and double extended 
precision 
(80-bit) floating-point 
data types 


REALMATH 
Proposed 
IEEE 
Floating-Point 
Stan- 
dard) for consistent 
and reliable 
results 
' 
. 


Compiler 
generates 
in-line 
iAPX 86/20, 88/20 Nu- 


meric Data Processor 
object code for floating-point 
arithmetic 
(See Figu re 1) 


Full support 
for all other data types: integer, logical, 
character 


Intrinsics 
allow 
user to control 
iAPX 86/20, 88/20 
Numeric 
Data Processor 


Ability 
to use hardware 
(iAPX 86/20, 88/20 Numeric 
Data 
Processor) 
or software 
(simulator) 
floating- 
point 
support 
chosen 
at link time 


iAPX 86,88 architectural 
advantages 
used for index- 
ing and character-string 
handling 


Symbolic 
debugging 
of application 
using 
ICE-86 
and ICE-88 emulators 


TEMPER 
= 
(PRESS 
- 
VOLUM 
I 
QUEK) 
- 
3.45 
I 
(PRESS 
- 
VOLUM 
I 
QUEK) 
& 
- (PRESS 
- 
VOLUM 
I 
QUEK) 
* 
(PRESS 
- 
VOLUM 
I 
QUEK) 


I 
OBJECT CODE GENERATED I 


ASSEMBLER MNEMONICS I. 


I 
0013 
93D9060COO 
FLJ 
VDLUM 
0018 
9B08360000 
FDIV 
:;)UEK 
0010 
9B082E0800 
FSU8~ 
PRESS 
0022 
9BOO01 
FST 
T:lS+1H 
0025 
98ZE083EOOOO 
FDIV~ 
CS:@CONST 
0028 
9B09C9 
FXCrlG 
TOS+1H 
OOZE 
980002 
FST 
TDS+2H 
0031 
980EE9 
FSUBRP 
0034 
9B.09C1 
FLD 
T:JS+1H 
0037 
9B08C8 
FMUL 
TOS 
003A 
98DOC2 
FFREE 
TOS+2H 
0030 
980fE1 
FSUSP 
0040 
9B091E0400 
FSTP 
TEMPER 
0045 
98 
WAIT 


Figure 
1. Object 
Code Generated 
by FORTRAN 86/88 for a Floating-Point 
Calculation 
Using iAPX 86/20, 


88/20 Numeric 
Processor~ 


inter 


Microprocessor 
Application Support 


-Direct 
byte- or word-oriented 
port I/O 


-Reentrant 
procedures 


-Interrupt 
procedures 


Application 
object code may be executed in iAPX 86, 
88-based environment 
of user's choice: 


-a 
Series III Intellec Development 
System with Se- 


ries III Operating 
System 


-an 
iAPX 86,88-based system with iRMX-86 Operat- 
ing System 


-an 
iAPX 86,88-based system with 
user-designed 


Operating 
System 


Run-time 
exception 
handling 
for 
fixed-point 
nu- 


merics, floating-point 
numerics, 
and I/O errors 


Relocatable 
object 
libraries 
for complete 
run-time 


support 
of I/O and arithmetic 
functions. 
In-line code 


execution 
is generated 
for 
iAPX 86/20, 
88/20 Nu- 


meric Data Processor 


FORTRAN 86/88 provides 
a means of developing 


application 
software 
for the Intel iAPX 86,88 prod- 


ucts 
lines 
in a familiar, 
widely 
accepted, 
and 


industry-standard 
programming 
language. 
FOR- 
TRAN 86/88 will greatly enhance the user's ability to 
provide 
cost-effective 
software 
development 
for 


Intel 
microprocessors 
as 
illustrated 
by 
the 


following: 


FORTRAN 
is an 
industry-standard, 
high-level 


numerics 
processing 
language. 
FORTRAN 
pro- 


grammers 
can use FORTRAN 86/88 on micropro- 


cessor projects 
with little retraining. 
Existing 
FOR- 


TRAN software 
can 
be compiled 
with 
FORTRAN 


86/88 and programs 
developed 
in FORTRAN 86/88 


can run on other computers 
with ANS FORTRAN 77 


with 
little or no change. 
Libraries 
of mathematical 


programs 
using ANS 77 standards 
may be compiled 


with FORTRAN 86/88. 


Application Object Code 
Portability for a Processor Family 


FORTRAN 86/88 modules 
"talk" 
to the resident 
In- 


tellec 
development 
operating 
system using Intel's 


standard 
interface 
for 
all 
development-system 


software. 
This allows an application 
developed 
on 


the Series III operating 
system to execute on iRMXI 


86, or a user-supplied 
operating 
system by linking in 
the iRMX/86 or other appropriate 
interface library. A 


standard 
logical-record 
interface 
enables 
com- 


munication 
with non-standard 
I/O devices. 


Comprehensive, 
Reliable 
and Efficient Numeric Processing 


The unique 
combination 
of FORTRAN 86/88, iAPX 


86/20, 
88/20 
Numeric 
Data 
Processor, 
and 


REALMATH 
(Proposed 
IEEE Floating-Point 
Stan- 
dard) 
provide 
universal 
consistency 
in results 
of 


numeric 
computations 
and efficient 
object 
code 


generation. 


Operating 
Environment 


REQUIRED 
HARDWARE 


Intellecill> Series 
III Microcomputer 
Development 


System 


-System 
Console 


-Double-Density 
Dual-Diskette 
Drive. A Hard Disk 


is recommended 


-Hard 
Disk· 


REQUIRED 
SOFTWARE 
ISIS-II Diskette Operating 
System V4.1 or later 


Documentation 
Package 


FORTRAN 
86/88 User's 
Guide 
(121539-001) 


Flexible 
Diskettes 


-Single- 
and Double-Density 


PASCAL 86/88 
SOFTWARE PACKAGE 
• Resident on iAPX 86 Based Intellec® 
• Supports iAPX86/20, 88/20 Numeric 
Series III Microcomputer 
Development 
Data Processors 
System for Optimal Performance 
• Strict I.mplementation of ISO Standard 
• Object Compatible and Linkable with 
Pascal 
PL/M 86/88, ASM 86/88 and FORTRAN 
• Useful Extensions Essential for 
86/88 
Microcomputer 
Applications 
• ICE™Symbolic Debugging Fl;IlIy 
• Separate Compilation 
with Type- 
Supported 
Checking Enforced Between Pascal 
• Implements 
REALMATH for Consistent 
Modules· 


and Reliable Results 
• Compiler Option to Support Full Run- 
Time Range-Checking 


PASCAL 
86/88 conforms 
to and implements 
the ISO Draft 
Proposed 
Pascal 
standard. 
The language 
is 


enhanced 
to support 
microcomputer 
applications 
with special features, 
such as separate compilation, 
inter- 
rupt handling 
and direct port I/O. To assist the development 
of portable 
software, 
the compiler 
can be directed 


to flag all non-standard 
featu res. 


The PASCAL 86/88 compiler 
runs on the iAPX 86 Resident 
Intellec® Series III Microcomputer 
Development 


System. A well-defined 
I/O interface 
is provided 
for run-time 
support. 
This allows a user-written 
operating 
sys- 


tem to support 
application 
programs 
as an alternate 
to the development 
system environment. 
Program 
mod- 
ules compiled 
under 
PASCAL 86/88 are compatible 
and linkable 
with 
modules 
written 
in PUM 86/88, ASM 
86/88 or FORTRAN 86/88. With a complete 
family of compatible 
programming 
languages 
for the iAPX 86, 88 
one can implement 
each module 
in the language 
most appropriate 
to the task ,at hand. 


PASCAL 86/88 object 
modules 
contain 
symbol 
and type information 
for program 
debugging 
using ICE-86™ 
emulator. 
For final 
production 
version, 
the compiler 
can remove this extra ,information 
and code. 


Note: 
The lntellec.s 
microcomputer 
development 
system 
pictured 
here is not 
included 
with 
the Pascal 
86/88 
Software 
Package 
but merely 
depicts 
the 
language 
in its operating 
environment. 
~ 
. 
. 


The following 
are trademarks 
of Intel 
Corporation 
and may b9 used 
only 
to identify 
Intel 
products: 
exp, 
CREDIT, 
Intel lee, Muttibus, 
i, iSBC, 
MultimodlJ.1e, 
ICE, iSeX. 
PROMPT, 
iAMX, 


iCS, Library 
Manager, 
Promware, 
Insite, 
MCS, RMX, Intel, 
Megachassis, 
UPI, Intelevision, 
Micromap, 
~Scope 
and the combination 
of iCE, iCS, iSBC, 
iSeX, 
MeS, 
or RMX and a numeri- 


cal 
suffix. 


Includes all the language features of Jensen & Wirth 
Pascal as defined 
in the ISO Draft Proposed 
Pascal 


Standard. 


Supports 
required 
extensions 
for microcomputer 


applications. 


-Interrupt 
handling 


-Direct 
port I/O 


Separate compilation 
extensions 
allow: 


-Modular 
decomposition 
of large programs 


-Linkage 
with 
other 
Pascal 
modules 
as well as 


PUM 86/88, ASM 86/88 and FORTRAN 86/88. 


-Enforcement 
of type-checking 
at LINK-time 


Supports 
numerous 
compiler 
options to control 
the 


compilation 
process, 
to INCLUDE 
files, flag non- 


standard 
Pascal statements 
and others 
to control 


program 
listings 
and object 
modules. 


Utilizes the IEEE standard 
for Floating-Point 
Arith- 


metic (the Intel REALMATH standard) for arithmetic 
operations. 


Well-defined 
and documented 
run-time 
operating 


system interfaces 
allow the user to execute the ap- 


plications 
under user-designed 
operating 
systems. 


Provides 
a standard 
Pascal for iAPX 86, 88 based 


applications. 


-Pascal 
has gained 
wide acceptance 
as the port- 


able application 
language 
for microcomputer 


applications 


-It 
is being taught in many colleges and universities 


around 
the world 


-It 
is easy to learn, originally 
intended 
as a vehicle 


for teaching 
computer 
programming 


-Improves 
maintainability: 
Type mechanism 
is 


both strictly 
enforced 
and user extendable 


-Few 
machine 
specific 
language 
constructs 


Strict implementation 
of the proposed 
ISO standard 


for Pascal aids portability 
of application 
programs. A 


compile 
time 
option 
checks 
conformance 
to the 


standard 
making 
it easy 
to 
write 
conforming 


programs. 


PASCAL 
86/88 extensions 
via predefined 
proce- 


dures for interrupt 
handling 
and direct port I/O make 


it possible 
to code an entire 
application 
in Pascal 


without 
compromising 
portability. 


Standard 
Intel REALMATH 
is easy to use and pro- 


vides 
reliable 
results, 
consistent 
with 
other 
Intel 


languages 
and other 
implementations 
of the IEEE 


proposed 
Floating-Point 
standard. 


Provides 
run-time 
support 
for co-processors. 
All 


real-type 
arithmetic 
is performed 
on the 86/20 nu- 


meric 
data processor 
unit or software 
emulator. 


Run-time library routines, common 
between Pascal 


and other Intel languages (such as FORTRAN), per- 
mit efficient 
and consistently 
accurate 
results. 


Extended relocation 
and linkage support allows the 


user to link Pascal program 
modules 
with routines 


written 
in other 
languages 
for certain 
parts of the 


program. 
For example, 
real-time 
or hardware 
de- 


pendent routines written in ASM 86/88 or PUM 86/88 
can be linked to Pascal routines, 
further 
extending 


the user's ability 
to write 
structured 
and modular 


programs. 


PASCAL 
86/88 programs 
"talk" 
to the resident 


operating 
system using Intel's standard interface for 


translated 
programs. 
This allows 
users to replace 


the development 
operating 
system 
by their 
own 


operating 
systems in the final application. 


PASCAL 86/88 takes full advantage 
of iAPX 86, 88 


high 
level 
language 
architecture 
to 
generate 


efficient 
machine 
code 
without 
using 
time- 


consuming 
optimization 
algorithms. 


Compiler 
options 
can be used to control 
the pro- 


gram listings and object modules. While debugging, 
the user may generate additional 
information 
such 


as the symbol 
record 
information 
required 
and 


useful for debugging 
using ICE emulation. 
After de- 


bugging, the production 
version may be streamlined 


by removing 
this additional 
information. 


REQUIRED 
HARDWARE 
Intellec~ 
Series 
III Microcomputer 
Development 
System 
-System 
Console 
-Double 
Density Dual Diskette Drive OR Hard Disk 


nl:\iilUln~u~ur I 9""""100. 
ISIS-II Diskette 
Operating 
System V4.1 or later 


Flexible 
Diskettes 
-Single 
and Double 
Density 


Part Number 


MDS*-314 


Description 


PASCAL 86/88 Software Package 


PL/M·86 high level programming Ian· 
guage 


ASM86 macro assembler for 8086/8088 
assembly language programming 


CONV86 converter for conversion of 
8080/8085 assembly language source 
code to 8086/8088 assembly language· 
source code 


L1NK86 and LOC86 linkage and 
relocation utilities 


The 8086/8088 software development 
package provides a set of software development 
tools for the 8086 and the 8088 
microprocessors 
and iSSC 86/12 single board computer. The package operates under the 1515-11 operating 
system on 


Intellec Microcomputer 
Development Systems-Model 
800 or Series II-thus 
minimizing 
requirements 
for additional 
hardware or training 
for Intel Microcomputer 
Development 
System users. 


The package permits 8080/8085 users to efficiently 
convert existing 
programs into 8086/8088 object code from either 


8080/8085 assembly 
language source code or PLlM·80 source code. 


For the new Intel Microcomputer 
Development 
System user, the package operating 
on an Intellec 
Model 230 Micro· 


computer 
Development 
System provides total 8086/8088 software development 
capability. 


Sophisticated new complier design 
allows user to achieve maximum benefits 
of SOS6/S0SScapabilities 


Language Is upward compatible from 
PL/M·SO, assuring MCS·SO/S5design 
portability 


Supports 16·blt signed Integer and 32·blt 
floating point arithmetic 


Produces relocatable and linkable object 
code 


Supports full extended addressing 
features of the SOS6and the SOSS 
microprocessors 


Code optimization assures efficient code 
generation and minimum application 
memory utilization 


Like Its counterpart 
for MCS-80/85 program development, 
PLlM-86 Is an advanced structured 
high level programming 
language. PLlM·86 is a new compiler created specifically 
for performing 
software developmen~ for the Intel 8086 and 
8088 Microprocessors. 


PLlM-86 has significant 
new capabilities 
over PL/M·80 that take advantage of the new facilities 
provided by the 8086 
and the 8088 microprocessors, 
yet the PLlM·86 language remains upward compatible 
from PLlM-80. 


With the exception 
of Interrupts, 
hardware flags, and time·crltlcal 
code sequences, PLlM-80 programs may be recom· 
plied under PLM·86 with little or no conversion 
required. PLlM-86, like PLlM·80, Is easy to learn, facilitates 
rapid pro· 
gram development, 
and reduces program maintenance 
costs. 


PL/M Is a powerful, structured 
high level algorithmic 
language In which program statements 
can naturally express the 
program algorithm. 
This frees the programmer to concentrate 
on the system implementation 
without concern for bur· 


densome details of assembly language programming 
(such as register allocation, 
meanings of assembler mnemonics, 
etc.). 


The PLlM-86 compiler efficiently 
converts free·form PLiM language statements 
into equivalent 8086/8088 machine in· 


structlons. 
Substantially 
fewer PLiM statements 
are necessary for a given application 
than if it were programmed at 
the assembly 
language or machine code level. 


Since PLiM programs are Implementation 
problem oriented and more compact, use of PLiM resylts In a high degree of 
engineering 
productivity 
during 
project 
development. 
This translates 
Into significant 
reductions 
in Initial 
software 
development 
and follow·on 
maintenance 
costs for the user. 


Major features 
of the Intel PLlM-86 compiler 
and pro· 
grammlng 
language Include: 


• 
Supports 
Five Data Types 


- 
Byte: 8·blt unsigned 
number 
- 
Word: 16·blt unsigned number 
Integer: 16-blt signed number 
- 
Real: 32-bit floating 
point number 
- 
Pointer: 16·blt or 32-bit memory address Indicator 


• 
Block 
Structured 
Language 


- 
Permits 
use 
of 
structured 
programming 
tech· 
niques 


• 
Two 
Data 
Structuring 
Facilities 


- 
Array: Indexed list of same type data elements 
- 
Structure: 
Named collection 
of same or different 
type data elements 
- 
Combinations 
of Each: Arrays 
of structures 
or 
structures 
of arrays 


• 
Relocatable 
and Linkable 
Object 
Code 


- 
Permits 
PLlM-86 programs 
to be developed 
and 
debugged 
in small modules. These modules 
can 
be easily linked with other PLlM·86 or ASM86 ob· 
ject modules and/or library routines to form a com· 
plete application 
system. 


• 
Bullt·ln 
String 
Handling 
Facilities 


- 
Operates on byte strings or word strings 
- 
Six Functions: 
MOVE, COMPARE, TRANSLATE, 


SEARCH, SKIP, and SET 


• 
Automatic 
Support 
for 8088 
Extended 
Addressing 


- 
Three complier 
options 
offer a separate model of 
computation 
for programs 
up to 1·Megabyte 
In 
size 
- 
Language transparency 
for extended addressing 


• 
Support 
for ICE·86 
Emulator 
and Symbolic 
Debugging 


- 
Debug option for inclusion 
of symbol table in ob· 


ject 
modules 
for In·Circult 
Emulation 
with 
sym· 
bolic debugging 


Ilnll6 Jllnllll 
~1'\E:T\l' 
I\.D~ 
"'''-\leL~ 
••..• 
.e,'V",_vp"",rll-.-a:''Kage 
are all that is 
_~ ••v, ••v••,,' "u"'Vr,lln 
r1 
needed for development of software for the 8086 and 
• Included file or copy facility 
the 8088 microcomputers and iSBC 86/12 single board 
• Two levels of optimization 
computer. This further reduces development time and 
• Intra·module and inter·module cross reference 
costs because expensive (and remote) time sharing of 
• Arbitrary placement of compiler and user files 
large computers is not required. Present users of Intel 
on any available combination of disk drives 
Intellec Development Systems can begin to develop 
• Reentran~a!1dInterrupt Procedures 
8086 and 8088 designs without 
expensive hardware 
May be specified as user optiQns 
reinvestment or costly retraining: 


PLlM·86 is designed to be an efficient, 
cost·effective 
solution 
to 
the 
special 
requirements 
of 
8086/8088 
Microcomputer Software Development, as illustrated by 
the following benefits of PLlM·86 use: 


• Reduced Learning Effort - 
PLlM·86 is easy to learn 
and to use, even for the novice programmer. 


• Earlier 
Project Completion - 
Critical 
projects 
are 
completed 
much 
earlier 
than 
otherwise 
possible 
because PLlM·86, a structured high·levellanguage, in· 
creases programmer productivity. 


• 
Lower Development Cost - 
Increases in programmer 
productivity 
translate 
immediately 
into 
lower soft· 
ware development costs because less programming 
resources are required for a given programmed func· 
tion. 


• Increased Reliability - 
PLlM·86 is designed to aid in 
the development of reliable software (PLlM·86 pro· 
grams 
are 
simple 
statements 
of 
the 
program 
algorithm). This substantially reduces the risk of cost· 
Iy correction 
of errors in systems that have already 
reached full production 
status, as the more simply 
stated the program is, the more likely it is to perform 
its intended function. 


• Easier Enhancements and Maintenance - 
Programs 
written 
in PLiM tend to be self·documenting, 
thus 
easier to read and understand. This means it is easier 
to enhance 
and maintain 
PLiM 
programs 
as the 
system capabilities 
expand and future prOducts are 
developed. 


• Simpler Project Development - 
The Intellec Develop· 
ment Systems offer a cost·effective 
hardware base 


/*The procedure in this module computes the mean and 
variance of an array of data, X, of length N+ 1,according 
to the method of Kahan and Parlett (University of Cali- 
fornia, Berkeley, Memo no. UCB/EF:lLM77/21.*/ 


STAT: PROCEDURE(X$PTR,N,MEAN$PTR, 


VARIANCE$PTR) PUBLIC; 


DECLARE, 


(X$PTR,MEAN$PTR,VARIANCE$PTR) 
POINTER,X BASED X$PTR (1) REAL, 
N INTEGER, 
MEAN BASED MEAN$PTR REAL, 
VARIANCE BASED VARIANCE$PTR REAL, 
(M,O,DIFF) REAL, 
I INTEGER; 


M=X(O); 
M=O.O; 


DO 1= 1 TO N; 
DIFF = X(I)- M; 
M = M+ DIFF/FLOAT(I + 1); 
0= 0+ DIFF*DIFF*FLOAT(I)/FLOAT(I + 1); 


END; 


MEAN = M; 
VARIANCE = O/FLOAT(N); 


END STAT; 


Powerful and flexible text macro facility 
with three macro listing options to aid 
debugging 


Highly mnemonic and compact 
language, most mnemonics represent 
several distinct machine instructions 


"Strongly typed" assembler helps detect 
errors at assembly time 


High·level data structuring facilities 
such as "STRUCTUREs" 
and 
"RECORDs" 


Over 120 detailed and fully documented 
error messages 


Produces relocatable and linkable object 
code 


ASM86 is the "high-level" 
macro assembler for the 8086/8088 assembly language. ASM86 translates symbolic 
8086/8088assembly language mnemonics into 8086/8088machine code. 


ASM86 should be used where maximum code efficiency and hardware control is needed. The 8086/8088assembly 
language includes approximately 100instruction mnemonics. From these few mnemonics the assembler can generate 
over 3,800 distinct machine instructions. Therefore, the software development task is simplified, as the programmer 
need know only 100 mnemonics to generate all possible 8086/8088machine instructions. ASM86 will generate the 
shortest 
machine instruction 
possible 
given no forward 
referencing 
or given explicit 
information 
as to the 
characteristics 
of forward referenced symbols. 


ASM86 offers many features normally found only in high-level languages. The 8086/8088assembly language is strong· 
Iy typed. The assembler performs extensive checks on the usage of variables and labels. The assembler uses the at- 
tributes which are derived explicitly when a variable or label is first defined, then makes sure that each use of the sym· 
bol in later instructions conforms to the usage defined for that symbol. This means that many programming errors will 
be detected when the program is assembled, long before it is being debugged on hardware. 


Major features of the Intel 8086/8088 assembler and 
assembly language include: 


• Powerful and Flexible Text Macro Facility 
Macro calls may appear anywhere 
Allows user to define the syntax of each macro 
Built·in functions 
• conditional assembly (IF·THEN·ElSE, WHilE) 
• repetition (REPEAT) 
• string processing functions (MATCH) 
• support of assembly time I/O to console (IN, 
OUT) 
- 
Three Macro Listing Options include a GEN mode 
which provides a complete trace of all macro calls 
and expansions 


• Hlgh·Level Data Structuring Capability 


STRUCTURES:Defined to be a template and then 
used to allocate storage. The familiar dot notation 
may be used to form instruction addresses with 
structure fields. 
ARRAYS: Indexed list of same type data elements. 
RECORDS:Allows bit-templates to be defined and 
used as instruction 
operands and/or to allocate 
storage. 


• Fully Supports 8086/8088 Addressing Modes 


Provides for complex address expressions involv- 
ing base and indexing registers and (structure) 
field offsets. 


Powerful EQU facility allows complicated expres- 
sions to be named and the name can be used as a 
synonym 
for 
the 
expression 
throughout 
the 
module. 


• Powerful STRING MANIPULATION INSTRUCTIONS 


Permit direct transfers to or from memory or the 
accumulator. 
Can 
be 
prefixed 
with 
a 
repeat 
operator 
for 
repetitive execution with a count·down and a con· 
dition test. 


• Over 120 Detailed Error Messages 


Appear both in regular list file and error print file. 
User documentation fully explains the occurrence 
of each error and suggests a method to correct it. 


• Generates Relocatable and Linkable Object Code- 
Fully Compatible with LINK88, LOC88 and LIB88 


- 
Permits 
ASM86 
programs 
to be developed 
and 
debugged 
In small modules. These modules 
can 
be easily linked with other ASM86 or PLlM-86 ob· 
ject modules and/or library routines to form a com- 
plete application 
system. 


• Support for ICE·86 Emulation and Symbolic Debug· 
glng 


Debug options 
for inclusion 
of symbol 
table 
in 
object modules for In-Circuit 
Emulation 
with sym- 


bolic debugging. 


The 8086/8088 macro assembler 
allows 
the extensive 
capabilities 
of the 8086/8088 to be fully exploited. 
In any 


application, 
time 
and 
space 
critical 
routines 
can 
be 
effectively 
written 
in ASM86. The 8086/8088 assembler 


outputs 
relocatable 
and linkable object modules. These 
object 
modules 
may be easily 
combined 
with 
object 


modules 
written 
in PLlM-86-lntel's 
structured, 
high· 
level 
programming 
language. 
ASM86 
compliments 
PLM-86 as the programmer 
may choose to write each 
module 
in the language 
most appropriate 
to the task 
and then combine the modules into the complete 
appli- 


cations 
program 
using 
the 8086/8088 relocation 
and 
linkage utilities. 


CONV86 
MCS·80/85 to MCS·86 ASSEMBLY LANGUAGE 
CONVERTER UTILITY PROGRAM 


Translates 8080/8085 Assembly 
Language Source Code to 8086/8088 
Assembly Language Source Code 


Provides a fast and accurate means to 
convert 8080/8085 programs to the 8086 
and the 8088, facilitating 
program 
portability 


Automatically 
generates proper ASM·86 
directives to set up a "virtual 8080" 
environment that Is compatible with 
PLM·86 


In support of Intel's commitment 
to software 
portability, 
CONV86 Is offered as a tool to move 8080/8085 programs to 
the 8086 and the 8088. A comprehensive 
manual, "MCS-86 Assembly 
Language Converter Operating 
Instructions 
for 


ISIS-II Users" (9800642), covers the entire conversion 
process. Detailed methodology 
of the conversion process is fully 


described 
therein. 


CONV86 will accept as Input an error-free 8080/8085 assembly-language 
source file and optional controls, and produce 


as output, optional 
PRINT and OUTPUT files. 


The PRINT file Is a formatted 
copy of the 8080/8085 source and the 8086/8088 source file with embedded caution 


messages. 


The OUTPUT file Is an 8086/8088 source file. 


CONV86 Issues a caution 
message when It detects a potential 
problem in the converted 8086/8088 code. 


A transliteration 
of the 8080/8085 programs 
occurs, 
with each 8080/8085 construct 
mapped to its exact 8086/8088 
cou nterpart: 


-Registers 
-Condition 
flags 
-Instructions 
-Operands 
-Assembler 
directives 
-Assembler 
control lines 
-Macros 


Because CONV86Is 
a transliteration 
process, there Is the possibility 
of as much as a 15%-20% code expansion over 
the 8080/8085 code. For compactness 
and efficiency 
it is recommended 
that critical 
portions of programs be re-coded 


In 8086/8088 assembly 
language. 


Also, as a consequence 
of the transliteration, 
some manual editing 
may be required for converting 
instruction 
se- 


quences dependent 
on: 


-Instruction 
length, timing, 
or encoding 
-Interrupt 
processing 
-PLIM parameter passing conventions 
mechanical 
editing 
procedures 
for these are suggested 
in the converter manual. 


The accompanying 
diagram illustrates 
the flow of the conversion process. Initially, the abstract program may be repre- 
sented in 8080/8085 or 8086/8088 assembly 
language to execute on that respective 
target machine. The conversion 
process Is porting a source destined 
for the 8080/8085 to the 8086 or the 8088 via CONV86. 


SOURCE 
CODE 
IN 808618088 
ASSEMBLY 
LANG 


EXECUTE 
ON 
808618088 


_CONva6 
II 


ASSEMBLE 
FOR 
8086/8088 


Automatic 
combination 
of separately 
compiled or assembled 8086/8088 
programs into a relocatable module 
Automatic 
selection of required modules 
from specified libraries to satisfy 
symbolic references 


Extensive debug symbol manipulation, 
allowing line numbers, local symbols, 
and public symbols to be purged and 
listed selectively 


Automatic generation of a summary map 
giving results of the L1NK86 process 


Abbreviated control syntax 


Relocatable modules may be merged 
into a single module suitable for 
inclusion in a library 


Supports "incremental" 
linking 


Supports type checking of public and 
external symbols 


L1NK86 combines 
object modules specified 
in the L1NK86 input list into a single output 
module. L1NK86 combines 
segments 
from the input modules according 
to the order in which the modules are listed. 


Support for incremental 
linking 
is provided since an output 
module produced 
by L1NK86 can be an input to another 
link. At each stage in the incremental 
linking 
process, unneeded public symbols 
may be purged. 


L1NK86 supports 
type checking 
of public and external symbols 
reporting 
an error if their types are not consistent. 


L1NK86 will link any valid set of input modules without any controls. 
However, controls are available to control the out- 


put of diagnostic 
information 
in the L1NK86 process and to control 
the content 
of the output module. 


L1NK86 allows the user to create a large program as the combination 
of several smaller, separately compiled modules. 


After development 
and debugging 
of these component 
modules the user can link them together, 
locate them using 
LOC86, and enter final testing with much of the work accomplished. 


Automatic and independent 
relocation 
of segments. Segments may be 
relocated to best match users memory 
configuration 


Extensive debug symbol manipulation, 
allowing line numbers, local symbols, 
and public symbols to be purged and 
listed selectively 


Automatic generation of a summary map 
giving starting address, segment 
addresses and lengths, and debug 
symbols and their addresses 


Extensive capability to manipulate the 
order and placement of segments in 
8086/8088 memory 


Abbreviated control syntax 


Relocatability 
allows the programmer to code programs or sections 
of programs without 
having to know the final ar- 


rangement 
of the object code in memory. 


LOC86 converts relative addresses in an input module to absolute addresses. LOC86 orders the segments in the input 
module and assigns absolute 
addresses to the segments. The sequence in which the segments 
in the input module 
are assigned absolute addresses is determined 
by their order in the input module and the controls 
supplied with the 
command. 


LOC86 will relocate any valid input module without 
any controls. 
However, controls 
are available to control the output 
of diagnostic 
information 
in the LOC86 process, to control 
the content 
of the output 
module, or both. 


The program 
you are developing 
will almost 
certainly 
use some mix of random access memory (RAM), read-only 
memory (ROM), and/or programmable 
read-only memory (PROM). Therefore, the location of your program affects both 
cost and performance 
in your application. 
The relocation 
feature allows you to develop your program on the Intellec 
development 
system and then simply 
relocate the object code to suit your application. 


Converts an 8086/8088 absolute object 
module to symbolic hexadecimal 
format 


Facilitates 
preparing a file for later 
loading by a symbolic hexadecimal 
loader, such as the iSBC Monitor or 
Universal PROM Mapper 


Converts an absolute module to a more 
readable format that can be displayed 
on a CRT or printed for debugging 


The OH86 command 
converts an 8086/8088 absolute 
object module to the hexadecimal 
format. This conversion 
may 
be necessary to format a module for later loading by a hexadecimal 
loader such as the iSSC 86/12 monitor or Universal 
Prom Mapper. The conversion 
may also be made to put the module in a more readable format that can be displayed or 
printed. 


L1B86 is a library manager program 
which allows you to: 


Create specially formatted files to 
contain libraries of object modules 


Maintain these libraries by adding or 
deleting modules 


Print a listing of the modules and 
public symbols in a library file 


Libraries can be used as input to L1NK86 
which will automatically 
link modules 
from the library that satisfy external 
references in the modules being linked 


Libraries aid in the job of building 
programs. The library manager program, L1S86, creates and maintains 
files contain· 


ing object 
modules. The operaiion 
of L1S86 is controlled 
by commands 
to indicate 
which operation 
L1S86 is to per· 


form. The commands 
are: 


CREATE - 
creates an empty library file 
ADD - 
adds object modules to a library file 
DELETE - 
deletes modules 
from a library file 
LIST - 
lists the module directory 
of library files 
EXIT - 
terminates 
the L1S86 program and returns control to ISIS·II 


SPECIFICATIONS 


Operating 
Environment 


Required H.rdw •••• 


Intellec 
Microcomputer 
Development 
System 


- 
MDS-800, MDS-888 
- 
Series II MDS-220 or MDS-230 
64K Bytes of RAM Memory 


Dual Diskette 
Drives 


- 
Single or Double· 
Density 


System Console 


- 
CRT or Hardcopy Interactive 
Device 


Optlon.1 
H.rdware 


Universal PROM Programmer 
LIne Prlnter· 
ICE-86™. 


Required Software 


ISIS-II Diskette 
Operating 
System 


- 
Single or Double· 
Density 


Documentation 
Package 


PLlM-86 Programming 
Manual (9800466) 
ISIS-II PLlM-86 Complier 
Operator's 
Manual (9800478) 
MCS-86 User's Manual (9800722) 
MCS-86 Software 
Development 
Utilities 
Operating 
Instructions 
for ISIS-II Users (9800639) 
MCS-86 Macro Assembly 
Language Reference Manual 
(9800640) 
MCS-86 Macro Assembler 
Operating 
Instructions 
for 
ISIS-II Users (9800641) 
MCS-86 Assembly 
Language Converter Operating 
Instructions 
for ISIS-II Users (9800642) 
Universal PROM Programmer 
User's Manual 
(9800819A) 


Flexible 
Diskettes 


- 
Single and Double· 
Density 


ORDERING 
INFORMATION 


Part Number Description 


MDS-311 
808618088 Software 
Development 
Package 


Also 
available 
in the 
following 
development 
support 
packages: 


Part Number 


SP86A-KIT 
Description 


SP86A Support Package (for Intellec 
Model 800) 


Includes 
ICE-86 In-Circuit 
Emulator 
(MDS-86-ICE) and 8086/8088 
Software 
Development 
Package (MDS-311) 


SP86B Support Package (for Series II) 


Includes 
ICE-86 In-Circuit 
Emulator 
(MDS-86-ICE), 8086/8088 
Software 
Development 
Package (MDS-311), 
and Series II Expansion Chassis 
(MDS-201) 


inter 


8087 
SOFTWARE 
SUPPORT 
PACKAGE 


• 
Program Generation for the 8087 
Numeric Data Processor on the 
. 


IntelLec® Microcomputer 
Development 
System 


• 
Consists of: 8086/8087/8088 Macro 
Assembler, 
8087 Software Emulator 


• 
Macro Assembler 
Generates Code for 
8087 Processor or Emulator, While 
Also Supporting the 8086/8088 
Instruction 
Set 


• 
8087 Emulator Duplicates Each 8087 
Floating-Point 
Instruction 
in Software, 


for Evaluation of Prototyping, 
or for 


Use in an End Product 


• 
Macro Assembler 
and 8087 Emulator 


are Fully Compatible with Other 
8086/8088 Development 
Software 


• 
Implementation 
of the IEEE Proposed 
Floating-Point 
Standard (the Intel® 


Realmath Standard) 


The 8087 Software 
Support 
Package is an optional 
extention 
of Intel's 
8086/8088 
Software 
Development 


Package that runs under ISIS-II on an Intellec or Series II Microcomputer 
Development 
System. 


The 8087 Software 
Support 
Package consists 
of the 8086/8087/8088 
Macro Assembler, 
and the Full 8087 


Emulator. 
The assembler 
is a functional 
superset of the 8086/8088 
Macro Assembler, 
and includes 
instruc- 


tions for over sixty new floating-point 
operations, 
pi us new data types supported 
by,the 8087. 


The 8087 Emulator 
is an 8086/8088 
object module that simulates 
the environment 
of the 8087, and executes 
each floating-point 
operation 
using software algorithms. 
This emulator functionally 
duplicates 
the operation 
of the 8087 Numeric 
Data Processor. 


Also included 
in this package are interface 
libraries 
to link with 8086/8087/8088 
object modules, 
which are 
used for specifying 
whether the 8087 Processor or the 8087 Emulator 
is to be used. 'This enables the run-time 


environment 
to be invisible 
to the programmer 
at assembly time. 


The foUowing 
are trademarks 
of Intel Corporation 
and may be used onlV-lo 
identify 
Intel products: 
axe. CREon, Inlellee, 
Multibus, 
i, iSSC, 
Multimodule, 
ICE. iSBX, PROMPT. 
iRMX. 
iCS, Library 
Manager, 
Prom ware, Insite, MeS. RMX, Intel, 
Megachassis. 
UPI, Intelevision, 
Micromap, 
~Scope 
and the combination 
of iCE, leS, isac, ,sax, Mes, or AMX and a 
numerical 
suffix. 
© Intel Corporation 
1980 


intJ 


The 8086/8087/8088 
Macro 
Assembler 
translates 
symbolic 
macro 
assembly 
language 
instructions 
into appropriate 
machine 
instructions. 
It is an ex- 
tended version 
of the 8086/8088 Macro Assembler, 
and therefore 
supports 
all of the same features and 
functions, 
such 
as limited 
type 
checking, 
condi- 
tional 
assembly, 
data structures, 
macros, etc. The 
extensions 
are the new instructions 
and data types 
to 
support 
floating-point 
operations. 
Realmath 
floating-point 
instructions 
(see Table 
1) generate 
code capable of being converted 
to either 8087 in- 
structions 
or interrupts 
for the 8087 Emulator. 
The 
Processor/Emulator 
selection 
is made via interface 
libraries 
at 
LINK-time. 
In addition 
to 
the 
new 


floating-point 
instructions, 
the 
macro 
assembler 
also introduces 
two new 8087 data types: QWORD 
(8 bytes) and TBYTE (ten bytes). These support 
the 
highest 
pre<;i~ion of data proces~ed 
by the 8087. 


The Full 8087 Emulator 
is a 16-kilobyte 
object mod- 
ule that 
is linked 
to the' application 
program 
for 
floating-point 
operations. 
Its functionality 
is identi- 
cal to the 8087 chip, and is ideal for prototyping 
and 
debugging 
floating-point.' 
applications. 
The 
Emulator is an alternative to the use of the 8087 chip, 
although 
the latter execut€s floating-point 
applica- 
tions 
up to 100 times faster than an 8086 with the 
8087 Emulator. 
Furthermore, 
since 
the 8087 is a 
"co-processor," 
use of the chip will allow many op- 
erations 
to be perf~rmed 
in parallel 
with th~ 8086. 


FADD 
FADDP 
FIADD 


FSUB 
FSUBP 
FISUB 
FSUBR 
FSUBRP 


FISUBR 


FMUL 
FMULP 
FIMUL 


FDIV 
FDIVP 
FIDIV 
FDIVR 
FDIVRP 


FSQRT 
FSCALE 
FPREM 
FRNDINT 
FXTRACT 


Divide 
real 
Divide 
real' and 
pop 
Integer 
divide 
• 


Divide 
real'rev'ersed 
Divide 
real reversed 
.and 
pop 
.. 
' 


Integer 
divide 
reversed 


Other 
Operations 


Square 
root 
Scale 
Partial 
remainder 
Round 
to integer 
Extract 
exponent 
and 
significand 
Absolute 
value 
Change 
sign 


FINIT/FNINIT 


FDISI/FNDISI 


FENI/FNENI 


FLDCW 


FSTCW/FNSTCW 


FSTSWIFNSTSW 


FCLEX/FNCLEX 


, 
'.. 
, 


FSTENV/FNSTENV 


FLDENV 


FSAVE/FNSAVE 


FRSTOR 


FINCSTP 


FDECSTP 


FFREE 


FNOP 


FWAIT 


Initialize 
processor 


Disable 
interrupts 


Enable 
interrupts 


Load 
control 
word 


Store 
conirol 
word 


Store 
status 
word 


Clear 
exceptions 


Store 
en'viroriment 


Load 
environment 


• Save state 


Restore 
state 


Increment 
stack 
pointer 


Decrement 
stack 
pointer 


Free register 


No operation 


CPU wait' 


FCOM 


FCOMP 


FCOMPP 


FICOM 


FICOMP 


FTST 


FXAM 


Compare 
real 


Compare 
real and 
pop 


Compare 
real and 
pop 
twice .• 


Integer 
..compare 


Integer 
compare 
and 
pop 


Test 


Examine 


FPTAN 
Partial tangent 
Real Transfers 


FPATAN 
Partial arctangent 
FLD 
Load real 


F2XM1 
2'-1 
FST 
Store real 
FSTP 
Store real and pop 
FYL2X 
y. 10g,X 
FXCH 
Exchange registers 
FYL2XP1 
y. 10g,(X+ 1) 
Integer Transfers 


Constant 
Instructions 
FILD 
Integer load 


FIST 
Integer store 


FISTP 
Integer store and pop 
FLDZ 
Load +0.0 
Packed Decimal Transfers 
FLD1 
Load +1.0 
FBLD 
Packed decimal (BCD) 
FLDPI 
Load 1r 
load 


FLDL2T 
Load log,10 
FBSTP 
Packed decimal (BCD) 
store and pop 
FLDL2E 
Load log,e 


FLDLG2 
Load log,02 


FLDLN2 
Load log.2 


REQUIRED 
SOFTWARE 
ISIS-II Diskette 
Operating 
System 
-Single 
or Double 
Density 


8086/8088 Software 
Development 
Package 
Operating 
Environment 


REQUIRED 
HARDWARE 


IntellecQl>Microcomputer 
Development 
System 


-Model 
800 


-Series 
II (Models 
220, 225 or equivalent) 


64K Bytes of RAM Memory 


Minimum 
One Diskette 
Drive 


-Single 
or Double' 
Density 


System Console 
-CRT 
or Hardcopy 
Interactive 
Device 


8086/8087/8088 
Macro 
Assembly 
Language 
Refer- 


ence 
Manual 
for 
8080/8085-Based 
Development 


Systems 
(121623-001) 


8086/8087/8088 
Macro 
Assembler 
Operating 
In- 


structions 
for 
8080/8085-Based 
Development 
Sys- 
tems (121624-001) 


OPTIONAL 
HARDWARE 


Universal 
PROM Programmer' 


Line Printer' 


The 8086 Family 
Users Manual 
Supplement 
for the 


8087 Numeric 
Data Processor 
(121586-001) 


Part Number 
Description 


MDS'-387 
8087 Software 
Support 
Package 


Requires 
Software 
License 


'MDS is an ordering 
code only and is not used as a product 
name or trademark. 
MDS~ is a registered trademark 
of Mohawk Data Sciences 
Corporation. 


8089 I/O processor program generation 
on the Intellec Microcomputer 
Development 
System. 


Relocatable object module compatible 
with the 8086 and 8088 Microprocessors. 


Supports 8089-based addressing modes 
with a structure facility that enables easy 
access to based data. 


Includes software development 
utilities 
to facilitate 8089 design. 


-L1NK86: 
Combines 8086 or 8088 object 
modules with 8089 object 
modules and resolves 
external references. 


-LOC86: 
Assigns absolute memory 
addresses to 8089 object 
modules. 


-OH86: 
Converts 8086/8088/8089 
object code to symbolic 
hexadecimal format. 


-UPM86: 
A PROM programming aid 
which has been updated to 
support PROM programming 
for 8086, 8088 and 8089 
applications. 


The 8089Assembler Support Package extends Intellec microcomputer development system support to the 8089i/O 
Processor. The assembler translates 8089assembly language source instructions into appropriate machine opera- 
tion codes. The 8089Assembler Support Package allows the programmer to fully utilize the capabilities of the 80891/0 
Processor. 
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The 8089Assembler Support Package contains the 8089 
assembler 
(ASM89) as well as L1NK86 and LOC86- 


relocation 
and linkage 
utilities, 
OH86-8086/8088/8089 


object code to hexadecimal 
converter, 
and UPM86- 
PROMprogramming software updated to program object 
code in the 8086 formats. ASM89 translates symbolic 
8089 assembly 
language 
instructions 
into the appro- 


priate machine operation codes. The ability to refer to 
program addresses with symbolic names eliminates the 
errors of hand translation and makes it easier to modify 
programs when adding or deleting instructions. 


ASM89 provides 
relocatable 
object 
module 
compat- 
ibility 
with the 8086 and 8088 microprocessors. 
This 
object module compatibility, 
along with the 8086/8088 
relocation and linkage utilities, facilitates the designing 
of the 8089into an 8086or 8088system. 


ASM89 fully supports the based addressing 
modes of 


the 8089.A structure facility in the assembler provides 
easy access to based data. The structure facility allows 
the user to define a template that enables accessing of 
based data symbolically. 
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Required Hardware 


Intellec Microcomputer Development System 


-MDS·800, 
MDS-888 


-Series 
II Models 220 or 230 


64K Bytes of RAM Memory 


Minimum One Diskette Drive 


-Single 
or Double" Density 


System Console 


-CRT 
or Hardcopy Interactive Device 


Optional Hardware 


Universal PROM Programmer" 
Line Printer" 


Required Software 


ISIS·II Diskette Operating System 


-Single 
or Double" Density 


Documentation 
Package 


8089 Assembler User's Guide (9800938) 


8089 Assembler Pocket Reference (9800936) 


MCS-86 Software Development Utilities 
Operating Instructions 
for ISIS·II User's (9800639) 


MCS·86 Absolute Object File Formats (9800821) 


Universal PROM Programmer User's Manual (9800819) 


Part Number 


MDS-312 
Description 


8089Assembler Support Package 


ICE-86A™ 
iAPX 86 IN-CIRCUIT EMULATOR 


• 
Real-Time In-Circuit Emulation of iAPX 
86 Microsystems 
• 
Emulate Both Minimum and Maximum 
Modes of 8086 CPU 
• 
Full Symbolic Debugging 
• 
Breakpoints to Halt Emulation on a 
Wide Variety of Conditions 
• 
Comprehensive 
Trace of Program 
Execution 


• 
Disassembly of Trace or Program 
Memory from Object Code into 
Assembler 
Mnemonics 
• 
Software Debugging With or Without 
User System 
• 
Handles Full 1 Megabyte 
Addressability 
of iAPX 86 
• 
Enhance Existing ICE-86 ™ 
Emulators 
to ICE-86A ™ 
Capabilities 
with 
ICE-86U ™ 
Upgrade Package 


The Intel® ICE-86A In-Circuit 
Emulator 
provides 
sophisticated 
hardware 
and software 
debugging 
capabilities 
for iAPX 86 microsystems 
and iAPX 86 Single-Board 
Computers. 
These capabilities 
include 
In-Circuit 
Emula- 


tion for the 8086 Central 
Processing 
Unit plus extensions 
to debug systems including 
the 80891/0 
Processor 
and 8087 Numeric 
Processor 
Ex~ension. 
The emulator 
includes 
three 
circuit 
boards 
which 
reside 
in any 


Intellec® Microcomputer 
Development 
System. A cable and buffer box connect 
the Intellec system to the user 
system by replacing 
the user's 8086, thus extending 
powerful 
Intellec 
system debugging 
functions 
into the 
user system. 
Using 
the ICE-86A 
module, 
the designer 
can execute 
prototype 
8086 or 8089 software 
in 
continuous 
or single-step 
modes and can substitute 
blocks 
of Intellec 
system memory for user equivalents. 


Breakpoints 
allow the user to stop emulation 
on user-specified 
conditions 
of the iAPX 86 system, and the trace 
capability 
gives a detailed 
history of the program 
execution 
prior to the break. All user access to the prototype 
system software 
may be done symbolically 
by referring 
to the source 
program 
variables 
and labels. 


The ICE-86U In-Circuit 
Emulator 
upgrade 
package converts any existing 
ICE-86 module (non-A version) to the 
capabilities 
of an ICE-86A module. 


The following 
afe trademarks 
of Intel Corporation 
and its affiliates 
and may be used only to identify 
tntel products: 
i, Intel, INTEL, 
INTEL.LEG, 
MeS. 
1m, iCS, ICE, UPI, exp, 
iSBC, 
iSBX, 


lnsite, iRMX, 
System 
2000. CREDIT. 
iRMXl80, 
MULTIBUS. 
PROMPT, 
Promware, 
Megachassis, 
Library 
Manager, 
MAIN MULTI MODULE, 
a~d the combination 
of MeS, 
ICE, iSBC, iRMX or 


iCS and a numerical 
suffix. © INTEL 
CORPORATION, 
1981. 
8-157 
9800931-01 
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INTEGRATED HARDWARE/SOFTWARE 
DEVELOPMENT 


The ICE·86A emulator allows hardware and software 
development 
to proceed 
interactively. 
This is more 
effective than the traditional 
method of independent 
hardware 
and software 
development 
followed 
by 
system integration. 
With the ICE-86A module, 
pro- 
totype hardware 
can be added to the system as it is 
designed. 
Software 
and 
hardware 
testing 
occurs 
while the product 
is being developed. 


Conceptually, 
the 
ICE-86A emulator 
assists 
three 
stages of development: 


1. It can be operated 
without 
being 
connected 
to 
the 
user's 
system, 
so the 
ICE-86A 
module's 
debugging 
capabilities 
can be used to facilitate 
program 
development 
before 
any of the user's 
hardware 
is available. 


2. Integration 
of software 
and hardware 
can begin 
when any functional 
element of the user system 
hardware 
is connected 
to 
the 
8086 
socket. 


Through 
ICE-86A emulator 
mapping 
capabilities, 


Intellec memory, ICE module memory, or diskette 
memory can be substituted 
for missing prototype 
memory. 
Time-critical 
program 
modules 
are 
debugged 
before 
hardware 
implementation 
by 
using 
the 2K-bytes 
of high-speed 
ICE-resident 
memory. As each section of the user's hardware is 
completed, 
it is added 
to the prototype. 
Thus 
each 
section 
of the 
hardware 
and software 
is 


"system" 
tested as it becomes available. 


3. When the user's prototype 
is complete, 
it is tested 
with the final version of the user system software. 
The ICE-86A module 
is then 
used for real-time 
emulation 
of the 8086 to debug the system as a 
completed 
unit. 


Thus the ICE-86A module provides the user with the 
ability to debug a prototype 
or production 
system at 
any stage 
in its development 
without 
introducing 
extraneous 
hardware 
or software 
test tools. 


Symbols and PUM statement 
numbers 
may be sub- 
stituted 
for numeric 
values 
in any of the ICE-86A 
emulator 
commands. 
This allows the user to make 
symbolic 
references 
to 
I/O ports, 
memory 
ad- 


dresses, and data in the user program. Thus the user 
need not remember 
the addresses 
of variables 
or 
program 
subroutines. 


Symbols 
can be used to reference 
variables, 
pro- 
cedures, 
program 
labels, and source statements. 
A 
variable can be displayed or changed 
by referring 
to 
it by name rather 
than 
by its absolute 
location 
in 
memory. 
Using symbols 
for statement 
labels, pro- 
gram labels, and procedure 
names simplifies 
both 
tracing 
and 
breakpoint 
setting. 
Disassembly 
of a 
section 
of 
code 
from 
either 
trace 
or 
program 
memory 
into 
its assembly 
mnemonics 
is readily 
accomplished. 


PLUG 
INTO 
USER 
8086 
SOCKET 


r------------- 
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A typlcallAPX 
86development configuration. 
It Is based on Intellec<!lSeries III Development System, which 
hosts the ICE·86ATMemulator. The ICE·86ATMmodule Is shown connected to a user prototype system, Inthis 
case, an SDK·86. 


Furthermore, 
each 
symbol 
may 
have 
associated 
with it one of the data types BYTE, WORD, INTEGER, 
SINTEGER 
(for 
short, 
8-bit 
integer), 
POINTER, 
REAL, OREAL, or TREAL. Thus 
the user need not 
remember 
the type 
of a source 
program 
variable 
when 
examining 
or modifying 
it. For example, 
the 
command 
"!VAR" 
displays 
the value 
in memory 
of 
variable VAR in a format appropriate 
to its type, while 
the command 
"!'JAR = !VAR + 1" increments 
the 
value of the variable. 


The user symbol 
table generated 
along with the ob- 
ject file during 
a PUM-86, 
PASCAL-86 or FORTRAN- 
86 compilation 
or an ASM-86 assembly 
is loaded into 
memory 
along with the user program 
which is to be 
emulated. 
The user can utilize the available 
symbol 
table 
space 
more efficiently 
by using 
the SELECT 
option 
to choose 
which 
progr-am modules·will 
have 
symbols 
loaded 
in the symbol 
table. The user may 
also add to this symbol table any additional 
symbolic 
values 
for 
memory 
addresses, 
constants, 
or 
variables 
that 
are 
found 
useful 
during 
system 
debugging. 


The ICE-86A module 
provides 
access through 
sym- 
bolic definition 
to all of the 8086 registers 
and flags. 
The 
READY, 
NMI, 
TEST, 
HOLD, 
RESET, 
INTR, 
MN/MX, 
and 
RQ/GT pins of the 8086 can also 
be 
read. Symbolic 
references 
to key ICE-86A emulation 
information 
are also provided. 


The ICE-86A module provides 
a programmable 
diag- 
nostic facility 
which allows the user to tailor its oper- 
ation 
using 
macro 
commands 
and 
compound 
commands. 


A macro is a set of ICE-86A com mands which is given 
a single 
name. Thus, 
a sequence 
of 
commands 
which is executed 
frequently 
may be invoked 
simply 
by typing 
in a single command. 
The user first defines 
the macro 
by entering 
the entire 
sequence 
of com- 
mands which 
he wants 
to execute. 
He then 
names 
the macro and stores it for future 
use. He executes 
the macro by typing 
its name and passing 
up to ten 
parameters 
to the commands 
in the macro. 
Macros 
may be saved on a disk file for use in subsequent 
debugging 
sessions. 


Compound 
commands 
provide 
conditional 
execu- 
tion of commands 
(IF), and execution 
of commands 
until a condition 
is met or until they have been ex- 


ecuted 
a specified 
number 
of 
times 
(COUNT, 


REPEAT). 


Compound 
commands 
and macros 
may be nested 
any number 
of times. 
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Memory 
for the user system can be resident 
in the 
user system or "borrowed" 
from the Intellec System 
through 
the ICE-86A emulator's 
mapping 
capability. 
The speed of run emulation 
by the ICE-86A module 
depends on which mapping 
options 
are being used. 


The ICE-86A emulator 
allows the memory 
which 
is 
addressed 
by the 8086 to 
be mapped 
in 1K-byte 
blocks to: 


1. Physical memory in the user's system, which pro- 
vides 100 percent real-time emulation 
at the user- 
system 
clock 
rate (up to 5 MHz) with 
no wait 
states. 


2. Either of two 1K-byte blocks 
of ICE-86A module 
high-speed 
memory, 
which 
allow 
nearly 
full- 
speed emulation 
(with two additional 
wait states 
per 8086-controlled 
bus cycle). 


3. Intellec 
System memory, which 
provides 
emula- 
tion at approximately 
0.02 percent of real-time with a 
5 MHz clock. 


4. A random-access 
diskette 
file, with 
emulation 
speed comparable 
to Intellec 
System 
memory, 
ex- 
cept the emulation 
must wait when a new page is 
accessed 
on the diskette. 


The user can also designate 
a block 
of memory 
as 
non-existent. 
The ICE-86A module 
issues an error 
message when any such "guarded" 
memory 
is ad- 
dressed 
by the user program. 


As the 
user 
prototype 
progresses 
to 
include 
memory, emulation 
becomes 
real time. 


The ICE-86A software 
is a RAM-based 
program 
that 
provides 
the user with 
easy-to-use 
commands 
for 


initiating 
emulation, 
defining 
breakpoints, 
control- 
ling trace data collection, 
and displaying 
and con- 
trolling 
system parameters. 
ICE-86A commands 
are 
configured 
with 
a broad 
range of modifiers 
which 
provide the user with maximum 
flexibility 
in describ- 
ing the operation 
to be performed. 


Emulation 
commands 
to the ICE-86A emulator 
con- 
trol the process of setting up, running and halting an 
emulation 
of the user's iAPX 86 System. Breakpoints 
and tracepoints 
enable the ICE-86A module to halt 


emulation 
and provide 
a detailed 
trace of execution 
in any part of the user's program. A summary 
of the 
emulation 
commands 
is shown 
in Table 1. 


Table 
1. Summary 
of ICE-86A™ 
Emulation 
Commands 


Command 
Description 


GO 
Initializes emulation and allows the user to 
specify the starting point and breakpoints. 
Example: 


GO FROM.STARTTILL .DELAYEXECUTED 


where START and DELAY are statement 
labels. 


STEP 
Allows the user to single-step through the 
program. 


Breakpoints: 
The ICE-86A module 
has two break- 
point registers 
that allow the user to halt emulation 
when a specified 
condition 
is met. The breakpoint 
registers 
may 
be set 
up for 
execution 
or 
non- 
execution 
breaking. 
An execution 
breakpoint 
con- 
sists 
of a single 
address 
which 
causes 
a break 
whenever 
the 8086 executes 
from 
its queue 
an in- 
struction 
byte which was obtained 
from the address. 
A non-execution 
breakpoint 
causes 
an emulation 
break when a specified 
condition 
other than an in- 
struction 
execution 
occurs. A non-execution 
break- 
point 
condition, 
using 
one 
or 
both 
breakpoint 
registers, 
may be specified 
by anyone 
of or a combi- 
nation of: 


1. A set of address values. Break on a set of address 
values has three valuable 
features: 


a. Break on a single 
address. 


b. The ability 
to set any number 
of breakpoints 
within 
a limited 
range (1024 bytes maximum) 
of memory. 


c. The ability to break in an unlimited 
range. Ex- 


ecution 
is halted on any memory access to an 
address greater 
than (or less than) any 20-bit 
breakpoint 
address. 


2. A particular 
status of the 8086 bus (one or more 
of: memory or I/O read or write, instruction 
fetch, 


halt, or interrupt 
acknowledge). 


3. A set 
of 
data 
values 
(features 
comparable 
to 
break 
on a .set of address 
values, 
explained 
in 
point one). 


4. A segment 
register 
(break occurs when the regis- 
ter is used in an effective 
address 
calculation). 
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Emulation 
break 
can also be set to occur 
on an 
external 
signal 
condition. 
An external 
breakpoint 
match 
output 
and emulation 
status 
lines are 'pro- 
vided on the buffer box. These allow synchronization 
of other 
test equipment 
when 
a break 
occurs 
or 
when emulation 
is begun. 


Tracepoints: 
The 
ICE-86A 
module 
has 
two 
tracepoint 
registers 
which 
establish 
match 
condi- 
tions to conditionally 
start and stop trace collection. 


The trace information 
is gathered 
at least twice per 
bus cycle, first when the address 
signals 
are valid 
and second 
when the data signals 
are valid. If the 
8086 execution 
queue is otherwise 
active, additional 
frames of trace are collected. 


Each trace frame contains the 20 address/data 
lines 
and detailed 
information 
on the status of the 8086. 


The trace 
memory 
can store 
1,023 frames, 
or an 
average of about 300 bus cycles, 
providing 
ample 
data for detemining 
how the 8086' was reacting prior 
to emulation 
break. The trace memory 
contains 
the 


last 1,023 frames of trace data collected, 
even if this 
spans several separate emulations. 
The user has the 
option of displaying 
each frame of the trace data or 
displaying 
by instruction 
in actual 
ASM-86 Assem- 
bler mnemonics. 
Unless the user chooses to disable 
trace, the trace information 
is always availaple after 
an emulation. 
. 


Interrogation 
and utility' commands 
give the user 
convenient 
access to detailed information 
about the 
user program 
and the state of the 8086 that is useful 
in debugging 
hardware 
and software. 'Changes can 
be made in both 
memory 
and the 8086 registers, 
flags, input pins, and I/O ports. Commands 
are also 
provided for various 
utility operations 
such as load- 
ing and saving program 
files, defining 
symbols 
and 
macros, 
displaying 
trace 
data, 
setting 
up the 
memory map, and returning 
control to ISIS-II. A sum- 
mary 
of the 
basic 
interrogation 
and 
utility 
com- 
mands is shown in Table 2. 
' 


Memory/Register 
Commands 
Display or change 
the contents 
of: 


• 
Memory 
, 


• 8086 Reg isters 
• 8086 St~tus flags. 
• 8086 Input pins 
• 8086 I/O ports 
• 
ICE-86A Pseudo-Registers 
(e:g. emulation 
timer) 


Memory 
Mapping 
Commands 
Display, dec.lare, set, or reset the ICE-86A memory mapping. 


Symbol 
Manipulation 
Commands 
Display any or all symbols, 
program 
modules, and program 
line 
numbers 
and 
their 
associated 
values 
(locations 
in 
memory). 


Set the domain (choose the particular 
program 
module) for 
the line numbers. 


Define new symbols 
as they are needed in debugging. 


Remove 
any 
or all 
symbols, 
modules, 
and 
program 
statements. 


Change the value of any symbol. 


Select 
program 
modules 
whose 
symbols 
will 
be used in 
debugging. 


TYPE 
Assign or change the type of any symbol in the symbol table. 


DASM' 


Disassemble 
user program 
memory intoASM-86Assembler 
mnemonics. 
I.: 
I 


RQ/GT 
Set or display the status of the Request/Grant 
facility which 
enables the ICE·86A module to share the system bus with 
coprocessors. 


BUS 
Display which 
device In the user's iAPX 86 system 
is cu r- 
rently master of the system bus. 


CAUSE 
Display the cause of the most recent emulation 
break. 


PRINT 
Display the specified 
portion 
of the trace memory. 


LOAD 
Fetch user symbol table and object code from the inputfile. 


EVALUATE 
Display the value of an expression 
in binary, octal, decimal, 
hexadecimal, 
and ASOII. 


CLOCK 
Select the internal 
(ICE-86A module 
provided, 
for stand- 
alone 
mOde only) 
or an external 
(user-provided) 
system 
clock. 


RWTIMEOUT 
Allows the user to time out READ/WRITE command 
signals 
based on the time 
taken 
by the 8086 to access 
Intellec 
memory or diskette 
memory. 


ENABLE/DISABLE 
RDY 
Enable or disable 
logical AND of ICE-86A emulator 
Ready 
with the user Ready signal for accessing 
Intellec 
memory, 
ICE memory, or diskette 
memory. 
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The ICE-86A module has the extended capabilities 
to 


debug iAPX 86/20 microsystems 
which contain 
both 


the 
8086 
microprocessor 
and 
the 
8087 Numeric 


Processor 
Extension 
(NPX). An iAPX 86/20 system is 


configured 
in the 
8086's 
"maximum" 
mode 
and 


communication 
between 
the processors 
is accom- 


plished through 
the RQ/GT signals. 
Debugging 
can 


be done either 
using the 8087 chip itself (in which 


case the 8086 ESCAPE instruction 
is interpreted 
as a 


floating 
point 
instruction) 
or 
using 
the 
8087 


software 
emulator 
E8087 (where 
the 8086 INTER- 
RUPT instruction 
is interpreted 
as a floating 
point 


instruction). 
Three new data types are defined to use 


the NPX: 


REAL (4 byte Short 
Real) 


DREAL (8 byte Long Real) 
TREAL (10 byte Temporary 
Real) 


While the 8087 NPX is not a programmable 
part, it 


does interact closely with the 8086 and can execute 
instructions 
in parallel 
with it. The ICE-86A module 


provides 
information 
about 
the 
relative 
timing 
of 


instruction 
execution 
in each processor 
so that the 


complete 
system 
can be debugged. 
Other 
debug- 


ging 
capabilities 
available 
through 
the 
ICE-86A 


module 
are: symbolically 
disassemble 
NPX call in- 
structions 
from memory or trace history; 
display or 
change 
the control, 
status 
and flag values 
of the 


NPX; display the NPX stack either in hexadecimal 
or 


disassembled 
form; and display the last instruction 


address, 
last operand, 
and last operand 
address. 


The 8089 Real-Time 
Breakpoint 
Facility (RBF-89) is 


an extension 
of the ICE-86A emulator 
that aids in 


testing and trouble-shooting 
iAPX 86/11 systems de- 
signed around 
a combination 
of the 8086 CPU and 


the 8089 Input/Output 
Processor 
(lOP). RBF-89 in- 


terrogates 
8089 registers, 
sets breakpoints 
in 8089 


programs, 
and performs 
its other functions 
by pre- 
paring special 
control 
blocks 
in application 
system 


memory. 
It then 
issues 
input/output 
channel- 


attention 
commands 
to the 8089 in the user's system 


to perform 
these functions. 
While using the RBF-89 


extension, 
the user can also enter and execute the 


other standard 
ICE-86A emulator 
commands. 


RBF-89 
allows 
the 
user 
to 
load 
his application 


(channel) 
program 
from 
diskette 
into 
8089 
lOP 


memory and execute it in real time. The program 
can 


reside in either 
local (system) 
RAM (accessible 
by 


both the 8086 and 8089 microprocessors), 
or remote 


RAM (accessible 
by the 8089 lOP only). The user may 


request execution 
to begin at any location 
and con- 


tinue 
until 
normal 
termination, 
a specified 
break- 


point 
is reached, 
or 
the 
program 
is otherwise 


aborted. 
If a program 
is modified 
during 
a debug- 


ging session, RBF-89 can save the latest version 
by 


copying 
it from application 
system memory to a disk- 


ette file. 


RBF-89 supports 
setting 
up to twelve 
breakpoints 


(six per 8089 channel) 
in the user program. 
RBF-89 


implements 
each 
breakpoint 
by inserting 
a HALT 


instruction 
at the breakpoint 
location, 
while saving 


the overwritten 
instruction 
in temporary 
storage. 


When a breakpoint 
is reached 
during 
program 
ex- 


ecution the program 
halts. At this point the user can 


examine 8089 registers, 
flags, and memory, and op- 


tionally 
resume 
program 
execution. 
The 
invoked 


breakpoint 
address is recorded 
in one of two break- 


point registers-one 
register for each 8089 channel. 


Through 
simple 
RBF-89 
commands 
the 
user can 


display or change 
the contents 
of these registers. 


As in the ICE-86A emulator, 
the RBF-89 extension 


accepts 
symbolic 
references 
for 
variables 
and 


labels, 
including 
symbols 
in the 
symbol 
table 


generated 
by the ASM-89 assembler. 


Through 
RBF-89, the user can display 
and change 


the contents 
of : 


- 
memory, 
which 
can 
be 
displayed 
as either 


numeric 
data or disassembled 
(8089 assembly- 


language 
mnemonic) 
code. 


- 
all 8089 
registers 
except 
the 
channel 
control 


pointer 
(CCP) and status flags. 


The 
ICE-86A emulator 
and 
RBF-89 
support 
8089 


configurations 
in both local and remote modes. The 


ICE-86A emulator 
may be operating 
either 
in mini- 


mum or maximum 
mode. 
In maximum 
mode, the 


8086 RQ/GT lines are employed. This is required 
for 


the 8089 local mode configuration 
to provide 
local 


bus arbitration 
between 
the two processors. 
Using 


RBF-89, the user can: 
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Set 
RQ/GT 
to 
operate 
for 
a local 
or 
remote 
configuration. 


Display 
status 
to determine 
which 
processor 
con- 
trols 
the system 
bus. 


RBF-89 
permits 
the 8089 and emulated 
8086 to run 
simultaneously 
as well as sequentially. 
The user can 
specify 
breakpoints 
and begin program 
execution 
in 
three 
operating 
sequences: 


Set breakpoints, 
start the 8089, and return 
control 
to the console 
until a breakpoint 
is reached 
or the 
program 
runs to completion 
or is aborted. 
Use this 
sequence 
when 
the 8086 and 8089 programs 
do 
not need to be executed 
simultaneously. 


Set breakpoints, 
start 
the 8089, return 
control 
to 
the console, 
and start the 8086. This sequence 
lets 
both 
microprocessors 
run simultaneously. 


Set 
breakpoints, 
start 
the 
8086, 
and 
allow 
that 
program 
to 
drive 
the 
8089 
program 
in a mas- 


terlslave 
relationship. 
This 
sequence 
would 
be 
used, for instance, 
to verify 
the 8086 communica- 
tion 
driver 
program. 


RBF-89 
is furnished 
as a superset 
of the 
ICE-86A 
emulator 
software. 
Its main 
components 
are: 


A HOST PROGRAM 
that 
resides 
in Intellec 
devel- 
opment 
system 
RAM, where 
it serves 
as an exten- 
sion 
of 
the 
ICE-86A 
emulator's 
software 
driver. 
This program, 
executed 
by the development 
sys- 
tem, translates 
the user's keyboard 
input 
into low- 
level 
directives 
that 
can 
be 
processed 
by the 
RBF-89 
control 
program 
(described 
below), 
and 
converts 
information 
supplied 
by the control 
pro- 
gram 
into 
easily 
understood 
display 
output. 


A CONTROL 
PROGRAM 
that 
resides 
in 'ICE-86A 
emulator 
memory. 
Running 
on the emulator's 
8086 
microprocessor, 
the 
control 
program 
monitors 
such 
operations 
as preparing 
program 
control 
blocks 
for 
communication 
with 
the 
8089 
micro- 
processor; 
issuing 
commands 
to the 8089 to start, 
terminate, 
and 
continue 
the 
8089 task 
program; 


and 
directing 
the 8089 to start 
execution 
of the 
RBF-89 
utility 
program 
(described 
below). 


A UTILITY 
PROGRAM 
that resides 
in the 8089 RAM 
in the 
user's 
prototype 
application 
system. 
This 


program, 
running 
on the 8089, 
reads 
and writes 
data to and from 
8089 memory 
and registers, 
and 
sets and 
removes 
breakpoints 
in the 
user's 
task 
program. 


The 200 bytes of RAM required 
by the utility 
pro- 
gram 
must 
be 
accessible 
to 
both 
the 
ICE-86A 
emulator 
and the 8089. 


DC CHARACTERISTICS 
OF THE 
ICE-86A ™ 
MODULE USER CABLE 


A16/S3-A19/S7, 
BHE/S7, 
RD, 


LOCK, OSO, OS1, SO, S1, S2, 
WR, M/iO, DT/R, 
DEN, ALE, 


INTA 


IOL (Mln) 


12 mA 
(24 mA @ O.SV) 
8 mA 
(16 mA @ O.SV) 


2. Output 
High Voltages 
[VoH (Min)=2.4V] 
IOH (Min) 


ADO-AD1S 
- 3 mA 


A16/S3-A19/S7, 
BHE/S7, 
RD, 


LOCK, OSO, OS1, SO, S1, S2, 
WR, MIlO, DT/R, 
DEN, ALE, 
INTA, HLDA 


ADO-AD1S 
NMI, CLK 
READY 
. 


INTR, HOLD, TEST, RESET 
MN/MX 
(0.1 p,f to GND) 


3. Input 
Low Voltages 
[V1L (Max)=O.8V] 


IlL (Max) 


-0.2 
mA 


-0.4 
mA 


-0.8 
mA 


-1.4 
mA 


-3.3 
mA 


ADO-AD1S 
NMI, CLK 
READY 
INTR, HOLD, TEST, RESET 
MN/MX 
(0.1 p,F to GND) 


4. 
Input 
High Voltages 
[V1H (Min)=2.0V] 


IIH (Max) 


80p,A 
20p,A 
40p,A 


-0.4 
mA 


-1.1 
mA 


5. 
No current 
is taken 
from 
the 
user circuit 
at 
Vcc 
pin. 
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ICE-86A Operating Environment 


REQUIRED 
HARDWARE 


Intellec 
microcomputer 
development 
system with: 


1. Three adjacent 
slots for the ICE-86A module. 


2. 64K bytes of Intellec 
memory. 
If user prototype 
program 
memory 
is desired, 
additional 
memory 
above the basic 64K is required. 


System console 
Intel lee diskette 
operating 
system 
ICE-86A module 


REQUIRED 
SOFTWARE 
System Monitor 
ISIS-II, version 
3.4 or subsequent 


ICE-86A software 


Printed circuit 
boards (3) 
Interface 
cable and emulation 
buffer 
module 
Operator's 
manual 
ICE-86A software, 
diskette-based 


User system 
clock 
up to 5 MHz or 2 MHz ICE-86A 


internal 
clock 
in stand-alone 
mode 


PRINTED 
CIRCUIT 
BOARDS 
Width: 
12.00 in (30.48 em) 
Height: 
6.75 in (17.15 em) 
Depth: 
0.50 in (1.27 em) 
Packaged Weight: 
9.00 Ib (4.10 kg) 


DC POWER 
Vcc = +5V +5%-1% 
Icc = 17A maximum; 
11A typical 
VDD= +12V 
±5% 
IDD = 120 mA maximum; 
80 mA typical 
VBB = -10V 
±5% or -12V 
±5% 
(optional) 


IsB = 25 mA maximum; 
12 mA typical 


OPERATING 
TEMPERATURE 
rJ't04rJ'C 


OPERATING 
HUMIDITY 
Up to 95% relative humidity 
without 
condensation. 


MDS*-86A-ICE 


MDS*-86U-ICE 


iAPX 86 microsystem 
in-circuit 
emulator, 
cable assembly, and interactive 
software 


Upgrade 
kit to convert 
ICE-86 emulators 
to ICE-86A emulator 
capabilities. 


*MDS is an ordering code only and is not used as a product name or trademark. 
MD~ 
is a registered trademark 
of Mohawk Data 


Sciences Corporation. 


ICE 86A TM, ICE 88A™ 


iAPX 86,88 
IN-CIRCUIT EMULATOR 


• 
Disassembly 
of trace 
or memory 
from 
object 
code into assembler 
mnemonics 


• 
Full symbolic 
debugging 
support 
for all 


languages 
• 
Software 
debugging 
with or without 
user 


system 
• 
Supports 
iAPX 86/21, 88/21 


configurations 
• 
Handles 
full 1 megabyte 
addressability 
of iAPX 86, 88 


• 
Comprehensive 
trace 
of program 


execution, 
both conditional 
and 


unconditional 


The ICE-86A(88A) 
module provides in-circuit emulation for the 8086(88) 
microprocessor 
and the iSBC 86/12A 
single 
board computer. It includes three circuit boards which reside in Intellec Series II or Series III Microcomputer 
Development 
System. A cable and buffer box connect the Intellec system to the user system by replacing the user's 8086(88). 
Powerful 
Intellec debug functions are thus extended into the user system. Using the ICE·86A(88)A module. the designer can execute 
prototype software in continuous or single-step mode and can substitute blocks of Intellec system memory for user equiv- 
alents. Breakpoints allow the user to stop emulation on user-specified conditions, and the trace capability gives a detailed 
history of the program execution prior to the break. All user access to the prototype 
system software 
may be done 
symbolically by referring to the source program variables and labels for all languages. 


PLUG INTO 
USER 
8086 
SOCKET 
(8088) 
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INTEGRATED 
HARDWARE/SOFTWARE 


DEVELOPMENT 


The ICE·86A(88A) 
emulator allows hardware 
and soft- 


ware development to proceed interactively. 
This is more 


effective than the traditional method of independent hard- 
ware and software development followed by system inte- 
gration. 
With 
the 
ICE-86A(88A) 
module, 
prototype 


hardware can be added to the system as it is designed. 
Software and hardware testing occurs while the product is 
being developed. 


Conceptually, 
the ICE-86A(88A) 
emulator 
assists 
three 


stages of development: 


1. It Can be operated without being connected to the us- 
er's system, so ICE-86A(88A) debugging capabilities can 
be used to facilitate program development before any of 
the user's hardware is available. 


2. Integration of software and hardware can begin when 
any functional element of the user system hardware is con- 


nected to the 8086(88) 
socket. 
Through ICE-86A(88A) 


mapping capabilities, Intellec memory, ICE memory, or dis- 
kette memory can be substituted 
for missing prototype 


memory. Time-critical program modules are debugged be- 
fore hardware 
implementation by using the 2K-bytes 
of 


high-speed ICE·resident memory. As each section of the 
user's hardware is completed, it is added to the prototype. 
Thus each section of the hardware and software 
is ·sys- 


tem" tested as it becomes available. 


3. When the user's prototype is complete, it is tested with 
the final version of the user system software. 
The ICE- 


86A(88A) 
module is then used for real-time emulation of 


the 8086(88) 
to debug the system as a completed unit. 


Thus the ICE-86A(88A) module provides the user with the 
ability to debug a prototype 
or production system at any 


stage in its development without introducing extraneous 
hardware or software test tools. 


iAPX 86/20, 88/20 
Numerics Supplement 
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THE 8087 NUMERIC DATA PROCESSOR 


This supplement describes the 8087 Numeric Data 
Processor 
(NDP). 
Its organization 
is similar to 


chapters 
2 and 3 of The 8086 Family 
User's 


Manual: 


I. 
Processor Overview 


2. 
Processor Architecture 


3. 
Computation 
Fundamentals 


4. 
Memory 


5. 
Multiprocessing Features 


6. 
Processor Control and Monitoring 


7. 
Instruction Set 


8. 
Programming Facilities 


9. 
Special Features 


10. Programming Examples 


Section 
I covers both 
hardware 
and software 


topics at a general level. Sections 2 and 4 through 
6 are largely hardware-oriented, 
while sections 3 
and 7 through 
10 are of greatest interest to pro- 
grammers. 
Section 9 describes features 
of ,the 
NDP that will be of interest to specialized groups 
of users; it is not necessary to understand this sec- 
tion to successfully use the 8087 in most applica- 
tions. Hardware 
coverage in this supplement 
is 
limited to discussing processor facilities in func- 
tional 
terms. 
Timing, 
electrical characteristics, 
and other physical interface data may be found in 
Appendix B, as well as in Chapter 4 of The 8086 
Family User's Manual. 


Note that throughout 
this supplement 
the term 


"CPU" 
refers to either an 8086 or 8088 con- 
figured in maximum mode. To make best use of 
the material in this publication, 
readers should 
have a good understanding of the operation of the 
8086/8088 CPUs. 


The 
8087 
Numeric 
Data 
Processor 
is 
a 
coprocessor 
that performs 
arithmetic 
and com- 
parison operations 
on a variety of numeric data 
types; 
it 
also 
executes 
numerous 
built-in 
transcendental 
functions 
(e.g., tangent 
and log 


functions). As a coprocessor to a maximum mode 
8086 or 8088, the NDP effectively extends the 


register and instruction sets of the host CPU and 
adds several new data types as well. The pro- 
grammer generally does not perceive the 8087 as a 
separate 
device; 
instead, 
the 
computational 
capabilities of the CPU appear greatly expanded. 


The 8087 is the only chip required to add exten- 
sive high-speed numeric processing capabilities to 
an 8086- or 8088-based system. It is specifically 
designed to deliver stable, correct results when 
used in a straightforward 
fashion by program- 
mers who are not expert in numerical analysis. Its 
applicability 
to 
accounting 
and 
financial 
environments, 
in 
addition 
to 
scientific 
and 
engineering 
settings, 
further 
distinguishes 
the 
8087 from 
the 
"floating 
point 
accelerators" 
employed in many computer 
systems, including 
minicomputers 
and 
mainframes. 
The NDP 
is 
housed in a standard 40-pin dual in-line package 
(figure S-I) and requires 
a single +5V power 
source. 


The 
description 
of 
the 
8087 in 
this 
section 
deliberately omits some operating details in order 
to 
provide 
a 
coherent 
overall 
view 
of 
the 
processor's 
capabilities. 
Subsequent 
sections of 
the supplement 
describe these capabilities, 
and 
others, in more detail. 


The performance 
of first- and second-generation 
microprocessor-based 
systems 
was 
limited 
in 
three 
principal 
areas: 
storage 
capacity, 
input/output 
speed, and numeric computation. 
The 8086 and 8088 CPUs broke the 64k memory 
barrier, 
allowing larger and more time-critical 
applications 
to 
be 
undertaken. 
The 
8089 
Input/Output 
Processor eliminated many of the 
I/O bottlenecks 
and permitted 
microprocessors 
to 
be 
employed 
effectively 
in 
I/O-intensive 
designs. The 8087 Numeric Data Processor clears 
the third roadblock by enabling applications with 
significant 
computational 
requirements 
to 
be 
implemented with microprocessor technology. 


Figure S-2 illustrates 
the progression 
of Intel 
numeric products and events that have led to the 
development of the 8087. In the mid-1970's, Intel 
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Figure 8-1. 8087 Numeric Data Processor 
Pin 
Diagram 


made the commitment 
to expand the computa- 


tional 
capabilities 
of 
microprocessors 
from 


addition and subtraction of integers to an array of 
widely useful operations on real numbers. (Real 
numbers 
encompass 
integers, 
fractions, 
and 


irrational numbers such as 1! and V2.) In 1977, 
the corporation 
adopted 
a standard 
for repre- 


senting 
real 
numbers 
in 
a 
"floating 
point" 


format. Intel's Floating Point Arithmetic Library 
(FPAL) was the first product to utilize this stan- 
dard format. FP AL is a set of subroutines for the 
80S0/S0S5 
microprocessors. 
These routines per- 
form arithmetic 
and limited standard 
functions 


on single precision 
(32-bit) real numbers; 
an 


FPAL multiply executes in about 
1.5 ms (1.6 


MHz SOSOACPU). The next product, 
the iSBC 


31O™ High 
Speed 
Math 
Unit, 
essentially 
implements 
FP AL 
in 
a 
single 
iSBC™ card, 
reducing a single-precision multiply to about 100 
JAs.The Intel® S232 is a single-chip arithmetic pro- 
cessor for the SOSO/SOS5 family. The S232 accepts 
double 
precision 
(64-bit) operands 
as well as 


single precision 
numbers. 
It performs 
a single 


precision multiply in about 100 JAsand multiplies 
double precision numbers in about S75 JAS(2 MHz 
version). 
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Figure 8-2. SOS7Evolution 
and Relative 


Performance 


In 1979, a working committee of the Institute for 
Electrical and Electronic Engineers (IEEE) pro- 
posed an industry standard for minicomputer and 
microcomputer 
floating 
point arithmetic·. 
The 


intent of the standard is to promote portability of 
numeric programs between computers and to pro- 
vide a uniform 
programming 
environment 
that 


encourages the development of accurate, reliable 
software. 
The 
proposed 
standard 
specifies 


requirements and options for number formats as 
well as the results 
of computations 
on these 


numbers. The floating point number formats are 
identical to those previously adopted by Intel and 
used in the products described in this section. 


The SOS7 Numeric Data Processor 
is the most 


advanced development in Intel's continuing effort 
to 
provide 
improved 
tools 
for 
numerically- 


oriented 
microprocessor 
applications. 
It 
is a 
single-chip 
hardware 
implementation 
of 
the 


proposed IEEE standard, including all its options 
for single and double precision numbers. As such, 
it is compatible 
with previous 
Intel numerics 


products; programs written for the SOS7will be 
transportable 
to future products that conform to 


• J. Coonen, 
W. Kahan, 
1. Palmer, 
T. Pittman, 
D. Stevenson, 
"A Proposed 
Standard 
for Binary 
Floating 
Point 
Arithmetic," 
ACM SIGNUM Newsletter, 
October 
1979. 


the proposed IEEE standard. The NDP also pro- 
vides 
many 
additional 
functions 
that 
are 


extensions to the proposed standard. 


As figure S-2 indicates, the 8087 provides about 
10 times the instruction speed of the 8232 and a 
loo-fold 
improvement 
over 
FPAL. 
The 
8087 


multiplies 32-bit and 64-bit real numbers in about 
19 /As and 27 /As, respectively. Of course, the actual 
performance 
of the 
NDP 
in a given system 


depends 
on 
numerous 
application-specific 


factors. 


Table S-I compares the execution times of several 
8087 instructions 
with the equivalent operations 


executed in software on a 5 MHz 8086. The soft- 
ware equivalents are highly optimized assembly 
language procedures from the 8087 emulator, an 
NDP development 
tool discussed later in this 


section. 


The performance 
figures quoted in this section 


are 
for 
operations 
on 
real 
(floating 
point) 


numbers. 
The 8087 also has instructions 
that 


enable it to utilize fixed point binary and decimal 
integers of up to 64 bits and 18 digits, respec- 
tively. Using an 8087, rather than multiple preci- 
sion software algorithms for integer operations, 
can provide speed improvements of 10-100times. 


The 8087's unique coprocessor 
interface to the 


CPU can yield an additional performance 
incre- 


ment beyond that of simple instruction speed. No 
overhead is incurred in setting up the device for a 
computation; 
the 8087 decodes its own instruc- 
tions automatically 
in parallel 
with the CPU. 


Moreover, 
built-in coordination 
facilities allow 


the CPU to proceed with other instructions while 
the 8087 is simultaneously executing its numeric 
instruction. 
Programs 
can exploit this processor 


parallelism to increase total system throughput. 


Viewed strictly from the standpoint of raw speed, 
the 8087 enables serious computation-intensive 
tasks to be performed by microprocessors 
for the 
first time. The 8087 offers more than just high 
performance, 
however. By synthesizing advances 


made by numerical analysts in the past several 
years, the NDP provides a level of usability that 
surpasses existing minicomputer 
and mainframe 


arithmetic units. In fact, the charter of the 8087 
design team was first to achieve exceptional func- 
tionality and then to obtain high performance. 


The 8087 is explicitly designed to deliver stable, 
accurate 
results 
when 
programmed 
using 


straightforward 
"pencil and paper" 
algorithms. 


While this statement 
may seem trivial, 
experi- 


enced users of "floating 
point processors" 
will 


Approximate Execution Time (/As) 
(5 MHz Clock) 
Instruction 


8087 
8086 
Emulation 


Multiply(single precision) 
19 
1,600 
Multiply(double precision) 
27 
2,100 
Add 
17 
1,600 
Divide(single precision) 
39 
3,200 


I 


Compare 
9 
1,300 
Load (single precision) 
9 
1,700 
Store (single precision) 
18 
1,200 
Square root 
36 
19,600 
Tangent 
90 
13,000 
Exponentiation 
100 
17,100 


recognize 
its 
fundamental 
importance. 
For 
example, most computers can overflow when two 
single 
precision 
floating 
point 
numbers 
are 
multiplied together and then divided by a third, 
even if the final result is a perfectly valid 32-bit 
number. The 8087 delivers the correctly rounded 
result. 
Other 
typical 
examp~s 
of undesirable 
machine behavior in straightforward 
calculations 
occur when solving for the roots of a quadratic 
equation: 
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or computing 
financial 
rate 
of return, 
which 
involves the expression: 
(l +i)n. Straightforward 
algorithms 
will not deliver consistently 
correct 
results (and will not indicate when they are incor- 
rect) on most machines. To obtain correct results 
on 
traditional 
machines 
under 
all conditions 
usually 
requires 
sophisticated 
numerical 
tech- 
niques that are foreign to most programmers. 
General application 
programmers 
using straight- 
forward 
algorithms 
will produce 
much 
more 
reliable programs on the 8087. This simple fact 
greatly reduces the software investment required 
to 
develop 
safe, .accurate 
computation-based 
products. 


Beyond traditional 
numerics support for "scien- 
tific" applications, 
the 8087 has built-in facilities 
for 
"commerical" 
computing. 
It can process 
decimal numbers of up to 18digits without round- 
off errors, and it performs exact arithmetic 
on 
integers as large as 264. Exact arithmetic is vital in 
accounting 
applications 
where rounding 
errors 
may 
introduce 
money 
losses 
that 
cannot 
be 
reconciled. 


The NDP contains a number of facilities that can 
optionally 
be invoked 
by sophisticated 
users. 
Examples of these advanced features include two 
models of infinity, 
directed 
rounding, 
gradual 
underflow, 
and traps to user-written 
exception 
handling software. 


The NDP's versatility and performance 
make it 
appropriate 
to a broad 
array 
of numerically- 
oriented 
applications. 
In general, 
applications 


that exhibit any of the following characteristics 
can benefit by implementing numeric processing 
on the 8087: 


• 
Numeric 
data 
vary over a wide range of 
values, or include non-integral values; 


• 
Algorithms produce very large or very small 
intermediate results; 


• 
Computations 
must be very precise, i.e., a 
large number of significant 
digits must be 
maintained; 


• 
Performance 
requirements 
exceed 
the 
capacity of traditional microprocessors; 


• 
Consistently 
safe, reliable results must be 
delivered using a programming 
staff that is 
not expert in numerical techniques. 


Note also that 
the 8087 can reduce 
software 
development costs and improve the performance 
of systems that do not utilize real numbers but 
operate 
on 
multi-precision 
binary 
or decimal 
integer values. 


A few examples, which show how the 8087 might 
be utilized in specific numerics applications, 
are 
described below. In many cases, these types of 
systems have been implemented in the past with 
minicomputers. The advent of the 8087 brings the 
size 
and 
cost 
savings 
of 
microprocessor 
technology to these applications for the first time. 


• 
Business data processing-The 
NDP's ability 
to accept 
decimal 
operands 
and 
produce 
exact decimal results of up to 18digits greatly 
simplifies accounting 
programming. 
Finan- 
cial calculations which use power functions 
can 
take 
advantage 
of 
the 
8087's 
exponentiation and logarithmic instructions. 


• 
Process control-The 
8087 solves dynamic 
range 
problems 
automatically 
and 
its 
extended precision allows control functions 
to be fine-tuned for more accurate and effi- 
cient 
performance. 
C.ontrol 
algorithms 
implemented with the NDP also contribute 
to improved reliability and safety, while the 
8087's speed can be exploited in real-time 
operations. 


• 
Numerical control-The 
8087 can move and 
position 
machine 
tool heads with extreme 
accuracy. Axis positioning also benefits from 
the hardware trigonometric support provided 
by the 8087. 


• 
Robotics-Coupling 
small size and modest 
power requirements with powerful computa- 
tional abilities, the NDP is ideal for on-board 
six-axis positioning. 


• 
Navigation-Very 
small, light weight, and 
accurate 
inertial 
guidance 
systems can be 
implemented 
with 
the 
8087. 
Its 
built-in 
trigonometric 
functions 
can 
speed 
and 
simplify the calculation 
of position 
from 
bearing data. 


• 
Graphics terminals-The 
8087 can be used in 
graphics terminals to locally perform many 
functions which normally demand the atten- 
tion of a main computer; these include rota- 
tion, 
scaling, 
and 
interpolation. 
By also 
including an 8089 Input/Output 
Processor to 
perform 
high 
speed 
data 
transfers, 
very 
powerful and highly self-sufficient terminals 
can be built from a relatively small number 
of 8086 family parts. 


• 
Data acquisition-The 
8087 can be used to 
scan, scale, and reduce large quantities 
of 
data 
as it is collected, 
thereby 
lowering 
storage 
requirements 
as well as the time 
required to process the data for analysis. 


The 
preceding 
examples 
are 
oriented 
toward 
"traditional" 
numerics applications. 
There are, 
in addition, many other types of systems that do 
not appear to the end user as "computational," 
but can employ the 8087 to advantage. 
Indeed, 
the 8087 presents the imaginative system designer 
with an opportunity 
similar to that created by the 
introduction 
of the microprocessor 
itself. Many 
applications can be viewed as numerically-based 
if sufficient computational 
power is available to 
support 
this 
view. 
This 
is analogous 
to 
the 
thousands of successful products that have been 
built 
around 
"buried" 
microprocessors, 
even 
though 
the 
products 
themselves 
bear 
little 
resemblance to computers. 


The combination of an 8086 or 8088 CPU and an 
8087 generally appears to the programmer 
as a 
single machine. 
The 8087, in effect, adds new 
data types, registers, and instructions to the CPU. 
The programming 
languages and the coprocessor 
architecture 
take 
care 
of 
most 
interprocessor 
coordination automatically. 


Table S-2 lists the seven 8087 data types. Inter- 
nally, the 8087 holds all numbers in the temporary 
real format; the extended range and precision of 
this format 
are key contributors 
to the NDP's 
ability -to consistently 
deliver 
stable, 
expected 
results. The 8087's load and store instructions 
convert operands between the other formats and 
temporary 
real. The fact that these conversions 
are made, 
and 
that 
calculations 
may be per- 
formed on converted numbers, 
is transparent 
to 
the 
programmer. 
Integer 
operands, 
whether 
binary or decimal, yield correct integer results, 
just as real operands 
yield correct real results. 
Moreover, a rounding error does not occur when 
a number in an external format is converted to 
temporary real. 


Computations 
in the 8087 center 
on the pro- 
cessor's register stack. These eight 80-bit registers 
provide the equivalent capacity of 40 of the 16-bit 
registers found in typical CPUs. 
This generous 
register 
space 
allows 
more 
constants 
and 
intermediate results to be held in registers during 
calculations, reducing memory access and conse- 
quently improving execution speed as well as bus 
availability. The 8087 register set is unique in that 
it can be accessed both as a stack, with instruc- 
tions operating implicitly on the top one or two 
stack elements, and as a fixed register set, with 
instructions 
operating 
on explicitly 
designated 
registers. 


Table S-3 lists the 8087's major instructions 
by 
class. Assembly language programs are written in 
ASM-86, the 8086/8088/8087 
common assembly 
language. ASM-86 provides directives for defin- 
ing all 8087 data types and mnemonics 
for all 
instructions. The fact that some instructions in a 
program are executed by the 8087 and others by 
the CPU is usually of no concern to the pro- 
grammer. All 8086/8088 addressing modes may 
be used to access memory-based 
8087 operands, 
enabling 
convenient 
processing 
of 
numeric 
arrays, structures, based variables, etc. 


NDP routines may also be written in PLlM-86, 
Intel's high-level language for the 8086 and 8088 
CPUs. PL/M-86 
provides the programmer 
with 
access to many 
8087 facilities 
while reducing 
the 
programmer's 
need 
to 
understand 
the 
architecture of the chip. 


Two 
features 
of 
the 
8087 hardware 
further 
simplify numeric application programming. 
First, 
the 8087 is invoked directly by the programmer's 
instructions. There is no need to write instructions 


Word integer 


Short integer 


Long integer 


Packed decimal 


Short real" 


Long rea" 


Temporary real 


Significant 
Digits (Decimal) 


4 


9 


18 


18 


6-7 


15-16 


19 


-32,768 ,;;;X,;;; + 32,767 


-2x109,;;;X,;;; + 2x109 


-9x1018,;;;X,;;; +9x1018 


-99 ...99';;;X,;;; +99 ...99(18digits) 


8.43x10-37,;;;Ixl,;;; 3.37x1038 


4.19x10-307,;;;lxl';;;1.67x10308 


3.4x10-4932,;;;IXI ,;;;1.2x104932 


'The short and long real data types correspond to the single and double precision data types 
defined in other Intel numerics products. 
' 


Data Transfer 


Arithmetic 


Comparison 


Transcendental 


Constants 


Processor Control 


Load (all data types), Store (all data types), Exchange 


Add, 
Subtract, 
Multiply, 
Divide, 
Subtract 
Reversed, 


Divide 
Reversed, 
Square 
Root, 
Scale, 
Remainder, 


Integer Part, Change Sign, Absolute Value, Extract 


Compare, Examine, Test 


Tangent, Arctangent, 2X-1, Y.Log2(X + 1), Y.Log2(X) 


0, 1,n,Log102, Loge2, Log210,Log2e 


Load Control 
Word, Store Control Word, Store Status 
Word, 
Load 
Environment, 
Store 
Environment, 
Save, 


Restore, 
Enable 
Interrupts, 
Disable 
Interrupts, 
Clear 
Exceptions, Initialize 


that "address" 
the NDP as an "I/O 
device", or 
to incur the overhead of setting up a DMA opera- 
tion to perform data transfers. Second, the NDP 
automatically 
detects exception 
conditions 
that 
can potentially damage a calculation at run-time. 
On-chip 
exception 
handlers 
are 
automatically 
invoked by default to field these exceptions so 
that a reasonable result is produced and execution 
may 
proceed 
without 
program 
intervention. 


Alternatively, the 8087 can interrupt the CPU and 
thus trap to a user procedure when an exception is 
detected. 


Besides 
the 
assembler 
and 
compiler, 
Intel 
provides a software emulator 
for the 8087. The 
8087 emulator (E8087) is a software package that 
provides the functional equivalent of an 8087; it 


executes entirely on an 8086 or 8088 CPU. The 
emulator 
allows 8087 routines 
to be developed 
and 
checked 
out 
on an 
8086/8088 
execution 
vehicle before prototype 8087 hardware is opera- 
tional. 
At the source 
code level, there 
is no 


difference between a routine that will ultimately 
run on an 8087 or on a CPU emulation 
of an 


8087. At link time, the decision is made whether 
to use the NDP or the software emulator; no re- 
compilation 
or re-assembly is necessary. Source 


programs are independent of the numeric execu- 
tion vehicle: except for timing, the operation 
of 


the emulated NDP is the same as for "real hard- 
ware". The emulator also makes it simple for a 
product 
to 
offer 
the 
NDP 
as 
a 
"plug-in" 


performance 
option 
without 
the 
necessity 
of 


maintaining two sets of source code. 


As a coprocessor to an 8086 or 8088, the 8087 is 
wired directly to the CPU as shown in figure S-3. 
The CPU's 
queue status lines (QSO and QSl) 
enable the NDP to obtain and decode instructions 
in synchronization 
with the CPU. 
The NDP's 
BUSY signal informs the CPU that the NDP is 
executing; the CPU WAIT instruction 
tests this 
signal to ensure that the NDP is ready to execute a 
subsequent instruction. 
The NDP can interrupt 
the CPU when it detects an exception. The NDP's 
interrupt 
request line is typically routed 
to the 
CPU through an 8259A Programmable 
Interrupt 
Controller. 


The 
NDP 
uses 
one 
'of 
its 
host 
CPUJs 
request/grant 
lines to obtain control of the local 
bus for data transfers 
(loads and stores). The 
other CPU 
request/grant- 
line is available 
for 
general system use, for example, by a local 8089 
Input/Output 
Processor. 
A local 8089 may also 
be connected to the 8087's'RQ/GTl 
line. In this 
configutation, 
the 8087 passes the request/grant 
handshake signals between the CPU and the lOP 


when the 8087 is not in control of the local bus. 
When it is in control of the bus, the 8087 relin- 
quishes the bus (at the end of the current bus 
cycle) upon a request from the connected lOP, 
giving the lOP higher priority than itself. In this 
way, two local 8089's can be configured 
in a 
module that also includes a CPU and an 8087. 


All processors utilize the same clock generator 
and system bus interface components 
(bus con- 
troller, 
latches, 
transceivers, 
and bus arbiter). 


Thus, no additional 
hardware 
beyond the 8087 
is 
required 
to 
add 
powerful 
cOIUPutational 
capabilities to an 8086- or 8088-based system. 


As shown in figure S-4, the NDP is internally 
divided into two processing elements, the control 
unit (CU) and the numeric execution unit (NEU). 
In essence, the NEU executes all numeric instruc- 
tions, while the CU fetches instructions, 
reads 
and writes memory operands, 
and executes the 
processor control class of instructions. 
The two 


8284 
CLOCK 
GENERATOR 


eLK 


MULTIMASTER 
SYSTEM 
BUS 


elements are able to operate 
independently 
of 
one 
another, 
allowing 
the 
CU 
to 
maintain 


synchronization 
with the CPU while the NEU 


executes numeric instructions. 


The CU keeps the 8087 operating in synchroniza- 
tion with its host CPU. 
8087 instructions 
are 


intermixed 
with CPU 
instructions 
in a single 
instruction 
stream 
fetched 
by 
the 
CPU. 
By 
monitoring the status signals emitted by the CPU, 
the NDP control 
unit can determine 
when an 


instruction is being fetched. When the instruction 
byte or word becomes available on the local bus, 
the CU taps the bus in parallel with the CPU and 
obtains that portion of the instruction. 


The CU maintains 
an instruction 
queue that is 


identical 
to the queue 
in the host 
CPU. 
By 


monitoring the CPU's queue status lines, the CU 
is able to obtain and decode instructions from the 
queue in synchronization with the CPU. In effect, 
both processors fetch and decode the instruction 
stream in parallel. 


The two processors execute the instruction stream 
differently, however. The first five bits of all 8087 
machine 
instructions 
are 
identical; 
these 
bits 


designate the coprocessor 
escape (ESe) class of 


instructions. The control unit ignores all instruc- 
tions that do not match these bits, since these 
instructions are directed to the CPU only. When 
the CU decodes 
an instruction 
containing 
the 


escape code, 
it either executes the instruction 


itself, or passes it to the NEU, depending on the 
type of instruction. 


The CPU distinguishes between ESC instructions 
that reference memory and those that do not. If 
the instruction refers to a memory operand, 
the 


CPU calculates the operand's 
address and then 


performs a "dummy 
read" 
of the word at that 


location. This is a normal read cycle, except that 
the CPU ignores the data it receives. If the ESC 
instruction does not contain a memory reference, 
the CPU simply proceeds to the next instruction. 


A given 8087 instruction (an ESC to the CPU) will 
either require loading an operand from memory 
into the 8087, or will require storing an operand 
from the 8087 into memory, or will not reference 
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& 
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memory at all. In the first two cases, the CU 
makes use of the "dummy 
read" 
cycle initiated 
by the CPU. 
The CU captures 
and saves the 
operand address that the CPU places on the bus 
early in the "dummy 
read". 
If the instruction is 
an 8087 load, the CU additionally 
captures the 
first (and possibly only) word of the operand 
when it becomes available 
on the bus. If the 
operand to be loaded is longer than one word, the 
CU immediately obtains the bus from the CPU 
and reads the rest of the operand in consecutive 
bus cycles. In a store operation, the CU captures 
and saves the operand address as in a load, and 
ignores the data word that follows in the "dummy 
read" cycle. When the 8087 is ready to perform 
-the store, the CU obtains the bus from the CPU 
·and writes the operand at the saved address using 
as many consecutive bus cycles as are necessary to 
store the operand. 


The NEU executes all instructions 
that involve 
the 
register 
stack; 
these 
include 
arithmetic, 
comparison, 
transcendental, 
constant, 
and data 
transfer instructions. The data path in the NEU is 
68 bits wide and allows internal operand transfers 
to be performed at very high speeds. 


Each of the eight registers in the 8087's register 
stack is 80 bits wide, and each is divided into the 
"fields" 
shown 
in 
figure 
S-5. 
This 
format 
corresponds 
to the NDP's 
temporary 
real data 
type that is used for all calculations. 
Section S.3 
describes in detail how numbers are represented in 
the temporary real format. 


At a given point in time, the ST field in the status 
word (described 
shortly) 
identifies 
the current 
top-of-stack register. A load ("push") 
operation 
decrements ST by 1and loads a value into the new 
top register. A store-and-pop 
operation stores the 
value from 
the current 
top 
register and 
then 
increments ST by I. Thus, like 8086/8088 
stacks 
in memory, the 8087 register stack grows "down" 
toward lower-addressed registers. 


Instructions 
may address registers either implic- 
itly or explicitly. Many instructions operate on the 
register at the top of the stack. These instructions 
implicitly address the register pointed to by ST. 
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For example, 
the ASM-86 instruction 
FSQRT 
replaces the number at the top of the stack with its 
square root; this instruction 
takes no operands 
because the top-of-stack register is implied as the 
operand. 
Other instructions 
allow the program- 
mer to explicitly specify the register that is to be 
used. 
Explicit 
register 
addressing 
is 
"top- 
relative" 
where 
the 
ASM-86 
expression 
ST 
denotes the current 
stack top and ST(i) refers 
to the ith register from ST in the stack (O~ i ~7). 
For example, if ST contains OllB (register 3 is the 
top of the stack), the following instruction would 
add registers 3 and 5: 


FADD 
5T,5T(2) 


In typical use, the programmer may conceptually 
"divide" 
the registers into a fixed group and an 
adjustable group. The fixed registers are used like 
the conventional registers in a CPU, to hold con- 
stants, accumulations, 
etc. The adjustable group 
is used like a stack, with operands pushed on and 
results popped off. After loading, the registers in 
the fixed group are addressed 
explicitly, while 
those 
in the 
adjustable 
group 
are 
addressed 
implicitly. 
Of 
course, 
all 
registers 
may 
be 
addressed 
using either mode, 
and the "defini- 
tion" of the fixed versus the adjustable areas may 
be altered at any time. Section S.8 contains a pro- 
gramming example that illustrates typical register 
stack use. 


The stack organization 
and top-relative address- 
ing 
of 
the 
registers 
simplify 
subroutine 
programming. 
Passing subroutine parameters on 
the register stack eliminates 
the need for the 
subroutine 
to "know" 
which registers actually 
contain 
the 
parameters 
and 
allows 
different 
routines 
to call the 
same subroutine 
without 
having 
to 
observe 
a 
convention 
for 
passing 
parameters in dedicated registers. So long as the 
stack is not full, each routine simply loads the 
parameters on the stack and calls the subroutine. 
The subroutine 
addresses the parameters 
as ST, 
ST(l), etc., even though ST may, for example, 
refer to register 3 in one invocation and register 5 
in another. 
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the 8087; it may be examined by storing it into 
memory 
with 
an 
NDP 
instruction 
and 
then 


inspecting it with CPU code. The status word is 
divided into the fields shown in figure S-6. The 
busy field (bit 15) indicates whether the NDP is 
executing an instruction (B=I) or is idle (B=O). 


Several 8087 instructions (for example, the com- 
parison 
instructions) 
post 
their 
results 
to the 


condition 
code (bits 14 and 10-8 of the status 


word). The principal use of the condition code is 
for 
conditional 
branching. 
This 
may 
be 


accomplished by executing an instruction that sets 
the condition 
code, storing the status word in 
memory and then examining the condition code 
with CPU instructions. 


Bits 13-11 of the status word point to the 8087 
register that is the current stack top (ST). Note 
that 
if ST=OOOB, a "push" 
operation, 
which 


decrements 
ST, produces 
ST=IIIB; 
similarly, 
popping the stack with ST= III B yields ST=OOOB. 


Bits 5-0 are set to indicate 
that the NEU has 


detected an exception while executing an instruc- 
tion. Section S.3 explains these exceptions. 


To 
satisfy 
a 
broad 
range 
of 
application 


requirements, the NDP provides several process- 
ing options which are selected by loading a word 
from memory into the control word. Figure S-7 
shows the format and encoding of the fields in the 
control word; it is provided here for reference. 
Section S.3 explains the use of each of these 8087 
facilities except the interrupt-enable 
control field, 


which is covered in section S.6. 


The tag word marks the content of each register 
as shown in figure S-8. The principal 
function 


III 
L EXCEPTION FLAGS (1 = EXCEPTION HAS OCCURRED) 


INVALID OPERATION 


DENORMALIZED OPERAND 


ZERODIVIDE 


OVERFLOW 


UNDERFLOW 


PRECISION 


~--------- 
(RESERVED) 


~------------INTERRUPTREQUEST 
L....----ll-----'-------------- 
CONDITION CODEI') 


L.... 
STACK TOP POINTERI21 


BUSY 


(1) 
See descriptions 
of compare, 
test, 
examine 
and remainder 
instructions 
in section 
S.7 for 
condition code interpretation. 
(2) ST values: 
000 = register 0 is stack top 
0~1 = register 1 is stack top 


III 
L EXCEPTIONMASKS(1 = EXCEPTIONISMASKED) 


INVALIDOPERATION 


DENORMALIZEDOPERAND 


ZERODIVIDE 


OVERFLOW 


UNDERFLOW 


PRECISION 


L..----------(RESERVED) 


L..------------INTERRUPT·ENABLE 
MASK(I) 


L..--------------PRECISION 
CONTROL(2) 


L..-----------------ROUNDING 
CONTROL(3) 


"---------------------INFINlTy 
CONTROL(4) 


(RESERVED) 


(1) Interrupt.EnableMask: 


o = Interrupts Enabled 
1 = Interrupts Disabled(Masked) 
(2) PrecisionControl: 
00 = 24bits 
01 = (reserved) 
10 = 53bits 
11 = 64bits 
(3) RoundingControl: 


00 = Roundto Nearestor Even 
01 = RoundDown(toward-00) 
10= RoundUp(toward+00) 
11= Chop(TruncateTowardZero) 
(4) Infinity Control: 


o = Projective 
1 = Affine 


of 
the 
tag 
word 
is to 
optimize 
the 
NDP's 


performance 
under 
certain 
circumstances 
and 


programmers 
ordinarily 
need not be concerned 


with it. 


Exception 
Pointers 


The exception pointers (see figure S-9) are pro- 
vided 
for 
user-written 
exception 
handlers. 
Whenever the 8087 executes an instruction, 
the 


CU saves the instruction address and the instruc- 
tion 
opcode 
in 
the 
exception 
pointers. 
In 


addition, 
if the instruction 
references a memory 


operand, 
the address of the operand is retained 


also. 
An 
exception 
handler 
can 
store 
these 


pointers in memory and thus obtain information 
concerning 
the' 
instruction 
that 
caused 
the 


exception. 


5.3 Computation 
Fundamentals 


This section covers 8087 programming 
concepts 


that are common to all applications. 
It describes 


the 8087's internal number system and the various 
types of numbers that can be employed in NDP 
programs. The most commonly used options for 
rounding, precision and infinity (selected by fields 
in 
the 
control 
word) 
are 
described, 
with 


exhaustive 
coverage 
of 
less 
frequently 
used 


facilities deferred to section S.9. Exception condi- 
tions which may arise during execution of NDP 
instructions 
are also described 
along with the 


options that are available for responding to these 
exceptions. 


Tag values: 
00 = Valid (Normal or Unnormal) 
01 = Zero (True) 
10 = Special (Not-A-Number, 
"'. or Denormal) 
11 = Empty 


r 
I 
OPERAND ADDRESS(1) 
I INSTRUCTION OPCODE(2) 
I 
INSTRUCTION ADDRESS(1) 


10 
0 


(1) 20-bit physical address 


(2) 11 least significant 
bits of opcode; 5 most significant 
bits are always 8087 hook (11011B) 


The system of real numbers that people use for 
pencil 
and 
paper 
calculations 
is conceptually 
infinite and continuous. 
There is no upper or 
lower limit to the magnitude of the numbers one 
can employ in a calculation, 
or to the precision 
(number of significant digits) that the numbers 
can 
represent. 
When 
considering 
any 
real 
number, there are always an infinity of numbers 
both larger and smaller. There is also an infinity 
of numbers between (i.e., with more significant 
digits than) any two real numbers. For example, 
between 2.5 and 2.6 are 2.51, 2.5897, 2.5ססOO1, 
etc. 


While ideally it would be desirable for a computer 
to be able to operate on the entire real number 
system, in practice 
this is not possible. 
Com- 
puters, 
no matter 
how large, 
ultimately 
have 
fixed-size registers and memories that limit the 
system of numbers that can be accommodated. 
These limitations proscribe both the range and the 
precision 
of numbers. 
The result 
is a set of 
numbers that is finite and discrete, rather than 
infinite and continuous. This sequence is a subset 
of the real numbers which is designed to form a 
useful approximation of the real number system. 


Figure SolO superimposes 
the basic 8087 real 
number system on a real number line (decimal 
numbers are shown for clarity, although the 8087 
actually represents numbers in binary). The dots 
indicate the subset of real numbers the 8087 can 
represent as data and final results of calculations. 
The 8087's range is approximately 
±4.19xlO-307 


to ±1.67x10308. 
Applications that are required to 
deal with data and final results outside this range 
are rare. By comparison, 
the range of the IBM 
370 is about ±0.54xlO-78 to ±0.72x1076• 


The finite spacing in figure SolO illustrates that 
the NDP can represent a great many, but not all, 
of the real numbers in its range. There is always a 
"gap" 
between two "adjacent" 
8087 numbers, 
and it is possible for the result of a calculation to 
fall in this space. When this occurs, the NDP 
rounds the true result to a number that it can 
represent. Thus, a real number that requires more 
digits than the 8087 can accommodate 
(e.g., a 20 
digit number) is represented 
with some loss of 
accuracy. Notice also that the 8087's represent- 
able numbers are not distributed evenly along the 
real number line. There are, in fact, an equal 
number of representable 
numbers between suc- 
cessive powers 
of 2 (i.e., 
there 
are as many 
representable 
numbers 
between 
2 
and 
4 
as 
between 
65,536 and 
131,072). 
Therefore, 
the 
"gaps" 
between 
representable 
numbers 
are 
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"larger" 
as the numbers increase in magnitude. 
All integers 
in the range 
±264• 
however. 
are 
exactly representable. 


In 
its internal 
operations, 
the 
8087 actually 
employs a number 
system that is a substantial 
super set of that shown in figure S-10. The internal 
format (called temporary real) extends the 8087's 
range to about ±3AxlO-4932 
to ± 1.2x104932 , and 
its precision 
to about 
19 (equivalent 
decimal) 
digits. This format is designed to provide extra 
range 
and 
precision 
for 
constants 
and 
intermediate results, and is not normally intended 
for data or final results. 


From a practical standpoint, 
the 8087's set of real 
numbers is sufficiently "large" 
and "dense" 
so 
as not to limit the vast majority of microprocessor 
applications. 
Compared 
to 
most 
computers, 
including mainframes, 
the NDP provides a very 
good approximation 
of the real number system. It 
is important to remember, however, that it is not 
an exact representation, 
and that arithmetic on 
real numbers is inherently approximate. 


Conversely, and equally important, 
the 8087 does 
perform exact arithmetic on its integer subset of 
the reals. That is, an operation 
on two integers 


returns an exact integral result, provided that the 
true result is an integer and is in range. For exam- 
ple, 4+2 yields an exact integer, 1+3 does not, and 
240 x 230 + 1 does not, because the result requires 
greater than 64 bits of precision. 


The 8087 recognizes seven numeric data types, 
divided into three classes: binary integers, packed 
decimal integers, and binary reals. Section SA 
describes how these formats are stored in memory 
(the sign is always located in the highest- address- 
ed byte). Figure S-l1 summarizes the format of 
each data type. In the figure, the most significant 
digits of all numbers (and fields within numbers) 
are the leftmost 
digits. Table S-2 provides the 
range and number of significant (decimal) digits 
that each format can accommodate. 
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NOTES: 
S = Sign bit (0= positive, 1= negative) 
dn = Decimal digit (two per byte) 
X 
= Bits have no significance; 
8087 ignores when loading, zeros when storing. 


, 
= Position of implicit binary point 
I 
= Integer bit of significand; 
stored in temporary real, implicit in short and long real 
Exponent Bias (normalized values): 


Short Real: 127 (7FH) 
Long Real: 1023 (3FFH) 
Temporary Real: 16383 (3FFFH) 


are 0). The 8087 word integer format is identical 
to the 16-bit signed integer data type of the 8086 
and 8088. 
' 
The three binary 
integer formats 
are identical 


except for length, which governs' the range that 
can be accommodated 
in each format. The left- 
most 
bit 
is 
interpreted 
as 
the 
number's 


sign: O=positive 
and 
I=negative. 
Negative 


numbers are represented in standard two's com- 
plement notation (the binary integers are the only 
8087 format to use two's complement). The quan- 
tity zero is represented with a positive sign (all bits 


Decimal integers are stored 
in packed decimal 
notation, with two decimal digits "packed" 
into 
each byte, except the leftmost byte, which carries 
the sign bit (0 = positive, I = negative). Negative 


numbers are not stored in two's complement form 
and are distinguished from positive numbers only 
by the sign bit. The most significant digit of the 
number is the leftmost digit. All digits must be in 
the range OH-9H. 


The 8087 stores real numbers 
in a three-field 
binary 
format 
that 
resembles 
scientific, 
or 
exponential, 
notation. 
The number's 
significant 
digits 
are 
held 
in 
the significand 
field, 
the 


exponent field locates the binary point within the 
significant digits (and therefore 
determines 
the 


number's magnitude), and the sign field indicates 
whether the number is positive or negative. (The 
exponent and significand 
are analogous 
to the 


terms "characteristic" 
and "mantissa" 
used to 
describe floating point numbers on some com- 
puters.) 
Negative numbers 
differ from positive 
numbers only in their sign bits. 


Table S-4 shows how the real number 
178.125 


(decimal) is stored in the 8087 short real format. 
The table lists a progression of equivalent nota- 
tions that express the same value to show how a 
number 
can be converted 
from 
one form 
to 


another. 
The ASM-86 and PL/M-86 
language 


translators 
perform 
a similar 'process when they 


encounter programmer-defined 
real number con- 


stants. Note that not every decimal fraction has 
an exact binary equivalent. The decimal number 
1110, for example, cannot be expressed exactly 
in binary 
(just as the number 
1/3 
cannot 
be 


expressed exactly in decimal). When a translator 
encounters such a value, it produces a rounded 
binary approximation 
of the decimal value. 


The NDP usually carries the digits of the signifi- 
cand in normalized form. This means that, except 
for the value zero, the significand is an integer 
and a fraction as follows: 


where fJ. indicates an assumed binary point. The 
number of fraction bits varies according to the 
real format: 
23 for short, 52 for long and 63 for 


temporary real. By normalizing real numbers so 
that their integer bit is always a 1, the 8087 
eliminates leading zeros in small values (Ixl < 1). 
This technique maximizes the number of signifi- 
cant 
digits 
that 
can 
be 
accommodated 
in a 
significand of a given width. 
Note that in the 
short and long real formats 
the integer bit is 


implicit and is not actually stored; the integer bit 
is physically present in the temporary real format 
only. 


If one were to examine only the significand with 
its assumed 
binary 
point, 
all normalized 
real 


numbers would have values between 1 and 2. The 
exponent field locates the actual binary point in 
the significant digits. Just as in decimal scientific 
notation, 
a positive exponent 
has the effect of 


moving 
the binary 
point 
to the 
right 
and 
a 


negative exponent 
effectively moves the binary 


point 
to 
the 
left, 
inserting 
leading 
zeros 
as 


necessary. 
An 
unbiased 
exponent 
of 
zero 


Ordinary 
Decimal 


Scientific 
Decimal 


Scientific 
Binary 


Scientific 
Binary 
(Biased 
Exponent) 


178.125 


1tJ8125E2 


1/\0110010001 E111 


1/\ 0110010001 E10000110 


8087 Short 
Real 
(Normalized) 


Biased 
Exponent 


01100100010000000000000 
L 1/\ (implicit) 
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point. The exponent field, then, determines a real 
number's magnitude. 


In order 
to simplify comparing 
real numbers 
(e.g., for sorting), the 8087 stores exponents in a 
biased form. This means that a constant is added 
to the true exponent 
described above. The value 
of this bias is different for each real format (see 
figure S-ll). It has been chosen so as to force the 
biased exponent 
to be a positive 
value. 
This 
allows two real numbers (of the same format and 
sign) to be compared 
as if they are unsigned 
binary integers. That is, when comparing 
them 
bitwis~ from left to right (beginning with the left- 
most exponent 
bit), the first bit position 
that 
differs orders the numbers; 
there is no need to 
proceed further with the comparison. A number's 
true exponent can be determined simply by sub- 
tracting the bias value of its format. 


The short and long real formats exist in memory 
only. If a number 
in one of these formats 
is 


loaded 
into 
a 
register, 
it 
is 
automatically 
converted to temporary real, the format used for 
all internal operations. Likewise, data in registers 
can be converted to short or long real for storage 
in memory. The temporary 
real format may be 


used 
in 
memory 
also, 
typically 
to 
store 


intermediate 
results 
that 
cannot 
be 
held 
in 


registers. 


Most applications should use the long real form 
to store real number data and results; it provides 
sufficient range and precision to return correct 
results with a minimum of programmer attention. 
The short real format is appropriate 
for applica- 


tions that 
are constrained 
by memory, 
but it 


should be recognized that this format provides a 
smaller margin of safety. 
It is also useful for 


debugging algorithms because roundoff problems 
will manifest themselves more quickly in this for- 
mat. The temporary real format should normally 
be reserved for holding intermediate results, loop 
accumulations, 
and constants. 
Its extra length is 


designed to shield final results from the effects 
of rounding 
and overflow/underflow 
in inter- 


mediate calculations. 
When the temporary 
real 


format 
is used to hold data or to deliver final 


results, the safety features built into the 8087 are 
compromised. 
Furthermore, 
the range and preci- 


sion of the long real form are adequate for most 
microcomputer applications. 


Besides being 
able 
to 
represent 
pOSItive and 


negative numbers, the 8087 data formats may be 
used to describe 
other 
entities. 
These 
special 


values provide extra flexibility but most users do 
not need to understand 
them in detail to use the 


8087 successfully. Accordingly, they are discussed 
here only briefly; expanded coverage, including 
the bit encoding of each value, is provided 
in 


section S.9. 


The value zero may be signed positive or negative 
in the real and decimal integer formats; the sign 
of a binary integer zero is always positive. The 
fact 
that 
zero 
may 
be 
signed, 
however, 
is 


transparent to the programmer. 


The real number formats allow for the representa- 
tion of the special values +00 and 
-00. 
The 8087 


may generate these values as its built-in response 
to exceptions such as division by zero, or the 
attempt to store a result that exceeds the upper 
range limit of the destination 
format. 
Infinities 


may participate 
in arithmetic 
and comparison 


operations, and in fact the processor provides two 
different 
conceptual 
models for handling these 


special values. 


If a programmer attempts an operation for which 
the 8087 cannot deliver a reasonable result, it will, 
at the programmer's 
discretion, either request an 


interrupt, 
or return- the special value indefinite. 


Taking the square root of a negative number is an 
example of this type of invalid operation. 
The 


recommended 
action in this situation is to stop 


the computation 
by trapping 
to a user-written 


exception handler. If, however, the programmer 
elects to continue the computation, 
the specially 


coded indefinite value will propagate through the 
calculation and thus flag the erroneous computa- 
tion when it is eventually delivered as the result. 
Each format has an encoding that represents the 
special value indefinite. 


In the real formats, 
a whole range of special 


values, both positive and negative, is designated 
to represent a class of values called NAN (Not-A- 
Number). 
The 
special 
value 
indefinite 
is a 


reserved NAN encoding, but all other encodings 
are made available to be defined in any way by 
application software. Using a NAN as an operand 
raises the invalid operation 
exception, 
and can 


trap to a user-written routine to process the NAN. 
Alternatively, 
the 
8087's 
built-in 
exception 


RC Field 
Rounding 
Mode 
Rounding 
Action 


00 
Round to nearest 
Closertob 
ofa ore; 
if 
equally close, select even 
number (the one whose 
least significant bit is zero). 


01 
Round down (toward -00) 
a 


10 
Round up (toward + 00) 
e 


11 
Chop (toward 0) 
Smaller in magnitude of 
a ore 


handler will simply return the NAN itself as the 
result 
of 
the 
operation; 
in this 
way NANs, 


including indefinite, 
may be propagated 
through 


a calculation 
and delivered as a final, special- 


valued, result. One use for NANs is to detect 
uninitialized variables. 


As mentioned 
earlier, the 8087 stores non-zero 


real numbers 
in "normalized 
floating 
point" 
form. It also provides for storing and operating 
on reals that 
are not normalized, 
i.e., 
whose 
significands contain one or more leading zeros. 
Nonnormals arise when the result of a calculation 
yields a value that is too small to be represented in 
normal form. The leading zeros of non normals 
permit smaller numbers to be represented, at the 
cost of some lost precision (the number of signifi- 
cant digits is reduced by the leading zeros). In 
typical algorithms, 
extremely 
small values are 
most likely to be generated as intermediate, rather 
than final results. By using the NDP's temporary 
real format for holding intermediates, 
values as 
small as ±3.4xlO-4932 
can be represented; 
this 
makes the occurrence of nonnormal 
numbers a 
rare phenomenon 
in 8087 applications. 
Never- 
theless, the NDP can load, store and operate on 
nonnormalized real numbers. 


Internally, 
the 
8087 employs 
three 
extra 
bits 
(guard, round and sticky bits) which enable it to 
represent the infinitely precise true result of a 
computation; 
these 
bits are not 
accessible to 
programmers. 
Whenever 
the 
destination 
can 
represent 
the infinitely 
precise true result, 
the 
8087 delivers it. Rounding 
occurs in arithmetic 
and store operations 
when the format 
of the 


destination cannot exactly represent the infinitely 
precise true result. For example, a real number 
may be rounded if it is stored in a shorter real for- 
mat, or in an integer format. 
Or, the infinitely 


precise true result may be rounded 
when it is 


returned to a register. 


The NDP has four rounding modes, selectable by 
the RC field in the control word (see figure S-7). 
Given a true result b that cannot be represented 
by the target data type, the 8087 determines the 
two representable 
numbers 
a and e that most 


closely bracket b in value (a < b < c). The pro- 
cessor then rounds 
(changes) b to a or to c 


according to the mode selected by the RC field as 
shown in table S-5. Rounding introduces an error 
in a result that is less than one unit in the last 
place to which the result is rounded. 
"Round 
to 


nearest" 
is the default mode and is suitable for 


most applications; 
it provides the most accurate 


and statistically 
unbiased 
estimate 
of the true 


result. The "chop" 
mode is provided for integer 


arithmetic applications. 


"Round 
up" 
and 
"round 
down" 
are termed 


directed rounding and can be used to implement 
interval arithmetic. 
Interval arithmetic generates 


a certifiable result independent of the occurrence 
of rounding 
and other errors. 
The upper and 


lower bounds of an interval may be computed by 
executing an algorithm twice, rounding up in one 
pass and down in the other. 


The 8087 allows results to be calculated with 64, 
53, or 24 bits of precision as selected by the PC 
field of the control word. The default setting, and 


the one that is best-suited for most applications, is 
the full 64 bits. The other settings are required by 
the proposed IEEE standard, and are provided to 
obtain 
compatibility 
with the specifications 
of 


certain existing programming languages. Specify- 
ing less precision nullifies the advantages of the 
temporary real format's extended fraction length, 
and does not improve 
execution 
speed. When 


reduced precision is specified, the rounding of the 
fraction zeros the unused bits on the right. 


The 8087's system of real numbers may be closed 
by either of two models of infinity. These two 
means of closing the number system, projective 
and affine closure, are illustrated schematically in 
figure S-12. The setting of the IC field in the con- 
trol word selects one model or the other. 
The 


default means of closure is projective, and this is 
recommended for most computations. 
When pro- 


jective closure is selected, the NDP 
treats the 


special values +00 and -00 as a single unsigned 
infinity (similar to its treatment of signed zeros). 
In the affine mode the NDP respects the signs of 
+00and -00. 


While affine mode may provide more informa- 
tion than projective, there are occasions when the 
sign may in fact represent misinformation. 
For 


example, 
consider an algorithm 
that yields an 


intermediate 
result x of +0 and -0 
(the same 


numeric 
value) in different 
executions. 
If l/x 
were then computed in affine mode, two entirely 
different values (+00 and -(0) would result from 
numerically 
identical 
values 
of 
x. 
Projective 


mode, on the other hand, provides less informa- 
tion but never returns misinformation. 
In general, 


then, projective mode should be used globally, 


I 
a 
AFFINE CLOSURE 


with affine mode reserved for local computations 
where the programmer can take advantage of the 
sign and knows for certain that the nature of the 
computation will not produce a misleading result. 


During 
the 
execution 
of 
most 
instructions, 


the 8087 checks 
for 
six classes of exception 


conditions. 


The 8087 reports invalid operation 
if any of the 


following occurs: 


• 
An attempt 
to load a register that is not 


empty, (e.g., stack overflow), 


• 
An attempt 
to pop 
an operand 
from 
an 


empty register (e.g., stack underflow), 


• 
An operand is a NAN, 
• 
The 
operands 
cause 
the operation 
to be 


indeterminate (0/0, square root of a negative 
number, etc.). 


An invalid operation 
generally indicates a pro- 


gram error. 


If the exponent of the true result is too large for 
the destination 
real 
format, 
the 
8087 signals 


overflow. Conversely, a true exponent that is too 
small to be represented results in the underflow 
exception. If either of these occur, the result of 
the operation is outside the range of the destina- 
tion real format. 


Typical algorithms 
are most likely to produce 


extremely large and small numbers in the calcula- 
tion of intermediate, 
rather than final, results. 


Because of the great range of the temporary real 
format (recommended as the destination 
format 


for intermediates), 
overflow and underflow 
are 


relatively rare events in most 8087 applications. 


If division of a finite non-zero operand by zero is 
attempted, 
the 
8087 
reports 
the 
zerodivide 


exception. 


If an instruction attempts to operate on a denor- 
mal, 
the 
NDP 
reports 
the 
denormalized 


exception. This exception is provided 
for users 


who wish to implement, in software, an option of 
the proposed IEEE standard which specifies that 
operands must be prenormalized 
before they are 


used. 
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rounds 
the number 
and reports 
the precision 
S.9 contains a complete descnptlon 
01 
all 
I:J<I,;CIJ- 
exception. This exception occurs frequently and 
tion conditions and the NDP's masked responses. 


indicates 
that 
some 
(generally 
acceptable) 
accuracy has been lost; it is provided for applica- 
tions that need to perform exact arithmetic only. 


Invalid operation, 
zerodivide, and denormalized 
exceptions 
are 
detected 
before 
an 
operation 
begins, while overflow, underflow, and precision 
exceptions are not raised until a true result has 
been computed. 
When a "before" 
exception is 
detected, the register stack and memory have not 
yet been updated, and appear as if the offending 
instruction 
has 
not 
been 
executed. 
When 
an 


"after" 
exception is detected, the register stack 
and memory appear as if the instruction has run 
to 
completion, 
i.e., 
they 
may 
be 
updated. 


(However, in a store or store and pop operation, 
unmasked 
over/underfiow 
is handled 
like 
a 


"before" 
exception; memory is not updated and 


the stack is not popped.) In cases where multiple 
exceptions arise simultaneously, 
one exception is 


signalled according to the following precedence 
sequence: 


• 
Denormalized (if unmasked), 


• 
Invalid operation, 


• 
Zerodivide, 
• 
Denormalized (if masked), 


• 
Over/underflow, 


• 
Precision. 


(The 
terms 
"masked" 
and 
"unmasked" 
are 
explained shortly.) This means, for example, that 
zero divided by zero will result in an invalid 
operation and not a zerodivide exception. 


The 8087 reports an exception by setting the cor- 
responding flag in the status word to 1. It then 
checks the corresponding 
exception mask in the 
control word to determine if it should "field" 
the 
exception (mask=!), 
or if it should issue an inter- 
rupt request to invoke a user-written 
exception 
handler (mask=O). In the first case, the exception 
is said to be masked (from user software) and the 
NDP executes its on-chip masked response 
for 
that exception. In the second case, the exception 
is unmasked, 
and 
the processor 
performs 
its 
unmasked response. The masked response always 
produces a standard result and then proceeds with 
the instruction. 
The unmasked 
response always 
traps to user software by interrupting 
the CPU 


Note that when exceptions are masked, the NDP 
may detect multiple exceptions in a single instruc- 
tion, since it continues executing the instruction 
after 
performing 
its 
masked 
response. 
For 


example, the 8087 could detect a denormalized 
operand, 
perform 
its masked 
response 
to this 


exception, and then detect an underflow. 


By writing different 
values into the exception 


masks of the control word, the user can accept 
responsibility for handling exceptions, or delegate 
this to the NDP. Exception handling software is 
often difficult to write, and the 8087's masked' 
responses have been tailored to deliver the most 
"reasonable" 
result 
for 
each 
condition. 
The 


majority of applications will find that masking all 
exceptions other than invalid operation will yield 
satisfactory 
results with the least programming 


investment. 
An 
invalid 
operation 
exception 


normally indicates a fatal error in a program that 
must 
be corrected; 
this exception 
should 
not 


normally be masked. 


The exception 
flags are "sticky" 
and can be 
cleared 
only 
by executing 
the 
FCLEX 
(clear 


exceptions) instruction, 
by reinitializing the pro- 


cessor, 
or 
by 
overwriting 
the 
flags 
with 
an 


FRSTOR or FLDENV 
instruction. 
This means 


that the flags can provide a cumulative record of 
the exceptions encountered in a long calculation. 
A program can therefore mask all exceptions (ex- 
cept, 
typically, 
invalid 
operation), 
run 
the 


calculation and then inspect the status word to see 
if any exceptions were detected at any point in the 
calculation. Note that the 8087 has another set of 
internal exception flags that it clears before each 
instruction. 
It is these flags and not those in the 


status word that actually trigger the 8087's excep- 
tion response. The flags in the status word pro- 
vide a cumulative 
record of exceptions for the 


programmer only. 


If the NDP executes an unmasked response to an 
exception, 
it is assumed 
that a user exception 


handler will be invoked via an interrupt from the 
8087. The 8087 sets the IR (interrupt request) bit 
in the status word, but this, in itself, does not 
guarantee 
an 
immediate 
CPU 
interrupt. 
The 


interrupt 
request may be blocked 
by the IEM 


(interrupt-enable 
mask) in the 8087 control word, 


If one operand is NAN, return it; if 
both are NANs, return 
NAN with 
larger absolute value; if neither is 
NAN, return indefinite. 


Return 
00 signed with "exclusive 
or" of operand signs. 


Memory operand: 
proceed as 
usual. Register 
operand: 
convert 
to valid unnormal, then re-evaluate 
for exceptions. 


Return properly signed 
00. 


Invalid 
Operation 


Register 
destination: 
adjust 


exponent', 
store 
result, 
request 


interrupt. 
Memory 
destination: 


request interrupt. 


Register 
destination: 
adjust 
exponent', 
store 
result, 
request 


interrupt. 
Memory 
destination: 


request interrupt. 


Return 
rounded 
result, 
request 
interrupt. 


'On overflow, 24,576decimal is subtracted from the true result's exponent; this forces the exponent back 


into range and permits a user exception handler to ascertain the true result from the adjusted result that 
is returned. On underflow, the same constant is added to the true result's exponent. 


by the 
8259A Programmable 
Interrupt 
Con- 
troller, or by the CPU itself. If any exception flag 
is unmasked, 
it is imperative 
that the interrupt 
path to the CPU is eventually cleared so that the 
user's software can field the exception and the 
offending 
task can resume execution. 
Interrupts 
are covered in detail in section S.6. 


A user-written exception handler takes the form 
of an 8086/8088 
interrupt 
procedure. 
Although 
exception 
handlers 
will vary widely from 
one 
application 
to the next, most will include these 
basic steps: 


• 
Store the 8087 environment 
(control, status 
and 
tag ·words, 
operand 
and 
instruction 
pointers) 
as it existed at the time of the 
exception; 


• 
Clear the exception bits in the status word; 


• 
Enable interrupts on the CPU; 


• 
Identify 
the 
exception 
by exammmg 
the 
status 
and 
control 
words 
in 
the 
saved 
environment; 


• 
Take application-dependent 
action; 


• 
Return to the point of interruption, 
resuming 


normal execution. 


Possible 
"application-dependent 
actions" 


include: 


• 
Incrementing an exception counter for later 
display or printing; 


• 
Printing 
or 
displaying 
diagnostic 


information 
(e.g., the 8087 environment and 


registers); 


• 
Aborting further execution of the calculation 
causing the exception; 


• 
Aborting all further execution; 


• 
Using the exception 
pointers 
to build an 


instruction 
that will run without exception 


and executing it. 


• 
Storing a diagnostic value (a NAN) in the 
result and continuing with the computation. 


Notice that an exception mayor 
may not con- 


stitute an error depending on the application. For 
example, an invalid operation caused by a stack 
overflow 
could 
signal an ambitious 
exception 


handler to extend the register stack to memory 
and continue running. 


The 8087 can access any location 
in its host 


CPU's megabyte memory space. Because it relies 


on the CPU to generate the addresses of memory 
operands, 
the NDP can take advantage 
of the 


CPU's memory addressing modes and its ability 
to relocate code and data during execution. 


Data Storage 


Figures S-13 and S-14 show how the 8087 data 
types are stored in memory. The sign bit is always 
located in the highest-addressed 
byte. The least 
significant binary or decimal digits in a number 
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(or in a field in the case of reals) are those with the 
lowest addresses. 
The word 
integer 
format 
is 
stored exactly like an 
8086/8088 
l6-bit 
signed 
integer, 
and 
is directly 
usable by instructions 
executed on either the CPU or the NDP. 


A few special instructions access memory to load 
or store formatted 
processor 
control 
and state 
data. The formats of these memory operands are 
provided with the discussions of the instructions 
in section S.7. 


The host CPU always generates the address of the 
first 
(lowest-addressed) 
byte 
of 
a 
memory 
operand. The CPU interprets an 8087 instruction 
that references memory as an ESC (escape), and 
generates 
the operand's 
effective and physical 
addresses normally as discussed in section 2.3. 
Any 
8086/8088 
memory 
addressing 
mode- 
direct, register indirect, based, indexed or based 
indexed-can 
be used to access an 8087 operand 
in memory. This makes the NDP easy to use with 
data structures 
su"ch as arrays, 
structures, 
and 
lists. 


When the CPU emits the 20-bit physical address 
of the memory operand, 
the 8087 captures the 
address 
and 
saves it. If the instruction 
loads 
information 
into the NDP, the 8087 captures the 
lowest-addressed word when it becomes available 
on the bus as a result of the CPU's 
"dummy 
read." 
(The "dummy 
read" 
may require either 
one or two bus cycles depending on the CPU type 
and the alignment of the operand.) If the operand 
is longer than one word (all 8087 operands are an 
integral number of words), the 8087 immediately 
requests use of the local bus by activating its CPU 
requestl grant 
(RQ/GTO) 
line, as described 
in 
section S.6. When the NDP obtains the bus; it 
runs 
consecutive 
bus 
cycles incrementing 
the 
saved address until the rest of the operand has 
been obtained, returns the local bus to the CPU, 
and then executes the instruction. 


If an operation 
stores data 
from ~he NDP to 
memory, the NDP and the CPU both ignore the 
data placed on the bus by the CPU's 
"dummy 
read." 
The NDP does not request the bus from 
the CPU until it is ready to write the result of the 
instruction to memory. When it obtains the bus, 
the NDP writes the operand 
in successive bus 
cycles, incrementing 
the saved address as in a 
load. 


As described in section S.6, the 8087 automati- 
cally determines 
the identity of its host CPU. 
When the NDP is wired to an 8088, it transfers 
one byte per bus cycle in the same manner as the 
CPU. When used with an 8086, the NDP again 
operates like the CPU, accessing odd-addressed 
words in two bus cycles and even-addressed words 
in one bus cycle. If the 8087 is reading or writing 
more than one word of an odd-addressed operand 
in 8086 memory, 
it optimizes 
the transfer 
by 
accessing a byte on the first transfer, 
forcing the 
address to even, and then transferring 
words up 
to the last byte of the operand. 


To minimize operand transfer time and 8087 use 
Qfthe system bus, it is advantageous to align 8087 
memory operands 
on even addresses 
when the 
CPU is an 8086. Following the same practice for 
8088-based systems will ensure top performance 
without 
reprogramming 
if 
the 
application 
is 
transferred to an 8086. The ASM-86 EVEN direc- 
tive can be used to force word alignment. 


Since the host CPU takes care of both instruction 
fetching and memory 
operand 
addressing, 
the 
NDP may be utilized in systems that alter pro- 
gram 
addresses 
during 
execution. 
The 
only 
restriction on the CPU is that it should not change 
the address of an 8087 operand while the 8087 is 
executing an instruction 
which stores a result to 
that address. If this is done, the 8087 will store to 
the operand's 
old address (the one it picked up 
during the "dummy read"). 


Dedicated 
and Reserved 
Memory Locations 


The 
8087 does 
not 
require 
any 
addresses 
in 
memory to be set aside for special purposes. Care 
should 
be 
taken, 
however, 
to 
respect 
the 
dedicated and reserved areas associated with the 
CPU and the lOP 
(see sections 2.3 and 3.3). 


Using any of these areas may inhibit compatibility 
with current or future Intel hardware 
and soft- 
ware products. 


As a coprocessor to an 8086 or 8088 CPU, the 
NDP 
is 
by 
definition 
always 
used 
in 
a 
multiprocessing 
environment. 
This 
section 


describes the facilities built into the 8087 that 
simplify the coordinaton 
of multiple process'or 
systems. Included are descriptions of instruction 
synchronization, 
local and system bus arbitra- 
tion, and shared resource access control. 


In the execution of a typical NDP instruction, the 
CPU will complete the ESC long before the 8087 
finishes its interpretation 
of the same machine 
instruction. 
For example, the NDP 
performs 
a 
square root in about 180 clocks, while the CPU 
will execute its interpretation 
of this same instruc- 
tion in 2 clocks. Upon completion of the ESC, the 
CPU will decode and execute the next instruction, 
and the NDP's CU, tracking the CPU, will do the 
same. (The NDP "executes" 
a CPU instruction 
by ignoring it). If the CPU has work to do that 
does not affect the NDP, it can proceed with a 
series of instructions while the NDP is executing 
in parallel; the NDP's CU will ignore these CPU- 
only instructions as they do not contain the 8087 
escape code. This asynchronous 
execution of the 
processors 
can 
substantially 
improve 
the 
performance 
of systems that ca,n be designed to 
exploit it. 
, 


There are two cases, however, when it is necessary 
to synchronize the execution of the CPU to the 
NDP: 


1. 
An NDP instruction that is executed by the 
NEU must not be started if the NEUis 
still 
busy executing a previous instruction. 
" 
. 
, 
, 


2. 
The CPU should not execute an instruction 
that 
accesses 
a 
memory 
operand 
being 
referenced by the NDP llntil the NDP has 
actually accessed the location. 


The 8086/8088 
WAIT instruction allows software 
to synchronize the,CPU 
to the NDP so that the 
CPU will not execute the following instruction 
until the NDP is finished with its current (if any) 
instruction. 


, 
Iii 


Whenever the 8087 is executing an instructi~~, it 
actiyates its BUSY line. This signal is wired to the 
CPU's TEST input as shown in figure S-3. The 
NDP ignores the WAIT -Instruction, and the CPU 
executes 
it. 
The 
CpU 
interprets 
the 
WAIT 
instruction as "}Vait while TEST is active." 
The 
CPU examin~s the TEST pin every 5 clocks; if 
TEST is ,inactive, execution 
proceeds 
with the 


instruction 
following 
the 
WAIT. 
If TEST 
is 
active, the CPU examines the pin again. Thus, the 
effective execution time of aWAIT 
can stretch 
from 3 clocks (3 clocks are required for decoding 
and setup) to infinity, as long as TEST remains 
active; The WAIT instruction, 
then, prevents the 
CPU from decoding the next instruction until the 
8087 is not busy. The instruction 
following a 
WAIT 
is 
decoded 
simultaneously 
by 
both 
processors. 


To satisfy the first case mentioned above, every 
8087 instruction that affects the NEU should be 
preceded by aWAIT 
to ensure that the NEU is 
ready. 
All 
instructions 
except 
the 
processor 
control class affect tlie NEU. To simplify pro- 
gramming, the 8086 family language translators 
provide 
the 
WAIT 
automatically. 
When 
an 
assembly language programmer codes: 


FMUL 


FDIV 


;(multiply) 


;(divide) 


the 
assembler 
produces 
four 
machine 
instructions, as if the programmer had written: 


WAIT 
FMUL 
WAIT 
FDIV 


This ensures that the multiply runs to completion 
before the CPU and th'e 8087 CU decode the 
divide. 


To 
satisfy 
the 
second 
case, 
the 
programmer 
should l explicitly code the FWAIT 
instruction 
immediately 
before 
a 
CPU 
instruction 
that 
accesses a memory operand read or written by a 
previous 8087 instruction. 
This will ensure tbat 
the 8087 has read or written the memory operand 
before the CPU attempts to use it. (The FWAIT 
mnemonic causes the assembler to create a CPU 
WAIT instruction that can be eliminated at link 
time if the program is to run on an 8087 emulator. 
,See section S.8 for details.) 


Figure 
S-15 
is 
a 
hypothetical 
sequence 
of 
instructions 
that 
illustrates 
the 
effect 
of 
the 
WAIT instruction 
and parallel execution of the 
NDP with a CPU. 


The first two instructions in the sequence (FMUL 
and FSQRT) are 8087 instructions that illustrate 
the ASM-86 assembler's automatic generation of 


;ASSUME 
8087 
REGISTER 
STACK 
IS 
LOADED 
WITH 
OPERANDS, 
, 
NEU 
IS 
NOT 
BUSY, 
t 


, 
AND 
THAT 
'ALPHA' 
AND 
'BETA' 
ARE 
WORD 
, 
INTEGERS. 


FSQRT 
CMP 
JG 
MOV 
CONT INUE: 
F ISoT 
FWAIT 


MOV 


ALPHA,100 
CONTINUE 
ALPHA,100 
BETA 


;MULTIPLY 
TOP 
STACK 
;ELEMENTS 
;SQUARE 
ROOT 
OF 
PRODUCT 
;ALPHA 
> 
100' 
;YES, 
LEAVE 
UNALTERED 
;NO, 
SET 
TO 
100 
_ 


;STORE 
ROOT 
AS 
INTEGER 
WORD 
;WAIT 
FOR 
8087 
TO 
COMPLETE 
;STORE 
OF 
BETA 
;PROCEED 
TO 
PROCESS 
BETA 


_F_M_UL_II 
F_SQ_R_T 
I I_F_IS_T_I 
V 
V 
'--- 


I Ese II WAIT II 
MOV 1 


NOTES: 
.[W~~= Assemblerllenerated 
Instruction . 


• Instruction 
execution 
times are not drawn to scale. 


a preceding WAIT, and the effect of the WAIT 
when the NDP is, and is not, busy. Since the NDP 
is not busy when the first WAIT is encountered, 
the CPU executes it and immediately proceeds to 
the next instruction; the NDP ignores the WAIT. 
The next instruction is decoded simultaneously by 
both processors. The NDP starts the multiplica- 
tion and raises its BUSY line. The CPU executes 
the ESC and then the second WAIT. Since TEST 
is active (it is tied to BUSY), the CPU effectively 
stretches execution of this WAIT until the NDP 
signals completion 
of the multiply by lowering 
BUSY. The next instruction 
is interpreted 
as a 
square root by the NDP and another escape by 
the CPU. The CPU finishes the ESC well before 
the NDP completes the FSQRT. This time, in- 
stead of waiting, the CPU executes three instruc- 
tions (compare, jump if greater, and move) while 
the 8087 is working on the FSQRT. 
The 8087 
ignores these CPU-only 
instructions. 
The CPU 
then encounters the third WAIT, generated by the 
assembler immediately preceding the FIST (store 
stack top into integer word). 
When the NDP 
finishes the FSQRT, both processors proceed to 
the next instruction, FIST to the NDP and ESC to 
the CPU. The CPU completes the escape quickly 
and then executes an explicit programmer-coded 
FWAIT to ensure 
that 
the 8087 has updated 
BETA before 
it moves BETA's 
new value to 
register AX. 


The 8087 CU can execute most processor control 
instructions by itself regardless of what the NEU 
is doing: thus the 8087 can, in these cases, poten- 
tially 
execute 
two 
instructions 
at 
once. 
The 
ASM-86 assembler provides separate "wait" 
and 
"no wait" mnemonics for these instructions. For 
example, the instruction that sets the 8087 inter- 
rupt enable mask, and thus disables interrupts, 
can be coded as FDISI or FNDISI. The assembler 
does not generate a preceding WAIT if the second 
form is coded, so that interrupts can be disabled 
while the NEU 
is busy executing 
a previous 
instruction. 
The no-wait 
forms 
are principally 
used in exception handlers and operating systems. 


Whenever 
an NDP 
instruction 
writes data 
to 
memory, 
or reads more than 
one word from 
memory, the NDP forces the CPU to relinquish 
the local bus. 
It does this by means 
of the 
request/grant 
facility built into all 8086 family 
processors. For memory reads, the NDP requests 
the bus immediately upon the CPU's completion 
of its "dummy 
read" 
cycle; it follows from this 
that 
the 
CPU 
may 
"immediately" 
update 
a 
variable read by the NDP in the previous instruc- 
tion with the assurance that the NDP will have 
obtained the old value before the CPU has altered 
it. For memory 
writes, the NDP 
performs 
as 


much processing as possible before requesting the 
bus. In all cases, the 8087 transfers the data in 
back-to-back 
bus cycles and then immediately 
releases the bus. 


The 8087's RQ/GTO line is wired to one of the 
CP!J.:L..request/ grant 
lines. 
Connecting 
it 
to 
RQ/GTI 
on th!SPU 
(see figure S-3) leaves the 
higher priority RQ/GTO open for possible attach- 
ment of a local 8089 to the CPU. Note that an 
8089 on RQ/GTO will obtain the bus if it requests 
it 
simultaneously 
with 
an 
8087 attached 
to 
RQ/GTl; 
it cannot, however, preempt the 8087 if 
the 8087 has the bus. The NDP requests the local 
bus by pulsing its RQ/GTO line. If the CPU has 
the bus, it will grant it to the NDP by pulsing the 
same request/grant 
line. The CPU grants the bus 
immediately unless it is running a bus cycle, in 
which case the grant is delayed until the bus cycle 
is completed. The NDP releases the bus back to 
the CPU by sending a final pulse on RQ/GTO 
when it has completed the transfer. 


The 8087 provides a second request/grant 
line, 
RQ/GTl, 
that may be used to service local bus 
requests from an 8089 Input/Output 
Processor 
(see figure S-3). By using this line, a CPU, two 
lOPs (one is attached directly to the CPU) and an 
NDP can all reside on the same local bus, sharing 
a single set of system bus interface components. 


When the 8087 detects a bus request pulse on 
RQ/GTl, 
its response depends on whether it is 
idle, executing, or running a bus cycle. If it is idle 
or executing, 
the 8087 passes the bus request 
through 
to the CPU 
via RQ/GTO. 
The sub- 
sequent grant and release pulses are also passed 
between the CPU and the requesting device. If the 
8087 is running a bus cycle (or a series of bus 
cycles), it has already obtained the bus from the 
CPU so it grants the bus directly at the end of the 
current bus cycle rather than passing the request 
on to the CPU. When the 8089 releases the bus, 
the 8087 resumes the series of bus cycles it was 
running before it granted 
the bus to the 8089. 
Thus, to an 8089 attached to the 8087's RQ/GTI 
line, the NDP appears 
to be a CPU. An lOP 
attached 
to an NDP also effectively has higher 
local bus priority than the NDP, since it can force 
the NDP to relinquish the bus even in the midst of 
a multi-cycle transfer. 
This satisfies the typical 
system requirement 
for I/O transfers to be ser- 
viced as soon as possible. 


A single 8288 Bus Controller 
(plus latches and 
tranceivers as required) links both the host CPU 
and the NDP to the system bus. The 8087 per- 
forms system bus transfers exactly the same as its 
CPU; status, address, and data signals and timing 
are identical. 


In systems that allow multiple processing modules 
on separate local buses common access to a public 
system bus, the 8087 also shares its host CPU's 
8289 Bus Arbiter. The 8289 operates identically 
regardless of whether the system bus request is 
initiated by the CPU or the NDP. Since only one 
of the processors in the module will have control 
of the local bus at the time of a request to access 
the system bus, the transfer will be between the 
controlling proce.ssor and the system bus. If the 
8289 does not obtain the system bus immediately, 
it causes the bus to appear "not ready" 
(as if a 
slow memory were being accessed), and the 8087 
will stretch the bus cycle by adding the wait states. 


Because it presents the same system bus interface 
as a maximum mode 8086 family CPU, the NDP 
is 
also 
electrically 
compatible 
with 
Intel's 
Multibus™ shared system bus architecture. 
This 
means that the 8087 can be utilized in systems that 
are based on the broad 
line of iSBCTMsingle 
board computers, controllers, and memories. 


If an 8087 and a processor other than its host 
CPU can both update a variable, access to that 
variable 
should 
be 
controlled 
so 
that 
one 
processor at a time has exclusive rights to it. This 
may be implemented by a semaphore convention 
as described in section 2.5. However, 
since the 
8087 has no facility for locking the system bus 
during 
an 
instruction, 
the 
host 
CPU 
should 
obtain exclusive rights to the variable before the 
8087 accesses it. This can be done using an XCHG 
instruction 
prefixed 
by LOCK as discussed in 
section 2.5. When the NDP no longer needs the 
controlled 
variable 
the CPU 
should 
clear the 
semaphore 
to signal other 
processors 
that 
the 
variable is again available for use. 


The NDP may be initialized by hardware or soft- 
ware. Hardware initialization 
occurs in response 


to a pulse on the 8087's RESET line. When the 
processor detects RESET going active, it suspends 
all activities. 
When RESET 
subsequently 
goes 
inactive, the NDP initializes itself. The state of 
the NDP following initialization is shown in table 
S-7. Hardware initialization also causes the 8087 
to identify its host CPU and begin to track its 
instruction 
fetches and execution. 
Initialization 


does not affect the content of the registers or of 
the exception pointers (these have indeterminate 
values 
immediately 
following 
power 
up). 
However, since the stack is effectively emptied by 
initialization (ST = 0, all registers tagged empty), 
the contents of the registers should normally be 
considered "destroyed" 
by initialization. 


"',,, 
U"LlUU~ alSO inItIalize the processor. Unlike a 


RESET pulse, 
software 
initialization 
does not 


affect the 8087's tracking of the CPU. 


The 8087's bidirectional 
BHE (bus high enable) 


line is tied to pin 34 of the CPU (BHE on the 
8086, SSO on the 8088). The 8088 always holds 
SSO= 1 . The 8086 emits a 0 on BHE whenever it 
is accessing an even-addressed 
word or an odd- 


addressed byte. 


Following RESET, the CPU always performs a 
word 
fetch 
of 
its 
first 
instruction 
from 
the 


dedicated memory location: 
FFFFOH. The 8087 


identifies 
its host 
CPU 
by monitoring 
BHE 


during the CPU's first fetch following RESET. If 
BHE =1, the CPU is an 8088; if BHE =0, the 
CPU is an 8086 (because the first fetch is an even- 
addressed 
word). 
Note 
that 
to ensure 
proper 


operation, 
the same pulse must reset both the 
8087 and its host CPU. 


Field 
Value 
Interpretation 


Control Word 
Infinity Control 
0 
Projective 
Rounding Control 
00 
Round to nearest 
Precision Control 
11 
64bits 
Interrupt-enable 
Mask 
1 
Interrupts disabled 


Exception Masks 
111111 
All exceptions masked 


c 


Status Word 
Busy 
0 
Not busy 
Condition Code 
???? 
(Indeterminate) 


Stack Top 
000 
Empty stack 
Interrupt Request 
0 
No interrupt 
Exception Flags 
000000 
No exceptions 


Tag Word 
Tags 
11 
Empty 


Registers 
N.C. 
Not changed 


Exception Pointers 
Instruction Code 
N.C. 
Notchanged 
Instruction Address 
N.C. 
Not changed 
Operand Address 
N.C. 
Not changed 


The 8087 can request an interrupt of its host CPU 
via the 8087 INT (interrupt 
request) pin. This 
signal is normally 
routed 
to the CPU's 
INTR 
input 
via an 
8259A Programmable 
Interrupt 
Controller (PIC). The 8087 should not be tied to 
the CPU's NMI (non-maskable interrupt) line. 


All 8087 interrupt requests originate in the detec- 
tion of an exception. The interrupt request logic is 
illustrated in figure S-16. The interrupt 
request 
is made if the exception is unmasked and 8087 
interrupts 
are enabled, 
Le., both 
the relevant 
exception mask and the interrupt-enable 
mask are 
clear (0). If the exception is masked, the processor 
executes its masked response and does not set the 
interrupt request bit. 


If the exception is unmasked 
but interrupts 
are 
disabled (!EM = 1), the 8087's action depends on 
whether the CPU is waiting (the 8087 "knows" 
if 
the CPU is waiting because it decodes the WAIT 
instruction in parallel with the CPU). If the CPU 
is not waiting, the 8087 assumes that the CPU 
does not want to be interrupted 
at present and 
that it will enable interrupts on the 8087 when it 
does. The 8087 sets the interrupt request bit and 
holds its BUSY line active. The 8087 CU con- 
tinues 
to 
track 
the 
CPU, 
and 
if 
an 
8087 
instruction 
(without a preceding WAIT) comes 
along, it will be executed. Normally in this situa- 
tion the instruction 
would be FNENI 
(enable 
interrupts 
without 
waiting). This will clear the 
interrupt-enable 
mask 
and 
the 8087 will then 
activate INT. However, 
any instruction 
will be 
executed, and it is therefore conceivably possible 
to abort the interrupt 
request before it is ever 
handled. 
Aborting 
an interrupt 
request in this 
manner, however, would normally be considered 
a program error. 


If the CPU is waiting, then the processors are in 
danger 
of entering 
an endless wait condition 
(discussed shortly). To prevent this condition, the 
8087 ignores the fact that interrupts are disabled 
and activates 
INT even though 
the interrupt- 
enable mask is set. 


The interrupt 
request bit remains set until it is 
explicitly cleared (if INT is not disabled by !EM, 
it will remain active also). This can be done by the 
FNCLEX, FNSA VE, or FNINT instructions. The 
interrupt 
procedure 
that fields the 8087's inter- 
rupt request, 
i.e., 
the exception 
handler, 
must 


interrupt will immediately be generated again and 
the program will enter an endless loop. 


Most systems can be viewed as consisting of two 
distinct classes of software: 
interrupt 
handlers 
and application tasks. Interrupt handlers execute 
in response to external events; in the 8086 family 
they 
are 
implemented 
as 
interrupt 
service 
procedures. 
(Of 
course, 
the 
CPU 
interrupt 
instructions allow interrupt 
handlers to respond 
to internal "events" 
also.) A hardware interrupt 
controller, 
such as the 8259A, usually monitors 
the external events and invokes the appropriate 
interrupt 
handler by activating 
the CPU INTR 
line, and passing a code to the CPU that identifies 
the interrupt handler that is to service the event. 
Since the 8259A typically monitors several events, 
a priority-resolving technique is used to select one 


event when several occur simultaneously. 
Many 
systems 
allow 
higher-priority 
interrupts 
to 
preempt 
lower-priority 
interrupt 
handlers. 
The 
8259A supports several priority-resolving 
techni- 
ques; a system will normally select one of these by 
programming the 8259A at initialization time. 


Application tasks execute only when no external 
event 
needs 
service, 
i.e., 
when 
no 
interrupt 
handler is running. Application tasks are invoked 
by software, 
rather 
than hardware; 
typically a 
scheduling or dispatching 
algorithm 
is used to 
select one task for execution. In effect, any inter- 
rupt handler has higher priority than any applica- 
tion task, since the recognition of an interrupt will 
invoke 
the 
interrupt 
handler, 
preempting 
the 
application task that was running. 


There are two important 
questions 
to consider 
when assigning a priority to the 8087's interrupt 
request: 


• 
Who 
can 
cause 
8087 
exceptions-only 
application 
tasks, 
or interrupt 
handlers 
as 
well? 


• 
Who 
should 
be 
preempted 
by 
NDP 
exceptions-only 
applications tasks, or inter- 
rupt handlers as well? 


Given 
these 
considerations, 
the 
8087 should 
normally be assigned the lowest priority of any 
interrupting device in the system. This allows the 
interrupt 
handler 
(Le., 
the 
NDP 
exception 
handler) 
to preempt 
any application 
task that 
generates an 8087 exception, and at the same time 
prevents 
the 
exception 
NDP 
handler 
from 
interfering with other interrupt handlers. 


If an interrupt handler uses the 8087 and requires 
the service of the exception handler, it can effec- 
tively 
"raise" 
the 
priority 
of 
the 
exception 
handler 
by disabling 
all interrupts 
lower than 
itself and higher than· the 8087. Then, any un- 
masked exception caused by the interrupt handler 
will be fielded without interference 
from lower- 
priority interrupts. 


If, for some reason, the 8087 must be given higher 
priority than another interrupt source, the inter- 
rupt 
handler 
that 
services 
the 
lower-priority 
device may want to prevent interrupts 
from the 
8087 (which may originate in a long instruction 
still running 
on the 8087 when the interrupt 
handler 
is invoked) 
from 
preempting 
it. This 


should be done by executing the FNSTCW and 
FNDISI instructions before enabling CPU inter- 
rupts. 
Before returning, 
the interrupt 
handler 
should restore the original control word in the 
8087 by executing FLDCW. 


Users should consult "Using the 8259A Program- 
mable Interrupt 
Controller", 
Intel Application 
Note No. AP-59, for a description of the 8259A's 
various modes of operation. 


The 8087 and its host CPU can enter an endless 
wait condition 
when the CPU 
is executing 
a 
WAIT instruction and a pending interrupt request 
from the 8087 is prevented from being recognized 
by the CPU. Thus, the CPU will wait for the 8087 
to lower its BUSY line, while the NDP will wait 
for the CPU 
to invoke the exception 
handler 
interrupt 
procedure, 
and 
the 
task 
which 
has 
generated 
the exception 
will be blocked 
from 
further execution. 


Figure S-17 shows the typical path of an interrupt 
request from the 8087 to the interrupt procedure 
which is design~ted to field NDP exceptions. The 
interrupt 
request can be potentially 
blocked at 
three points along the path, creating an endless 
wait if the CPU is executing aWAIT 
instruction. 
The first block can occur at the 8087's interrupt- 
enable mask (IEM). If this mask is set, the inter- 
rupt request is blocked except that the 8087 will 
override the mask if the CPU is waiting (the 8087 
decodes 
the WAU 
instruction 
simultaneously 
with 
the 
CPU). 
Thus, 
the 
8087 detects 
and 
prevents one of the endless wait conditions. 


A given interrupt request, IRn, can be masked on 
the 8259A by setting the corresponding 
bit in the 
PIC's 
interrupt 
mask register (IMR). This will 
prevent 
a request 
from 
the 
8087 from 
being 
passed to the CPU. (The 8259A's normal priority- 
resolving activity 
can also block 
an interrupt 
request.) 
Finally, 
the 
CPU 
can 
exclude 
all 
interrupts tied to INTR by clearing its interrupt- 
enable flag (IF). In these two cases, the CPU can 
"escape" 
the endless wait only if another inter- 
rupt is recognized (if IF is cleared, the interrupt 
must arrive on NMI, the CPU's 
non-maskable 
interrupt line). Following execution of the inter- 
rupt procedure and resumption of the WAIT, the 
endless wait will be entered again, unless, as part 
of its response to the interrupt it recognizes, the 
CPU clears the interrupt path from the 8087. 


A user-written exception handler l.:anitself cause 
an unending wait. When the exception handler 
starts to run, the 8087 is suspended with its BUSY 
line active, 
waiting 
for 
the 
exception 
to 
be 


cleared, and interrupts on the CPU are disabled. 
If, in this condition, the exception handler issues 
any 8087 instruction, 
other than a no-wait form, 
the result will be an unending wait. To prevent 
this, the exception handler should clear the excep- 
tion on the 8087 and enable interrupts 
on the 
CPU 
before 
executing 
any instruction 
that 
is 


preceded by aWAIT. 


More generally, an instruction that is preceded by 
a WAIT (or an FWAIT instruction) should never 
be executed when CPU interrupts 
are disabled 


and there is any possibility that the 8087's BUSY 
line is active. 


When the 8087 has control of the local bus, it 
emits signals on status lines S2-S0 to identify 
the type of bus cycle it is running. 
The 8087 


generates the restricted (compared to a CPU) set 
of encodings 
shown 
in table 
S-8. These lines 


correspond 
exactly to the signals output 
by the 


8086 and 8088 CPU's, and are normally decoded 
by an 8288 Bus Controller. 


Table S-8. Bus Cycle Status Signals 


52 
51 
50 
Type of Bus Cycle 


1 
0 
1 
Read Memory 


1 
1 
0 
Write Memory 


1 
1 
1 
Passive; no bus cycle 


Status line S7 is currently identical to BHE of the 
same bus cycle, while S4 and S3 are both currently 
1; however, these signals are reserved by Intel for 
possible future use. Status line S6 emits 1 and S5 
emits O. 


This section describes the operation of each of the 
8087's 
69 instructions. 
The 
first 
part 
of the 


section describes the function of each instruction 
in detail. For this discussion, the instructions are 
divided into six functional groups: 
data transfer, 


arithmetic, 
comparison, 
transcendental, 
con- 


stant, 
and processor 
control. 
The second part 


provides instruction attributes 
such as execution 


speed, 
bus transfers, 
and 
exceptions, 
as well 


as a coding example for each combination 
of 


operands 
accepted 
by 
the 
instruction. 
This 


information is concentrated 
in a table, organizea 


alphabetically by instruction mnemonic, for easy 
reference. 


Throughout 
this section, 
the instruction 
set is 


described as it appears to the ASM-86 program- 
mer who is coding a program. Appendix A covers 
the actual machine instruction 
encodings, which 


are principally 
of use to those reading 
unfor- 


matted memory dumps, 
monitoring 
instruction 


fetches on the bus, or writing exception handlers. 


The instruction 
descriptions 
in this section con- 
centrate 
on describing 
the normal 
function 
of 


each operation. 
Table S-19 lists the exceptions 


that can occur for each instruction and table S-32 
details the causes of exceptions 
as well as the 


8087's masked responses. 


The typical NDP instruction 
accepts one or tW( 


operands 
as "inputs", 
operates 
on these, anc 


produces a result as an "output". 
Operands are 


most often (the contents of) register or memory 
locations. The operands of some instructions are 
predefined; for example, FSQRT always takes the 
square root of the number in the top stack ele- 
ment. Others allow, or require, the programmer 
to explicitly code the operand(s) 
along with the 
instruction 
mnemonic. 
Still others 
accept 
one 
explicit operand and one implicit operand, which 
is usually the top stack element. 


Whether supplied by the programmer 
or utilized 
automatically, 
there 
are 
two 
basic 
types 
of 
operands, 
sources 
and destinations. 
A source 
operand simply supplies one of the "inputs" 
to 
an instruction; it is not altered by the instruction. 
Even when an instruction 
converts 
the source 
operand from one format to another (e.g., real to 
integer), the conversion is actually performed 
in 
an internal work area to avoid altering the source 
operand. A destination operand may also provide 
an "input" 
to an instruction. 
It is distinguished 
from a source operand, however, because its con- 
tent may be altered when it receives the result 
produced by the operation; that is, the destination 
is replaced by the result. 


Many instructions allow their operands to be cod- 
ed in more than one way. For example, FADD 
(add real) may be written without operands, with 
only a source or with a destination and a source. 
The 
instruction 
descriptions 
in 
this 
section 
employ the simple convention of separating alter- 
native operand 
forms with slashes; the slashes, 
however, are not coded. Consecutive slashes in- 
dicate an option 
of no explicit operands. 
The 
operands for FADD are thus described as: 


This means that FADD may be written in any of 
three ways: 


FADD 
FADDsource 
FADD destination, source 


When 
reading 
this 
section, 
it 
is 
important 
to bear in mind that memory operands may be 
coded with any of the CPU's memory addressing 
modes. To review these modes-direct, 
register 
indirect, based, indexed, based indexed-refer 
to 
sections 2.8 and 2.9. Table S-22 in this chapter 
also provides several addressing mode examples. 


These 
instructions 
(summarized 
in table 
S-9) 
move operands 
among elements of the register 
stack, and between the stack top and memory. 
Any of the seven data types can be converted to 
temporary 
real 
and loaded 
(pushed) 
onto 
the 
stack in a single operation; 
they can be stored to 
memory in the same manner. 
The data transfer 
instructions 
automatically 
update 
the 8087 tag 
word to reflect the register contents following the 
instruction. 


FLD (load real) loads (pushes) the source operand 
onto the top of the register stack. This is done by 
decrementing the stack pointer by one and then 
copying the content of the source to the new stack 
top. The source may be a register on the stack 
(ST(i» or-any of the real data types in memory. 
Short and long real source operands are converted 
to temporary 
real automatically. 
Coding 
FLD 
ST(O)duplicates the stack top. 


FLD 
FST 
FSTP 
FXCH 


Load real 
Store real 
Store leal and pop 
El'.change registers 


FILD 
FIST 
FISTP 


Integer load 
Integer store 
Integer store and pop 


FBLD 
FBSTP 
Packed decimal (BCD) load 
Packed decimal (BCD) store and pop 


FST (store real) transfers 
the stack top to the 
destination, which may be another register on the 
stack or a short or long real memory operand. If 
the destination 
is short or long real, the signifi- 
cand is rounded 
to the width of the destination 


according to the RC field of the control word, 
and the exponent is converted to the width and 
bias of the destination format. 


If, however, the stack top is tagged special (it con- 
tains 
00, a NAN, or a denormal) then the stack 
top's significand is not rounded but is chopped 
(on the right) to fit the destination. Neither is the 
exponent converted, but it also is chopped on the 
right and transferred 
"as is". This preserves the 
value's identification 
as 
00 or a NAN (exponent 
all ones) or a denormal 
(exponent all zeros) so 
that it can be properly loaded and tagged later in 
the program if desired. 


FSTP (store real and pop) operates' identically to 
FST except that the stack is popped following the 
transfer. 
This is done by tagging the top stack 
element empty and then incrementirig ST. FSTP 
permits 
storing 
to 
a temporary 
real memory 
variable while FST does not. Coding FSTP ST(O) 
is equivalent to popping the stack with no data 
transfer. 


1" 


FXCH (exchange registers) swapsJhe contents of 
the destination and the stack top registers. If the 
destination is not coded explicitly, ST(l) is used., 
Many 8087 instructions operate only on the stack 
top; FXCH provides a simple means of effectively 
using these instructions on lower stack elements. 
For example, the following sequence takes the 
square root of the third register from the top: 


FXCH 
8T(3) 
F8QRT 
FXCH 
8T(3) 


FILD (integer load) converts the source memory 
operand 
from its binary integer format 
(word, 
short, 
or long) 
to temporary 
real 
and 
loads 
(pushes) the result onto the stack. The (new) stack 
top is tagged zero if all bits in the" source were 
zero, and is tagged valid otherwise. 


FIST (integer store) rounds 
the cQntent of the 
stack top to an integer according to the RC field 
of the control word and transfers the result to the 
destination. The destination may define a word or 
short integer variable. Negative zero is stored in 
the same encoding as,postive zero: 0000... 00. 


FISTP (integer store and pop) operates like FIST 
and also pops the stack following the tranSfer. 
The destination may be any of the binary integer 
data types. 


FBLD (packed decimaL (BCD) load) converts the 
content 
of 
the 
source 
operand 
from 
packed 
decimal to temporary real and loads (pushes) the 
result onto the stack. The sign of the source is 
preserved, including the case where the value is 
negative zero. FBLD is an exact operation; 
the 
source is loaded with no rounding error. 


The packed 
decimal 
digits of the source 
are 
assumed to be in the range 0-9H. The instruction 
does not check for invalid digits (A-FH) and the 
result of attempting to load an invalid encoding is 
undefined. 


FBSTP (packed decimaJ. (BCD) store and pop) 
converts the content of the stack top to a packed 
decimal integer, stores the result at the destination 
in memory, and pops the stack. FBSTP produces 
a rounded integer from a non-integral 
value by 
adding 0.5 to the value and then chopping. UseI:s 
who are concerned about rounding may precede 
FBSTP with FRNDINT. 


The 8087's arithmetic instruction set (table S-lO) 
provides a wealth of variations on the basic add, 
subtract, multiply, and divide operations, 
and a 
number of other useful functions. 
These range 
from a simple absolute value to a square root 
instruction that executes faster than ordinary divi- 


Addition 


FADD 
Add real 
FADDP 
Add real and pop 


FIADD 
Integer add 
,. 


Subtraction 


FSUB 
Subtract real 


FSUBP 
Subtract real and pop 


FISUB 
Integer subtract 


FSUBR 
Subtract real reversed 


FSUBRP 
Subtract real.reversed and pop 


FISUBR 
Integer subtract reversed 


Multiplication 


FMUL 
Multiplyreal 


FMULP 
Multiply real and pop 


FIMUL 
Integer multiply 


Division 
- 
. 


FDIV 
Divide real 


FDIVP 
Divide real and pop 


FIDIV 
Integer divide 


FDIVR 
Divide real reversed 


FDIVRP 
Divide real reversed and pop 


FIDIVR 
Integer divide reversed 


Other Operations 


FSQRT 
Square root 


FSCALE 
Scale 


FPREM 
Partial remainder 


FRNDINT 
Round to integer 


FXTRACT 
Extract exponent and significand 


FABS 
Absolute value 


FCHS 
Change sign 
~ 


sion; 8087 programmers 
no longer need to spend 


valuable 
time 
eliminating 
square 
roots 
from 


algorithms 
because they run too slowly. Other 
arithmetic 
instructions 
perform 
exact 
modulo 


division, 
round 
real numbers 
to integers, 
and 


scale values by powers of two. 


The 8087's basic arithmetic 
instructions 
(addi- 


tion, 
subtraction, 
multiplication, 
and division) 


are designed to encourage 
the development 
of 


very efficient algorithms. In particular, they allow 


the programmer 
to minimize memory references 


and to make optimum 
use of the NDP register 


stack. 


Table 
S-ll 
summarizes 
the 
available 
opera- 


tion/operand 
forms that are provided for basic 


arithmetic. In addition to the four normal opera- 
tions, 
two -"reversed" 
instructions 
make 
sub- 


traction and division "symmetrical" 
like addition 


and multiplication. The variety of instruction and 
operand 
forms 
give the 
programmer 
unusual 


flexibility: 


• 
operands 
may 
be located 
in registers 
or 


memory; 


• 
results 
may 
be deposited 
in a choice 
of 
registers; 


• 
operands 
may be a variety of NDP 
data 


types: temporary 
real, long real, short real, 


short integer or word integer, with automatic 
conversion to temporary 
real performed 
by 


the 8087. 


Five basic instruction 
forms may be used across 


all six operations, 
as shown in table S-ll. 
The 


classical stack form may be used to make the 8087 
operate 
like 
a 
classical 
stack 
machine. 
No 


operands are coded in this form, only the instruc- 
tion 
mnemonic. 
The 
NDP 
picks 
the 
source 


operand from the stack top and the destination 
from the next stack element. 
It then pops the 


stack, performs 
the operation, 
and returns the 


result to the new stack top, effectively replacing 
the operands by the result. 


The register 
form 
is a generalization 
of the 


classical stack form; the programmer specifies the 
stack top as one operand and any register on the 
stack as the other operand. Coding the stack top 
as the destination 
provides a convenient way to 


access a constant, 
held elsewhere in the stack, 


from the stack top. The converse coding (ST is the 
source operand) allows, for example, adding the 
top into a register used as an accumulator. 


Often the operand in the stack top is needed for 
one operation but then is of no further use in the 
computation. 
The register pop form can be used 


to pick up the stack top as the source operand, 
and then discard it by popping the stack. Coding 
operands 
of 
ST(l),ST 
with 
a 
register 
pop 


mnemonic is equivalent to a classical stack opera- 
tion: the top is popped and the result is left at the 
new top. 


Instruction 
Form 
Mnemonic 
Operand Forms 
ASM·86 Example 
Form 
destination, 
source 


Classical stack 
Fop 
{ST(1),Sn 
FADD 


Register 
Fop 
ST(i),ST or ST,ST(i) 
FSUB 
ST,ST(3) 


Register pop 
FopP 
ST(i),ST 
FMULP 
ST(2),ST 


Real memory 
Fop 
{ST,} 
short-realllong-real 
FDIV 
AZIMUTH 


Integer memory 
Flop 
{ST,} 
word-integerlshort-integer 
FIDIV 
N_PULSES 


NOTES: Braces { 
} surround implicit operands; these are not coded, and are shown 
here for information only. 


op = ADD 
destination 
f- destination + source 
SUB 
destination 
f- destination 
- source 
SUBR destination 
f- source - destination 
MUL 
destination 
f- destination 
• source 
DIV 
destination 
f- destination 
7 source 
DIVR 
destination 
f- source 7 destination 


The two memory forms increase the flexibility of 
the 8087's arithmetic instructions. 
They permit a 
real number or a binary integer in memory to be 
used directly as a source operand. This is a very 
useful facility in situations 
where operands 
are 
not used frequently 
enough 
to justify 
holding 
them in registers. Note that any memory address- 
ing mode may be used to define these operands, 
so they may be elements in arrays, structures or 
other 
data 
organizations, 
as 
well 
as 
simple 
scalars. 


The six basic operations are discussed further in 
the next paragraphs, 
and 
descriptions 
of the 
remaining seven arithmetic operations follow. 


Addition 
FADD 
FADDP 
FIADD 


/ /source/destination,source 
destination, source 
source 


The addition instructions (add real, add real and 
pop, integer add) add the source and destination 
operands and return the sum to the destination. 
The operand at the stack top may be doubled by 
coding: 


Normal Subtraction 
FSUB 
/ /source/destination,source 
FSUBP 
destination, source 
FISUB 
source 


The 
normal 
subtraction 
instructions 
(subtract 
real, subtract real and pop, integer subtract) sub- 
tract the source operand from the destination and 
return the difference to the destination. 


Reversed Subtraction 
FSUBR 
/ /source/destination, 
source 
FSUBRP destination,source 
FISUBR 
source 


The reversed subtraction 
instructions 
(subtract 
real reversed, 
subtract 
real reversed and pop, 
integer subtract reversed) subtract the destination 
from the source and return the difference to the 
destination. 


Multiplication 
FMUL 
/ /source/destination, 
source 
FMULP 
destination,source 
FIMUL 
source 


The multiplication 
instructions 
(multiply 
real, 
multiply real and pop, integer multiply) multiply 
the source and destination 
operands and return 


•••• ULJ 
"l,"l\V} 
:;4uart::; me content 
ot 
the 
stack top. 


Normal Division 
FDIV 
/ /source/destination,source 
FDIVP 
destination, source 
FIDIV 
source 


The normal 
division 
instructions 
(divide real, 
divide real and pop, 
integer divide) divide the 
destination by the source and return the quotient 
to the destination. 


Reversed 
Division 
FDIVR 
/ /source/destination,source 
FDIVRP 
destination, source 
FIDIVR 
source 


The reversed 
division 
instructions 
(divide real 
reversed, divide real reversed and pop, 
integer 
divide reversed) divide the source operand 
by 


the destination 
and return 
the quotient 
to the 
destination. 


FSQRT (square root) replaces the content of the 
top stack element with its square root. (Note: the 
square root of -0 is defined to be -0.) 


FSCALE (scale) interprets the value contained in 
ST(l) as an integer, and adds this value to the 
exponent of the number in ST. This is equivalent 
to: 


thus, FSCALE provides rapid multiplication 
or 
division by integral powers of 2. It is particularly 
useful for scaling the elements of a vector. 


Note 
that 
FSCALE 
assumes 
the scale factor 


in 
ST(l) 
is an 
integral 
value 
in 
the 
range 
-215~X< 
215• If the value is not integral, but is 
in-range 
and 
is greater 
in magnitude 
than 
1, 


FSCALE 
uses 
the 
nearest 
integer 
smaller 
in 
magnitude, i.e., it chops the value toward o. If the 
value is out of range, or 0 < I X I < 1, the instruc- 
tion will produce an undefined result and will not 


to load the scale factor from a word integer to 
ensure correct operation. 


FPREM 
FPREM 
(partial 
remainder) 
performs 
modulo 


division of the top stack element by the next stack 
element, i.e., ST(l) is the modulus. FPREM pro- 
duces an exact result; the precision exception does 
not occur. The sign of the remainder is the same 
as the sign of the original dividend. 


FPREM operates by performing successive scaled 
subtractions; obtaining the exact remainder when 
the operands differ greatly in magnitude can con- 
sume large amounts of execution time. Since the 
8087 can only be preempted between instructions, 
the remainder 
function 
could seriously increase 


interrupt latency in these cases. Accordingly, the 
instruction is designed to be executed iteratively in 
a software-controlled 
loop. 


FPREM can reduce a magnitude difference of up 
to 264 in one execution. 
If FPREM 
produces a 


remainder that is less than the modulus, the func- 
tion is complete and bit C2 of the status word 
condition 
code 
is cleared. 
If the 
function 
is 


incomplete, C2 is set to 1; the result in ST is then 
called the partial remainder. Software can inspect 
C2 by storing the status word following execution 
of FPREM and re-execute the instruction 
(using 


the partial remainder in ST as the dividend), until 
C2 is cleared. Alternatively, a program can deter- 
mine when the function is complete by comparing 
ST to ST(l). If ST>ST(l) 
then FPREM must be 


executed again; if ST=ST(l) then the remainder is 
0; if ST<ST(I) then the remainder is ST. A higher 
priority interrupting routine which needs the 8087 
can force a context switch between the instruc- 
tions in the remainder loop. 


An 
important 
use 
for 
FPREM 
is to 
reduce 


arguments (operands) of periodic transcendental 
functions 
to 
the 
range 
permitted 
by 
these 


instructions. For example, the FPT AN (tangent) 
instruction requires its argument to be less than 
n/4. Using n/4 as a modulus, FPREM will reduce 
an argument 
so that it is in range of FPT AN. 


Because FPREM 
produces 
an exact result, 
the 


argument reduction does not introduce roundoff 
error into the calculation, 
even if several itera- 
tions are required 
to bring the argument 
into 


range. (The rounding 
of n does not create the 


effect of a rounded argument, 
but of a rounded 


period.) 


FPREM also provides the least-significant 
three 
bits of the quotient generated by FPREM (in C3, 
Cl' Co), This is also important for transcendental 
argument 
reduction 
since it locates the original 
angle in the correct one of eight rr/4 segments of 
the unit circle. 


FRNDINT 
(round 
to integer) 
rounds 
the top 
stack element to an integer. For example, assume 
that ST contains the 8087 real number encoding 
of the decimal value 
155.625. FRNDINT 
will 
change the value to 155 if the RC field of the con- 
trol word is set to down or chop, or to 156 if it is 
set to up or nearest. 


FXTRACT 
(extract 
exponent 
and significand) 
"decomposes" 
the number in the stack top into 
two numbers that represent the actual value of the 
operand's 
exponent 
and significand 
fields. The 
"exponent" 
replaces the original operand on the 
stack and the "significand" 
is pushed onto the 
stack. Following 
execution 
of FXTRACT, 
ST 
(the new stack top) contains 
the value of the 
original significand expressed as a real number: 
its sign is the same as the operand's, 
its exponent 
is 0 true (16,383 or 3FFFH biased), and its signifi- 
cand is identical to the original operand's. 
ST(1) 
contains the value of the original operand's 
true 
(unbiased) exponent expressed as a real number. 
If the original operand is zero, FXTRACT 
pro- 
duces zeros in ST and ST(1) and both are signed 
as the original operand. 


To clarify the operation 
of FXTRACT, 
assume 
ST contains a number whose true exponent is +4 
(i.e., its exponent 
field contains 4003H). After 
executing FXTRACT, 
ST(1) will contain the real 
number +4.0; its sign will be positive, its exponent 
field will contain 4001H (+2 true) and its signifi- 
cand 
field 
will contain 
1~OO... OOB. In other 
words, the value in ST(1) will be 1.0 x 22 = 4. If 
ST contains an operand whose true exponent is 
-7 (i.e., its exponent field contains 3FF8H), then 
FXTRACT 
will return an "exponent" 
of -7.0; 
after the instruction 
executes, ST(1)'s sign and 
exponent 
fields will contain 
COOlH (negative 


sign, true exponent of 2) and its significand will 
be 1~1100... OOB. In other words the value in 
ST(1) will be -1.11 
x 22 = -7.0. 
In both cases, 


following FXTRACT, 
ST's sign and significand 
fields will be the same as the original operand's, 
and 
its exponent 
field 
will contain 
3FFFH, 
(0 true). 


FXTRACT is useful in conjunction 
with FBSTP 
for converting numbers in 8087 temporary 
real 
format 
to 
decimal 
representations 
(e.g., 
for 
printing or displaying). It can also be useful for 
debugging since it allows the exponent and signifi- 
cand parts 
of a real number 
to be examined 
separately. 


FABS (absolute value) changes the top stack ele- 
ment to its absolute 
value by making 
its sign 
positive. 


FCHS (change sign) complements 
(reverses) the 
sign of the top stack element. 


Each of these instructions 
(table S-12) analyzes 
the top stack element, often in relationship 
to 
another 
operand, 
and reports the result in the 
status word condition code. The basic operations 
are 
compare, 
test 
(compare 
with 
zero), 
and 
examine (report 
tag, sign, and normalization). 


Special forms of the compare operation are pro- 
vided to optimize algorithms 
by allowing direct 
comparisons 
with 
binary 
integers 
and 
real 
numbers in memory, as well as popping the stack 
after a comparison. 


The FSTSW (store status word) instruction may 
be used following a comparison 
to transfer 
the 
condition code to memory for inspection. Section 
S.lO contains an example of using this technique 
to implement condition!!.!branching. 


Note that instructions 
other than those in the 
comparison 
group 
may 
update 
the 
condition 
code. To insure that the status word is not altered 
inadvertently, 
store it immediately 
following a 
comparison operation. 


FCOM (compare real) compares the stack top to 
the source operand. The source operand may be a 
register on the stack, 
or a short or long real 
memory operand. 
If an operand 
is not coded, 
ST is compared 
to ST(l). Positive and negative 
forms of zero compare identically as if they were 
unsigned. 
Following the instruction, 
the condi- 
tion codes reflect the order of the operands 
as 
follows: 


C3 
CO 
Order 


0 
0 
ST > source 
0 
1 
ST< source 
1 
0 
ST = source 
1 
1 
ST? source 


NANs and 
00 (projective) 
cannot 
be compared 
and return C3=CO=1 as shown above. 


FCOM 
FCOMP 
FCOMPP 
FICOM 
FICOMP 
FTST 
FXAM 


Compare real 
Compare real and pop 
Compare real and pop twice 
Integer compare 
Integer compare and pop 
Test 
Examine 


FC,OMP (compare 
real and pop) operates 
like 
FCOM, and in addition pops the stack. 


FCOMPP (compare real and pop twice) operates 
like FCOM and additionally pops the stack twice, 
discarding both operands. 
The comparison 
is of 
the stack top 
to ST(l); 
no operands 
may be 
explicitly coded. 


FICOM 
(integer compare) 
converts 
the source 
operand, 
which may reference a word or short 
binary integer variable, 
to temporary 
real and 
compares the stack top to it. 


FICOMP 
(integer compare 
and pop) 
operates 
identically to FICOM and additionally 
discards 
the value in ST by popping the stack. 


FTST (test) tests the top stack element by compar- 
ing it to zero. The result is posted to the condition 
codes as follows: 


C3 
CO 
Result 


0 
0 
ST is positive and 
nonzero 
0 
1 
ST is negative and 
nonzero 
1 
0 
ST is zero (+ or-) 
1 
1 
ST 
is 
not 
com- 
parable (Le., it is a 
NAN or projective 
(0) 


FXAM (examine) reports the content of the top 
stack element 
as positive/negative 
and NAN/ 
unnormal/denormal/normal/zero, 
or 
empty. 
Table S-13 lists and interprets 
all the condition 
code values that FXAM generates. Although four 
different encodings may be returned for an empty 
register, bits C3 and COof the condition code are 
both 1 in all encodings. Bits C2 and Cl should be 
ignored when examining for empty. 


The instructions 
in this group (table S-14) per- 
form the time-consuming core calculations for all 
common 
trigonometric, 
inverse 
trigonometric, 
hyperbolic, 
inverse hyperbolic, 
logarithmic 
and 
exponential 
functions. 
Prologue 
and 
epilogue 
software may be used to reduce arguments to the 
range accepted by the instructions and to adjust 
the result to correspond to the original arguments 
if necessary. The transcendentals 
operate on the 
top one or two stack elements and they return 
their results to the stack also. 


Condition 
Code 
Interpretation 
C3 
C2 
C1 
CO 


0 
0 
0 
0 
+ Unnormal 


0 
0 
0 
1 
+ NAN 


0 
0 
1 
0 
- Unnormal 


0 
0 
1 
1 
-NAN 


0 
1 
0 
0 
+ Normal 


0 
1 
0 
1 
+"" 


0 
1 
1 
0 
- Normal 


0 
1 
1 
1 
- "" 


1 
0 
0 
0 
+0 


1 
0 
0 
1 
Empty 


1 
0 
1 
0 
-0 


1 
0 
1 
1 
Empty 


1 
1 
0 
0 
+ Denormal 


1 
1 
0 
1 
Empty 


1 
1 
1 
0 
- Denormal 


1 
1 
1 
1 
Empty 


FPTAN 


FPATAN 


F2XM1 


FYL2X 


FYL2XP1 


Partial tangent 


Partial arctangent 
2L1 


YelOQ2X 


YeIOQ2(X+1) 


The transcendental 
instructions assume that their 
operands are valid and in-range. 
The instruction 
descriptions in this section provide the range of 
each 
operation. 
To 
be 
considered 
valid, 
an 
operand to a transcendental 
must be normalized; 
denormals, 
unnormals, 
infinities and NANs are 
considered invalid. (Zero operands are accepted 
by some functions 
and are considered 
out-of- 
range by others.) If a transcendental 
operand is 
invalid 
or 
out-of-range, 
the 
instruction 
will 
produce an undefined result without signalling an 
exception. It is the programmer's 
responsibility to 


ensure 
that 
operands 
are 
valid 
and 
in-range 
before executing a transcendental. 
For periodic 
functions, FPREM may be used to bring a valid 
operand into range. 


FPT AN (partial tangent) computes the function 
YIX 
= TAN (0). 0 is taken from the top stack 
element; it must lie in the range 0 < 0 < n14. The 
result of the operation is a ratio; Y replaces 0 in 
the stack and X is pushed, 
becoming the new 
stack top. 


The ratio result of FPT AN and the ratio argu- 
ment of FP ATAN are designed to optimize the 
calculation of the other trigonometric 
functions, 
including SIN, cas, ARCSIN 
and ARCCOS. 
These can be derived from TAN and ARCTAN 
via standard trigonometric identities. 


FP ATAN (partial arctangent) computes the func- 
tion 0 = ARCTAN (YIX). 
X is taken from the 
top stack element and Y from ST(l). 
Y and X 
must observe the inequality 0 < Y < X < 00. The 
instruction 
pops the stack and returns 0 to the 
(new) stack top, overwriting the Y operand. 


F2XMI (2 to the X minus I) calculates the func- 
tion Y = 2x -1. X'is taken from the stack top and 
must be in the range 0 " X " 0.5. The result Y 
replaces X at the stack top. 


This instruction 
is designed to produce 
a very 
accurate result even when X is close to zero. To 
obtain Y=2x, add I to the result delivered by 
F2XM1. 


The following formulas 
show how values other 
than 2 may be raised to a power of X: 


1()X = 2xoLOG210 


eX = 2xoLOG2e 


yX = 2xoLOG2Y 


As sho\yn in the next section, the 8087 has b.uilt-in 
instructions 
for loading 
the constants 
LOGzlO 
and LOGze, and the FYL2X instruction may be 
used to calculate XeLOGzY. 


FYL2X (Y log base 2 of X) calculates the function 
Z = yeLOGzX. X is taken from the stack top and 
Y from ST(l). The operands must be in the ranges 
o < X < 
00 and - 
00 < Y < + 
00. The instruction 
pops the stack and returns Z at the (new) stack 
top, replacing the Y operand. 


This function optimizes the calculation of log to 
any base other than two since a multiplication 
is 
always required: 
1 


FYL2XPI 
(Y log base 2 of (X + I» calculates 
the function 
Z = yeLOGz 
(X+I). 
X is taken 
from the stack top and must be in the range 
0,< Ixi < (I -('/212)). Y is taken from ST(l) and 
must be in the range - 
00 < Y < 
00. FYL2XPI 
pops the stack and returns Z at the (new) stack 
top, (eplacing Y. 


This instruction provides improved accuracy over 
FYL2X when computing the log of a number very 
close to I, for example 
I + 
E where 
E << 1. 


Providing 
E rather than I + 
E as the input to the 
function 
allows 
more 
significant 
digits 
to be 
retained. 


Each 
of these 
instructions 
(table 
S-15) loads 
(pushes) 
a commonly-used 
constant 
onto 
the 
stack. The values have full temporary real preci- 
sion (64 bits) and are accurate to approximately 
19 decimal digits. Since a temporary real constant 
occupies 10 memory bytes, the constant instruc- 
tions, which are only two bytes long, save storage 
and 
improve 
execution 
speed, 
in addition 
to 
simplifying programming. 


FLOZ 
Load 
+0.0 


FL01 
, 


Load 
+1.0 


FLOPI 


- 


Load 
11 


FLOL2T 
Load 109210 


FLOL2E 
Load 1092e 


FLOLG2 
Load 109,02 


FLOLN2 
Load 109e2 


FLOZ (load zero) loads (pushes) ;1-0.0onto the 
stack. 


FLO I (load one) loads (pushes) + 1.0 onto the 
stack. 


FLOL2T (load log·bas'e2 of 10) loads (pushes) the 
value LOGzlO onto the stack. 


FLOL2E (load log base 2 of e) loads (pushes) the 
value LOGze onto the stack. 


, 


FLOLG2 (load log base 10 of 2) loads (pushes) 
the value LOG 102onto the stack. 


FLOLN2 (load log base e of 2) loads (pushes) the 
value LOGe2 onto the stack. 


Most of these instructions 
(table S-16) are not 
used in computations; 
they are provided 
prin- 
cipally for system-level activities. These include 
initialization, 
exception 
handling 
and 
task 
switch}ng. 


As shown in table S-16, an alternate mnemonic is 
available 
for 
many 
of 
the 
processor 
control 
instructions. 
This mnemonic, 
distinguished by a 
second character of "N", 
instructs the assembler 
fo not prefix the instruction with a CPU WAIT 
instruction 
(instead, 
a CPU NOP precedes the 
instruction). This "no-wait" 
form is intended for 
use in critical code regions where aWAIT instruc- 
tion might 
precipitate 
an endless wait. 
Thus, 
when CPU interrupts are disabled, and the NDP 
can potentially generate an interrupt, 
the no-wait 
form should be used. When CPU interrupts are 
enabled, as will normally be the case when an 
application task is running, the "wait" 
forms of 
these instructions should be used. 


Except for FNSTENV and FNSA VE, all instruc- 
tions which provide a no-wait mnemonic are self- 
synchronizing and can be executed back-to-back 
in any combination without intervening FWAITs. 
These instructions 
can be executed by the 8087 
CU while the NEU is busy with 'a previously 
decoded instruction. To insure· that the processor 
control instruction 
executes after completion of 
any operation in progress in the NEU, the "wait" 
form of.that instruction should be used. 


FINIT/FNINIT 
(initialize 
processor) 
performs 
the functional equivalent of a hardware RESET 
(see section S.6), except that it does not affect the 
instruction fetch synchronization 
of the 8087 and 
its CPU. 


For 
compatibility 
with 
the 
8087 emulator, 
a 
system should call the INIT87 procedure in lieu of 
executing·FINIT/FNINIT 
when the processor is 
first initialized (see section S.8 for details). Note 
that if FNINIT is executed while a previous 8087 
memory referencing instruction 
is running, 8087 
bus cycles in progress will be aborted. 


FDISIIFNDISI 
(disable interrupts) sets the inter- 
rupt 
enable 
mask 
in the 
control 
word 
and 
prevents 
the 
NDP 
from 
issuing 
an 
interrupt 
request. 


FINIT/FNINIT 
Initialize processor 


FDISII FNDISI 
Disable interrupts 


FENIIFNENI 
Enable interrupts 


FLDCW 
Load control word 


FSTCWI FNSTCW 
Store control word 


FSTSWI FNSTSW 
Store status word 


FCLEX/FNCLEX 
Clear exceptions 


FSTENVI FNSTENV 
Store environment 


FLDENV 
Load environment 


FSAVE/FNSAVE 
Save state 


FRSTOR 
Restore state 


FINCSTP 
Increment stack pointer 


FDECSTP 
Decrement stack pointer 


FFREE 
Free register 


FNOP 
No operation 


FWAIT 
CPU wait 


FENIIFNENI 
(enable interrupts) clears the inter- 
rupt enable mask in the control word, allowing 
the 8087 to generate interrupt requests. 


FLDCW (load control word) replaces the current 
processor control word with the word defined by 
the source operand. 
This instruction 
is typically 
used to establish, or change, the 8087's mode of 
operation. 
Note that if an exception bit in the 
status word is set, loading a new control word 
that unmasks that exception and clears the inter- 
rupt 
enable mask 
will generate 
an immediate 
interrupt 
request before the next instruction 
is 
executed. 
When 
changing 
modes, 
the 
recom- 
mended procedure is to first clear any exceptions 
and then load the new control word. 


FSTCW IFNSTCW 
(store control 
word) writes 


the current processor control word to the memory 
location defined by the destination. 


FSTSW IFNSTCW 
(store status word) writes the 


current 
value of the 8087 status 
word to the 


destination operand in memory. The instruction 
has many uses: 


• 
to 
implement 
conditional 
branching 


following a comparison 
or FPREM instruc- 


tion (FSTSW); 


• 
to poll the 8087 to determine 
if it is busy 


(I'NSTSW); 


• 
to invoke exception handlers in environments 
that do not use interrupts (FSTSW). 


FCLEX/FNCLEX 
(clear exceptions) 
clears all 


exception flags, the interrupt request flag and the 
busy flag in the status word. As a consequence, 
the 8087's INT and BUSY lines go inactive. An 
exception 
handler 
must 
issue this 
instruction 


before returning to the interrupted computation, 
or another 
interrupt 
request will be generated 


immediately, and an endless loop may result. 


FSAVE/FNSA VE (save state) writes the full 8087 
state-environment 
plus register 
stack-to 
the 


memory 
location 
defined 
by 
the 
destination 


operand. Figure S-18 shows the layout of the 94- 
byte save area; typically the instruction 
will be 


coded to save this image on the CPU stack. If an 
instruction 
is executing in the 8087 NEU when 


FNSAVE 
is decoded, 
the 
CPU 
queues 
the 


FNSAVE and delays its execution until the run- 
ning 
instruction 
completes 
normally 
or 


encounters 
an unmasked 
exception. 
Thus, 
the 


save image reflects the state of the NDP following 
the completion of any running instruction. 
After 


writing 
the 
state 
image 
to 
memory, 
FSAVE/FNSA VE 
initializes 
the 
8087 
as 
if 


FINIT IFNINIT had been executed. 


FSAVE/FNSA VE is useful whenever a program 
wants to save the current state of the NDP and 
initialize it for a new routine. Three examples are: 


• 
an operating 
system needs to perform 
a 


context switch (suspend the task that had 
been running and give control to a new task); 


• 
an interrupt handler needs to use the 8087; 


• 
an application task wants to pass a "clean" 
8087 to a subroutine. 


FNSAVE must be "protected" 
by executing it in 


a critical region, Le., with CPU interrupts 
dis- 


abled. This prevents an interrupt 
handler from 
executing a second FNSAVE (or other "no-wait" 
processor 
control 
instruction 
that 
references 


memory) which could destroy the first FNSA VE 
if it is queued in the 8087. An FWAIT should be 
executed before CPU interrupts 
are enabled or 


any 
subsequent 
8087 instruction 
is executed. 


(Because the FNSA VE initializes the NDP, there 
is no danger of the FWAIT causing an endless 
waiL) Other CPU instructions 
may be executed 


between 
the 
FNSA VE and 
the 
FWAIT; 
this 


parallel execution will reduce interrupt latency if 
the FNSA VE is queued in the 8087. 


FRSTOR (restore state) reloads the 8087 from the 
94-byte 
memory 
area 
defined 
by 
the 
source 


operand. This information should have been writ- 
ten by a previous FSAVE/FNSA VE instruction 
and not altered by any other instruction. 
CPU 
instructions (that do not reference the save image) 
may immediately follow FRSTOR, but no NDP 
instruction 
should 
be without 
an 
intervening 


FWAIT or an assembler-generated WAIT. 


Note that the 8087 "reacts" 
to its new state at the 


conclusion of the FRSTOR; it will for example, 
generate an immediate 
interrupt 
request if the 


exception and mask bits in the memory image so 
indicate. 


FSTENV IFNSTENV 
(store environment) 
writes 


the 8087's basic status-control, 
status and tag 


words, and exception pointers-to 
the memory 


location 
defined 
by the 
destination 
operand. 


Typically the environment 
is saved on the CPU 


stack. 
FSTENV IFNSTENV 
is often 
used 
by 


exception handlers because it provides access to 
the exception pointers which identify the offend- 
ing instruction 
and operand. 
After 
saving the 
environment, FSTENV IFNSTENV sets all excep- 
tion masks in the processor; it does not affect the 
interrupt-enable 
mask. Figure S-19 shows the for- 
mat 
of the environment 
data 
in memory. 
If 
FNSTENV is decoded while another instruction is 
executing concurrently 
in the NEU, 
the 8087 
queues the FNSTENV 
and does not store the 
environment until the other instruction has com- 
pleted. Thus, the data saved by the instruction 
reflects the 8087 after 
any previously decoded 
instruction has been executed. 


TOP STACK 
{ 
ELEMENT:ST 


CONTROL WORD 


STATUS WORD 


TAG WORD 


IP15-0 


IP19-16\ 
0I 
OPCODE 


OP15-o 


OP19-161 
0 


SIGNIFICAND 
15-0 


SIGNIFICAND 
31-16 


SIGNIFICAND 
47-32 
· 


SIGNIFICAND 
63-48 
sl 
EXPONENT 14-0 


SIGNIFICAND 
15-0 


SIGNIFICAND 
31-16 


SIGNIFICAND 
47-32 
· 


SIGNIFICAND 
63-48 
sl 
EXPONENT 14-0 


SIGNIFICAND 
15-0 


SIGNIFICAND 
31-16 


SIGNIFICAND 
47-32 
· 


SIGNIFICAND 
63-48 
sl 
EXPONENT 14-0 


INSTRUCTION 
{ 
POINTER 


OPERAND 
{ 
POINTER 


NEXT STACK 
{ 
ELEMENT:ST(1) 


LASTSTACK 
{ 
ELEMENT:ST(7) 


NOTES: 
S = Sign 


Bit 0 of each 
field 
is rightmost, 
least 
significant 
bit of corresponding 
register field. 


Bit 63 of significand 
is integer 
bit (assumed 
binary 
point 
is immediately 
to the right). 


Figure S-18. FSA VE/FRSTOR 
Memory 
Layout 


FSTENV IFNSTENV 
must be allowed to com- 
plete 
before 
any 
other 
8087 
instruction 
is 


decoded. 
When FSTENV 
is coded, 
an explicit 
FWAIT, or assembler-generated 
WAIT, should 
precede 
any subsequent 
8087 instruction. 
An 
FNSTENV must be executed in a critical region 
that is protected from interruption, 
in the same 
manner as FNSA VE. (There is no risk of the 
following 
FWAIT 
causing 
an 
endless 
wait, 


because FNSTENV masks all exceptions, thereby 
preventing an interrupt request from the 8087.) 


INCREASING 
ADDRESSES 
=J 
H::! 


CONTROL WORD 


STATUS WORD 


TAG WORD 


IP15-0 


IP19-16! 
01 
OPCODE 


OP15·0 


OP19-161 
0 


INSTRUCTION 
{ 
POINTER 


OPERAND 
{ 
POINTER 


Figure S-19. FSTENV IFLDENV 
Memory 
Layout 


FLDENV 
(load environment) 
reloads the 8087 
environment 
from the memory area defined by 


the source operand. This data should have been 
written 
by 
a 
previous 
FSTENV IFNSTENV 
instruction. 
CPU 
instructions 
(that 
do 
not 


reference 
the 
environment 
image) 
may 


immediately follow FLDENV, but no subsequent 
NDP instruction 
should be executed without an 


intervening 
FWAIT 
or 
assembler-generated 
WAIT. 


Note that loading an environment image that con- 
tains 
an 
unmasked 
exception 
will 
cause 
an 
immediate 
interrupt 
request 
from 
the 
8087 
(assuming IEM=O in the environment image). 


FINCSTP (increment stack pointer) adds 1 to the 
stack top pointer (ST) in the status word. It does 
not alter tags or register contents, 
nor does it 
transfer data. It is not equivalent to popping the 
stack since it does not set the tag of the previous 
stack 
top 
to 
empty. 
Incrementing 
the 
stack 
pointer when ST=7 produces ST=O. 


FDECSTP 
(decrement 
stack 
pointer) 
subtracts 
I 


from ST, the stack top pointer 
in the status 
word. 
No tags or registers 
are altered, 
nor is any data 


transferred. 
Executing 
FDECSTP 
when 
ST=O 


produces 
ST=7. 


FFREE 
(free 
register) 
changes 
the 
destination 


register's 
tag to empty; 
the content 
of the register 


is unaffected. 


FNOP 
(no operation) 
stores 
the stack 
top to the 


stack 
top 
(FST 
ST,ST(O)) 
and 
thus 
effectively 


performs 
no operation. 


FW AIT is not actually 
an 8087 instruction, 
but an 


alternate 
mnemonic 
for the CPU 
WAIT 
instruc- 


tion 
described 
in 
section 
2.8. 
The 
FWAIT 


mnemonic 
should 
be coded 
whenever 
the 
pro- 


grammer 
wants 
to synchronize 
the 
CPU 
to the 


NDP, 
that 
is, 
to 
suspend 
further 
il1'Struction 


decoding 
until the NDP has completed 
the current 


instruction. 
A 
CPU 
instruction 
should 
not 


attempt 
to access a memory 
operand 
that has 
been read or written by a previous 8087 instruc- 


tion until the 8087 instruction has completed. 
The 


following 
coding 
shows how FW AIT can be used 


to force the CPU instruction 
to wait for the 8087: 


FNSTSW 
STATUS 
FWAIT 
;Wait for FNSTSW 
MOV 
AX,STATUS 


Programmers 
should 
not 
code 
WAIT 
to 


synchronize 
the CPU 
and the NDP. 
The routines 


that 
alter 
an object 
program 
for 8087 emulation 
eliminate 
FW AITs 
(and 
assembler-generated 


WAITs) 
but 
do not change 
any explicitly 
coded 


WAITs. 
The program 
will wait forever 
if a WAIT 


is encountered 
in emulated 
execution, 
since there 


is no 8087 to drive the CPU's 
TEST pin active. 


Instruction 
Set Reference 
Information 


Table S-19lists 
the operating 
characteristics 
of all 


the 8087 instructions. 
There 
is one table entry for 
each 
instruction 
mnemonic; 
the 
entries 
are 
in 


alphabetical 
order 
for quick 
lookup. 
Each 
entry 


provides 
the general 
operand 
forms 
accepted 
by 


the instruction 
as well as a list of all exceptions 


that may be detected 
during 
the operation. 


There 
is 
one 
entry 
for 
each 
combination 
of 


operand 
types 
that 
can 
be 
coded 
with 
the 


mnemonic. 
Table 
S-17 explains 
the operand 
iden- 


tifiers 
allowed 
in table S-19. Following 
this entry 


are columns 
that provide 
execution 
time in clocks, 


the number 
of bus transfers 
run during 
the opera- 


tion, the length 
of the instruction 
in bytes, 
and an 


ASM-86 coding 
sample. 


Identifier 
Explanation 


ST 
Stack top; the register 
currently 
at the top of the stack. 


ST(i) 
A register 
in the stack 
i (0~i~7) stack elements 
from the 


top. 
ST(1) is the 
next-an-stack 
register, 
ST(2) is below 


ST(1), etc. 


Short-real 
A short real (32 bits) number 
in memory. 


Long-real 
A long real (64 bits) number 
in memory. 


Temp-real 
A temporary 
real (80 bits) number 
in memory. 


Packed-decimal 
A packed 
decimal 
integer 
(18 digits, 10 bytes) in memory. 


Word-integer 
A word binary integer 
(16 bits) in memory. 


Short-integer 
A short binary integer 
(32 bits) in memory. 


Long-integer 
A long binary integer 
(64 bits) in memory. 


nn-bytes 
A memory 
area nn bytes long. 


The execution 
of an 8087 instruction 
involves 


three principal activities, each of which may con- 
tribute to the total duration 
(execution time) of 


the operation: 


• 
Instruction fetch 


• 
Instruction execution 


• 
Operand transfer 


The CPU and NDP simultaneously pre fetch and 
queue 
their 
common 
instruction 
stream 
from 


memory. This activity is performed during spare 
bus cycles and proceeds in parallel with the execu- 
tion of instructions 
from the queue. Because of 


their complexity, 8087 instructions typically take 
much longer to execute than to fetch. This means 
that in a typical sequence of 8087 instructions the 
processors have a relatively large amount of time 
available 
to maintain 
full instruction 
queues. 


Instruction fetching is therefore fully overlapped 
with execution 
and does not contribute 
to the 
overall duration of a series of instructions. Fetch 
time does become apparent when a CPU jump or 
call 
instruction 
alters 
the 
normal 
sequential 


execution. 
This empties the queues and delays 
execution 
of the target 
instruction 
until 
it is 


fetched from memory. The time required to fetch 
the instruction depends on its length, the type of 
CPU, and, if the CPU is an 8086, whether the 
instruction is located at an even or odd address. 
(Slow memories, which force the insertion of wait 
states in bus cycles, and the bus activities of other 
processors in the system, may also lengthen fetch 
time.) Section 2.7 covers this topic in more detail. 


Table S-19 quotes a typical execution time and a 
range for each instruction. 
Dividing the figures in 


the table by 5 (assuming a 5 MHz clock) produces 
execution time in microseconds. 
The typical case 
is an estimate for operand values that normally 
characterize 
most 
applications. 
The 
range 
encompasses best- and worst-case operand values 
that may be found 
in extreme circumstances. 


Where applicable, the figures include all overhead 
incurred 
by the CPU's 
execution 
of the ESC 


instruction, 
local bus arbitration 
(request/grant 
time), and the average overhead 
imposed by a 
preceding WAIT instruction (half of the 5-clock 
cycle that it uses to examine the TEST pin). 


The execution times assume that no exceptions 
are detected. 
Invalid 
operation, 
denormalized 


(unmasked), 
and zerodivide 
exceptions 
usually 


decrease execution time from the typical figure, 
but it will still fall within the quoted range. The 
precision exception 
has no effect on execution 


time. Unmasked 
overflow and underflow, 
and 


masked 
denormalized 
exceptions, 
impose 
the 


penalties shown in table S-18. Absolute 
worst- 


case execution time is therefore 
the high range 


figure 
plus 
the 
largest 
penalty 
that 
may 
be 
encountered. 


For instructions that transfer operands to or from 
memory, the execution times in table S-19 show 
that the time required for the CPU to calculate 
the operand's 
effective address (EA) should be 


added. Effective address calculation 
time varies 


according to addressing mode; table 2-20 supplies 
the figures. 


Exception 
Additional 
Clocks 


Overflow (unmasked) 
14 


Underflow (unmasked) 
16 


Denormalized (masked) 
33 


Instructions 
that reference memory execute bus 


cycles 
to 
transfer 
operands. 
Each 
transfer 


requires one bus cycle. The number of transfers 
depends on the length of the operand, the type of 
CPU, and the alignment of the operand 
if the 


CPU is an 8086. The figures in table S-19 include 
the "dummy 
read" 
transfer(s) performed by the 


CPU in its execution of the escape instruction that 
corresponds 
to the 8087 instruction. 
The first 


8086 figure is for even-addressed operands, 
and 


the second is for odd-addressed operands. 


A bus cycle (transfer) consumes four clocks if the 
bus is immediately available and if the memory is 
running at processor speed, without wait states. 
Additional time is required if slow memories are 
employed, because these insert wait states into the 
bus cycle. In multiprocessor 
environments, 
the 


bus may not be available immediately if a higher 
priority processor is using it; this also can increase 
effective transfer time. 


Note that the lengths quoted in table 8-19 do not 
include the one byte CPU WAIT instruction that 
the assembler automatically inserts in front of all 
NDP instructions (except those coded with a "no- 
wait" mnemonic). 


Instructions 
that do not reference memory are 
two bytes long. Memory reference instructions 
vary between two and four bytes. The third and 
fourth bytes are used for 8- or 16-bit displacement 
values; the assembler generates the short displace- 
ment whenever possible. 
No displacements 
are 
required in memory references that use only CPU 
register contents to calculate an operand's 
effec- 
tive address. 


FABS 
FABS (no operands) 
Absolute value 
Exceptions: 
I 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


- 


(no operands) 
14 
10-17 
0 
0 
2 
FABS 


FAOO 
FADD 
Iisource/destination,source 
Add real 
Exceptions: 
I, D, 0, U, P 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


IIST,ST(i)/ST(i),ST 
85 
70-100 
0 
0 
2 
FADD ST,ST(4) 
short-real 
105+EA 
9O-120+EA 
2/4 
4 
2-4 
FADD AIR_TEMP 
[SI] 
long-real 
110+EA 
95-125+EA 
4/6 
8 
2-4 
FADD [BX].MEAN 


FAoop 
FADDP destination, source 
Add real and pop 
Exceptions: 
I, D, 0, U, P 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


ST(i),ST 
90 
75-105 
0 
0 
2 
FADDP ST(2),ST 


FBLO 


) 


FBLD source 
Packed decimal (BCD) load 
Exceptions: 
I 


Execution"Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


packed-decimal 
300+EA 
29O-310+EA 
5/7 
10 
2-4 
FBLD YTD_SALES 


FBSTP 
FBSTP destination 
Exceptions: 
I 
Packed decimal (BCD) store and pop 
-~ 
~ 
- 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 
. 


packed-decimal 
530+EA 
520-540+EA 
6/8 
12 
2-4 
FBSTP [BXj.FORECAST 


FCHS 
FCHS (no operands) 
Exceptions: 
I 
Change sign 
. 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
I 
15 
10-17 
0 
0 
2 
FCHS 


FCLEX/FNCLEX 
FCLEX (no operands) 
Exceptions: 
None 
Clear exceptions 


- 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
5 
2-8 
0 
0 
2 
FNCLEX 
- 


FCOM 
FCOM 
Ilsource 
Exceptions: 
I, D 
Compare real 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


IIST(I) 
45 
40-50 
0 
0 
2 
FCOM ST(1) 
short-real 
65+EA 
6Q-70+EA 
2/4 
4 
2-4 
FCOM [BP].UPPER_L1MIT 
long-real 
70+EA 
65-75+EA 
4/6 
8 
2-4 
FCOM WAVELENGTH 


-. 


'FCOMP 
FCOMP 
IIsource 
Exceptions: 
I, D 
Compare real and pop 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


IIST(I) 
47 
42-52 
0 
0 
2 
FCOMP ST(2) 
short-real 
68+EA 
63-73+EA 
2/4 
4 
2-4 
FCOMP [BP+2].N_READINGS 
long-real 
72+EA 
67-77+EA 
4/6 
8 
2-4 
FCOMP DENSITY 


FCOMPP 
(no operands) 
Compare real and pop twice 


Transfers 


8086 
8088 
Bytes 


FOECSTP 
(no operands) 
q 
Exceptions: 
None 
Decrement stack pointer 


EXf3cution Clocks 
Transfers 


Typical 
Range 
8086 
8088 


-Bytes 
Coding 
Example 


9 
6-12 
0 
0 
2 
FDECSTP 


FOISI 
(no operands) 
Exceptions: 
None 
Disable interrupts 
,"..' 


Execution 
Clocks 
Transfers 


Typical 
Range 
8086 
8088 


-Bytes 
Coding 
Example 


5 
2-8 
0 
0 
2 
FOlsl 


FOIV 
//source/destination,source 
Divide real 


Transfers 


8086 
8088 
Bytes 


//sT(i),sT 
short-real 
long-real 


198 
220+EA 
225+EA 


193-203 
0 
215-225+EA 
2/4 
220-230+EA 
4/6 


2 
FDIV 
2-4 
FDIV 
DISTANCE 
2-4 
FDIV 
ARC [DI] 


FOIVP 
destination,source 
Divide real and pop 


Transfers 


8086 
8088 
Bytes 


FDIVR //source/destination,source 
,Divide real reversed 


Execution Clocks 
Transfers 


Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


199 
194-204 
0 
0 
2 
FDIVR ST(2),ST 
221+EA 
216-226+EA 
2/4 
6 
2-4 
FDIVR [BX].PULSE_RATE 
226+EA 
221-231+EA 
4/6 
8 
2-4 
FDIVR RECORDER.FREQUENCY 


//ST,ST(i)/ST(i),ST 
short·real 
long-real 


FDIVRP destination, source 
Divide real reversed and pop 


Transfers 


8086 
8088 
Bytes 


FENI/FNENI 
FENI (no operands) 
Exceptions: 
None 
Enable interrupts 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(ho operands) 
5 


r 
2-8 
0 
0 
2 
FNENI 


~,1 
.' 


" 


" 


FFREE destination 
Free register 


EIADD-source 
Integer add 


Transfers 


8086 
8088 
Bytes 


word-integer 
short-integer 
120+EA 
102-137+EA 
1/2 
125+EA 
108-143+EA 
2/4 
2-4 
FIADD DISTANCE_TRAVELLED 
2-4 
FIADD PUlSE_COUNT 
[SI) 


FICOM 
FICOM 
source 
Exceptions: 
I,D 
Integer compare 


- 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


word-integer 
80+EA 
72-86+EA 
1/2 
2 
2-4 
FICOM TOOL.N_PA55E5 
short-integer 
85+EA 
78-91+EA 
2/4 
4 
2-4 
FICOM [BP+4J.PARM_COUNT 


FICOMP 
FICOMP 
source 
Exceptions: 
I,D 


",,, ' 
Integer compare and pop 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


I 


word-integer 
82+EA 
74-86+EA 
1/2 
2 
2-4 
FICOMP [BP].L1MIT [51] 
short-integer 
87+EA 
80-93+EA 
2/4 
4 
2-4 
FICOMP N_5AMPLE5 


FIDIV 
FIDIV source 
Exceptions: 
I, D, Z, 0, U, P 
Integer divide 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


word-integer 
230;t-EA 
224-238+EA 
1/2 
2 
2-4 
FIDIV SURVEY.OBSERVATIONS 
short-integer 
236+EA 
230-243+EA 
2/4 
4 
2-4 
FIDIV RELATIVE_ANGLE 
[DIJ 


FIDIVR 
FIDIVR 
source 
Exceptions: 
I,D, Z, 0, U, P 
Integer divide reversed 
-, 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 
- 


word-integer 
230+EA 
225-239+EA 
1/2 
2 
2-4 
FIDIVR [BPJ.X_COORD 
short-integer 
237+EA 
231-245+EA 
2/4 
4 
2-4 
FIDIVR FREQUENCY 


", 
, 


FILD 
FILD source 
Exception: 
I 
Integer load 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


word-integer 
50+EA 
46-54+EA 
1/2 
2 
2-4 
FILD [BXJ.SEQUENCE 
short-integer 
56+EA 
52-60+EA 
2/4 
4 
2-4 
FILD STANDOFF [DIJ 


long-i nteger 
64+EA 
60-68+EA 
4/6 
8 
2-4 
FILD RESPONSE.COUNT 


FIMUL 
FIMUL source 
Exceptions: 
I, D, 0, P 
Integer multiply 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


word-integer 
130+EA 
124-138+EA 
1/2 
2 
2-4 
FIMUL BEARING 
short-integer 
136+EA 
130-144+EA 
2/4 
4 
2-4 
FIMUL POSITION.Z_AXIS 


FINCSTP 
FINCSTP (no operands) 
Exceptions: 
None 
Increment stack pointer 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
9 
6-12 
0 
0 
2 
FINCSTP 


FINIT /FNINIT 
FIN IT (no operands) 
Exceptions: 
None 
Initialize processor 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
5 
2-8 
0 
0 
2 
FINIT 


FIST 
FIST destination 
Exceptions: 
I, P 
Integer store 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


word-integer 
86+EA 
80-90+EA 
2/4 
4 
2-4 
FIST OBS.COUNT [SI] 
short-integer 
88+EA 
82-92+EA 
3/5 
6 
2-4 
FIST [BP].FACTORED_PULSES 


FISTP 
FISTP destination 
Integer store and pop 
Exceptions: 
I, P 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


word-integer 
88+EA 
82-92+EA 
2/4 
4 
2-4 
FISTP [BX].ALPHA_COUNT 
[SI] 


short-integer 
90+EA 
84-94+EA 
3/5 
6 
2-4 
FISTP CORRECTED_TIME 
long-integer 
100+EA 
94-105+EA 
5/7 
10 
2-4 
FISTP PANEL.N_READINGS 


FISUB 
FISUB source 
Exceptions: 
I, D, 0, P 
Integer subtract 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


word-integer 
120+EA 
102-137+EA 
1/2 
2 
2-4 
FISUB BASE_FREQUENCY 
short-integer 
125+EA 
108-143+EA 
2/4 
4 
2-4 
FISUB TRAIN_SIZE 
[DIJ 


FISUBR 
FISUBR source 
Exceptions: 
I, D, 0, P 
Integer subtract reversed 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


word-integer 
120+EA 
103-139+EA 
1/2 
2 
2-4 
FISUBR FLOOR [BX) [SIJ 


short-integer 
125+EA 
109-144+EA 
2/4 
4 
2-4 
FISUBR BALANCE 


FLD 
FLD source 
Exceptions: 
I, D 
Load real 


EXecution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


ST(i) 
20 
17-22 
0 
0 
2 
FLD ST(O) 
short-real 
43+EA 
38-56+EA 
2/4 
4 
2-4 
FLD READING [Slj.PRESSURE 
long-real 
46+EA 
40-60+EA 
4/6 
8 
2-4 
FLD [BPJ.TEMPERATURE 
temp-real 
57+EA 
53-65+EA 
5/7 
10 
2-4 
FLD SAVEREADING 


FLDCW 
FLDCW source 
Exceptions: 
None 
Load control word 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


2-bytes 
10+EA 
7-14+EA 
1/2 
2 
2-4 
FLDCW CONTROL_WORD 


FLDENV 
FLDENV source 
Exceptions: 
None 
Load environment 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


14-bytes 
40+EA 
35-45+EA 
7/9 
14 
2-4 
FLDENV [BP+6J 


FLDLG2 
FLDLG2 
(no operands) 
Exceptions: 
I 
Load 109102 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
21 
18-24 
0 
0 
2 
FLDLG2 


FLDLN2 
FLDLN2 
(no operands) 
Exceptions: 
I 
Load 10ge2 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
20 
17-23 
0 
0 
2 
FLDLN2 


FLDL2E 
FLDL2E (no operands) 
Exceptions: 
I 
Load 1092e 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
18 
15-21 
0 
0 
2 
FLDL2E 


FLDL2T 
FLDL2T 
(no operands) 
Exceptions: 
I 
Load 109210 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Byles 
Coding Example 


(no operands) 
19 
16-22 
0 
0 
2 
FLDL2T 


FLDPI 
FLDPI (no operands) 
Load II 
Exceptions: 
I 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
19 
16-22 
0 
0 
2 
FLDPI 


FLDZ 
FLDZ (no operands) 
Exceptions: 
I 
Load +0.0 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
14 
11-17 
0 
0 
2 
FLDZ 


FLD1 
FLD1 (no operands) 
Exceptions: 
I 
Load +1.0 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
18 
15-21 
0 
0 
2 
FLD1 


FMUL 
FMUL 
//source/destination,source 
Exceptions: 
I, D, 0, U, P 
Multiply real 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


//ST(i),ST /ST,ST(i)' 
97 
90-105 
0 
0 
2 
FMUL ST,ST(3) 
//ST(i),ST /ST,ST(i) 
138 
130-145 
0 
0 
2 
FMUL ST,ST(3) 
short-real 
118+EA 
110-125+EA 
2/4 
4 
2-4 
FMUL SPEED_FACTOR 
long-real' 
120+EA 
112-126+EA 
4/6 
8 
2-4 
FMUL [BP].HEIGHT 
long-real 
161+EA 
154-168+EA 
4/6 
8 
2-4 
FMUL [BPJ.HEIGHT 


, 


occurs when one or both operands is "short"-it 
has 40 trailing zeros in its fraction (e.g., it was loaded from a short-real 
memory operand). 


FMULP 
FMULP destination,source 
Exceptions: 
I, D, 0, U, P 
Multiply real and pop 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


ST(iLST' 
100 
94-108 
0 
0 
2 
FMULP ST(1LST 
ST(ILST 
142 
134-148 
0 
0 
2 
FMULP ST(1),ST 


, 


occurs when one or both operands is "short"-it 
has 40 trailing zeros in its fraction (e.g., it was loaded from a short-real 
memory operand). 


, 


FNOP 
FNOP (no operands) 
Exceptions: 
None 
No operation 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
13 
10-16 
0 
0 
2 
FNOP 


FPATAN 
FPATAN (no operands) 
Exceptions: 
U, P (operands not checked) 
Partial arctangent 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
650 
250-800 
0 
0 
2 
FPATAN 


FPREM 
FPREM (no operands) 
Exceptions: 
I, D, U 
Partial remainder 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
125 
15-190 
0 
0 
2 
FPREM 


FPTAN 
FPTAN (no operands) 
Exceptions: 
I, P (operands not checked) 
Partial tangent 


Execution Clocks 
Transfers 
Operands 
Typical 
Ra.nge 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
450 
30-540 
0 
0 
2 
FPTAN 


FRNDINT 
FRNDINT 
(no operands) 
Exceptions: 
I, P 
Round to integer 


Execution Clocks 
Transfers 
, 


Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 
l 


(no operands) 
45 
16-50 
0 
0 
2 
FRNDINT 


I 


Transfers 


8086 
8088 
Bytes 


FRSTOR source 
Restore saved state 


FSAVE/FNSAVE 
FSAVE destination 
, 


Save state 
Exceptions: 
None 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 - 8088 
Bytes' 
- 
Coding Example 
. 


94-bytes 
210+EA 
205-215+EA 48/50' 
94 
2-4 
FSAVE [SPJ 


FSCALE 
- 
FSCALE 
(no operands) 
Exceptions: 
I, 0, U 
Scale 


Execution Clocks 
Transfers 
I 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


'. 


(no operands) 
35 
32-38 
0 
0 
2 
FSCALE 


- --- 


FSQRT (no operands) 
Square root 


Transfers 


8086 
8088- 
Bytes 


FST destination 
Exceptions: 
1,0, U, P 
Store real 


Execution Clocks 
Transfers 


Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


18 
15-22 
0 
0 
2 
FST ST(3) 
87+EA 
84-90+EA 
3/5 
6 
2-4 
FST CORRELATION [DIJ 


100+EA 
96-104+EA 
5/7 
10 
2-4 
FST MEAN_READING 


ST(i) 
short-real 
long-real 


FSTCW IFNSTCW 
FSTCW destination 


I. 


Store control word 
Exceptions: 
None 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


2-bytes 
15+EA 
12-18+EA 
2/4 
4 
2-4 
FSTCW SAVE_CONTROL 


FSTENV/FNSTENV 
FSTENV destination 
Exceptions: 
None 
Store environment 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


14-bytes 
45+EA 
40-50+EA 
8/10 
16 
2-4 
FSTENV [BP] 


i 


FSTP 
FSTP destination 
Exceptions: 
1,0, U, P 
Store real and pop 
.; 
hJ • 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


ST(i) 
20 
17-24 
0 
0 
2 
FSTP ST(2) 
short-real 
89+EA 
86-92+EA 
3/5 
6 
2-4 
FSTP [BX].ADJUSTED_RPM 
long-real 
102+EA 
9B-106+EA 
5/7 
10 
2-4 
FSTP TOTAL_DOSAGE 
temp-real 
55+EA 
52-58+EA 
6/8 
12 
2-4 
FSTP REG_SAVE [SIJ 


f 


FSTSW IFNSTSW 
FSTSW destination 
Exceptions: 
None 
Store status word 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding EXl!mple 


T 


2-bytes 
15+EA 
12-18+EA 
2/4 
4 
2-4 
FSTSW SAVE_STATUS 
~ 


FSUB 
F$UB 
I/source/destination,source 
Subtract real 
Exceptions: 
I,D,O,U,P 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


c 


//ST,ST(i)/ST(ij,ST 
85 
70-100 
0 
0 
2 
FSUB ST,ST(2) 
short-real 
105+EA 
90-120+EA 
2/4 
4 
2-4 
FSUB BASE_VALUE 
long-real 
110+EA 
95-125+EA 
4/6 
8 
2-4 
FSUB COORDINATE.X 


FSUBP 
FSUBP destination,source 
Exceptions: 
I,D,O,U,P 
Subtract real and pop 


Execution Clocks , 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


ST(i),ST 
90 
' 75-105 
0 
0 
2 
FSUBP ST(2),ST 


FSUBR 
FSUBR //source/destination,source 
Exceptions: 
I,D,O,U,P 
Subtract real reversed 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


//ST,ST(i)/ST(i),ST 
87 
70-100 
0 
0 
2 
FSUBR ST,ST(1) 
short-real 
105+EA 
90-120+EA 
2/4 
4 
2-4 
FSUBR VECTOR[SI) 
long-real 
110+EA 
95-125+EA 
4/6 
8 
2-4 
FSUBR [BX].INDEX 


FSUBRP 
FSUBRP destination, source 
Exceptions: 
I,D,O,U,P 
Subtract real reversed and pop 


Executon Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


ST(i),ST 
90 
75-105 
0 
0 
2 
FSUBRP ST(1),ST 


FTST 
FTST (no operands) 
Test stack top against + 0.0 
Exceptions: 
I, 0 


, 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
42 
38-48 
0 
0 
2 
FTST 


FWAIT 
FWAIT (no operands) 
(CPU) Wait while 8087is busy 
Exceptions: 
None (CPU instruction) 
~ 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
3+5n* 
3+5n* 
0 
0 
1 
FWAIT 
. 
. 


FXAM 
FXAM (no operands) 
Exceptions: 
None 
Examine stack top 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
17 
12-23 
0 
0 
2 
FXAM 


FXCH 
FXCH 
Iidestination 
Exceptions: 
I 
Exchange registers 
. 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


IIST(i) 
12 
10-15 
0 
0 
2 
FXCH ST(2) 
, 


FXTRACT 
FXTRACT (no operands) 
Exceptions: 
I 
Extract exponent and significand 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


I, 


(no operands) 
50 
27-55 
0 
0 
2 
FXTRACT 


'. 


FYL2X 
FYL2X (no operands) 
Exceptions: 
P (operands not checked) 
y. Log2x 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
950 
900-1100 
0 
0 
2 
FYL2X 


FYL2XP1 
FYL2XP1 (no operands) 
y. iog2(x+ 1) 
Exceptions: 
P (operands not checked) 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


(no operands) 
l 
850 
700-1000 
0 
0 
2 
FYL2XP1 


I 


---- 
---- 
- 
F2XM1 
F2XM1 
(no operands) 
Exceptions: 
U, P (operands not checked) 
2 x_1 


Execution Clocks 
Transfers 
Operands 
Typical 
Range 
8086 
8088 
Bytes 
Coding Example 


~~ 


(no operands) 
500 
310-630 
0 
0 
2 
F2XM1 


5.8 Programming Facilities 


Writing programs for the 8087 is a natural exten- 
sion of the process described in section 2.9, just as 
the NDP itself is an extension to the CPU. This 
section describes how PL/M-86 and ASM-86 pro- 
grammers work with the 8087 in these languages. 
It 
also 
covers 
the 
8087 software 
emulators 
provided for both translators. 


The level of detail in this section is intended to 
give programmers 
a basic understanding 
of the 


software tools that can be used with the 8087, but 
this information 
is not sufficient to document the 


full capabilities of these facilities. The definitive 
description of ASM-86 and the full 8087 emulator 
is provided 
in MCS-86 
Assembly 
Language 


Reference 
Manual, 
Order 
No. 
9800640, 
and 


MCS-86 
Assembler 
Operating 
Instructions 
for 


ISIS-II Users, Order No. 9800641. PLlM-86 and 
the partial emulator are documented in PL/M-86 
Programming 
Manual, 
Order No. 9800466 and 


ISIS-II PL/M-86 
Compiler Operator's Manual, 


Order No. 9800478. These publications 
may be 
ordered from Intel's Literature Department. 


Readers should be familiar with section 2.9 of the 
8086 Family User's Manual 
in order to benefit 


from the material in this section. 


High level language programmers 
can access a 
useful subset of the 8087's (real or emulated) 
capabilities. 
The 
PLlM-86 
REAL 
data 
type 


corresponds to the NDP's short real (32-bit) for- 
mat. This data type provides a range of about 
8.43*10-37 ~ Ixl ~ 3.38*1038, with about seven 
significant decimal digits. This representation 
is 


adequate 
for 
the 
data 
manipulated 
by many 
microcomputer applications. 


The utility of the REAL data type is extended by 
the 
PLlM-86 
compiler's 
practice 
of 
holding 


intermediate results in the 8087's temporary real 
format. This means that the full range and preci- 
sion 
of 
the 
processor 
may 
be 
utilized 
for 


intermediate 
results. Underflow, 
overflow, 
and 


rounding errors are most likely to occur during 
intermediate 
computations 
rather 
than 
during 


calculation 
of 
an 
expression's 
final 
result. 


Holding 
intermediate 
results in temporary 
real 


format greatly reduces the likelihood of overflow 
and 
underflow 
and 
eliminates 
roundoff 
as a 


serious source of error until the final assignment 
of the result is performed. 


The compiler 
generates 
8087 code to evaluate 
expressions 
that 
contain 
REAL 
data 
types, 


whether 
variables 
or constants 
or both. 
This 


means that addition, subtraction, 
multiplication, 


division, comparison, 
and assignment of REALs 


will be performed by the NDP. INTEGER expres- 
sions, on the other hand, 
are evaluated on the 


CPU. 


Five built-in 
procedures 
(table 
S-20) give the 


PL/M-86 
programmer 
access to 8087 functions 


manipulated 
by the processor 
control 
instruc- 


tions. 
Prior 
to 
any 
arithmetic 
operations, 
a 


typical PLlM-86 
program 
will setup the NDP 


after power up using the INIT$REAL$MATH 
$UNIT 
procedure 
and 
then 
issue 


SET$REAL$MODE 
to 
configure 
the 
NDP. 


SET$REAL$MODE loads the 8087 control word, 
and its l6-bit parameter has the format shown in 
figure 
S-7. 
The 
recommended 
value 
of 
this 


parameter is 033EH (projective closure, round to 
nearest, 64-bit precision, 
interrupts 
enabled, all 


exceptions 
masked 
except 
invalid 
operation). 


Other settings may be used at the programmer's 
discretion. 


Procedure 
8087 Instruction 
Description 


INIT$REAL$MATH$U NIT(1) 
FINIT 
Initialize processor. 


SET$REAL$MODE 
FLDCW 
Set exception masks, rounding 
precision, and infinity controls. 


GET$REAL$ERROR(2) 
FNSTSW& FNCLEX 
Store, 
then 
clear, 
exception 
flags. 


SAVE$REAL$STATUS 
FNSAVE 
Save processor state. 


RESTORE$REAL$STATUS 
FRSTOR 
Restore processor state. 


(1)Also initializes interrupt pointers for emulation. 


(2)Returns low-order byte of status word. 


If any exceptions 
are unmasked, 
an exception 


handler must be provided in the form of an inter- 
rupt procedure that is designated to be invoked by 
CPU interrupt 
pointer (vector) number 
16. The 


exception 
handler 
can 
use 
the 
GET$REAL 


$ERROR procedure to obtain the low-order byte 
of the 8087 status word and to then clear the 
exception 
flags. 
The 
byte 
returned 
by 


GET$REAL$ERROR 
contains 
the 
exception 


flags; these can be examined 
to determine 
the 


source of the exception. 


The 
SAVE$REAL$ST ATUS 
and 
RESTORE 


$REAL$ST ATUS procedures 
are provided 
for 


multi-tasking environments where a running task 
that uses the 8087 may be preempted by another 
task that also uses the 8087. It is the responsibility 
of 
the 
preempting 
task 
to 
issue 


SAVE$REAL$ST ATUS before 
it executes any 
statements that affect the 8087; these include the 
INIT$REAL$MATH$UNIT 
and 
SET$REAL 


$MODE procedures as well as arithmetic expres- 
sions. 
SAVE$REAL$ST ATUS 
saves the 
8087 


state (registers, status, and control words, etc.) on 
the CPU's 
stack. 
RESTORE$REAL$STATUS 


reloads the state information; 
the preempting task 


must invoke this procedure before terminating in 
order to restore the 8087 to its state at the time the 
running 
task was preempted. 
This enables the 


preempted 
task to resume execution 
from the 


point of its preemption. 


Note that the PL/M-86 
compiler prefixes every 
8087 instruction 
with a CPU WAIT. Therefore, 
programmers 
should 
not 
code 
PL/M-86 


statements that generate 8087 instructions if the 


NDP can request an interrupt and that interrupt is 
blocked 
(this 
may 
result 
in the 
endless 
wait 


condition described in section S.6.) 


The ASM-86 assembly language provides a single 
uniform set of facilities for all combinations 
of 


the 
8086/8088/8087 
processors. 
Assembly 


language programs 
can be written 
to be com- 


pletely independent of the processor set on which 
they are destined to execute. This means that a 
program written originally for an 8088 alone will 
execute on an 
8086/8087 
combination 
without 


re-assembling. 
The programmer's 
view of the 


hardware is a single machine with these resources: 


• 
160 instructions 


• 
12data types 
• 
8 general registers 


• 
4 segment registers 
• 
8 floating-point 
registers, 
organized 
as a 


stack 


The combination 
of the assembly language and 


the 8087 emulator decouple the source code from 
the execution vehicle. For example, the assembler 
automatically inserts CPU WAIT instructions in 
front of those 8087 instructions that require them. 
If the program actually runs with the emulator 
rather 
than 
the 
8087, 
the 
WAITs 
are 
auto- 


matically removed at link time (since there is no 
NDP for which to wait). 


Defining Data 


The 
ASM-86 
directives 
shown 
in table 
S-21 
allocate storage for 8087 variables and constants. 
As with other storage allocation 
directives, the 


assembler 
associates 
a type with any variable 


defined with these directives. The type value is 
equal to the length of the storage unit in bytes (10 
for DT, 8 for DQ, etc.). The assembler checks the 
type of any variable coded in an instruction to be 
certain that it is compatible with the instruction. 
For example, the coding FIADD ALPHA will be 
flagged as an error if ALPHA's 
type is not 2.or 4, 
because 
integer addition 
is only available 
for 


word and short integer data types. The operand's 
type 
also 
tells the 
assembler 
which 
machine 


instruction 
to produce; 
although 
to the pro- 
grammer there is only an FIADD instruction, 
a 


different machine instruction is required for each 
operand type. 


On occasion it is desirable to use an instruction 
with an operand tha~ has no declared type. For 
example, if register BX points to a short integer 
variable, 
a 
programmer 
may 
want 
to 
code 


FIADD [BX]. This can be done by informing the 
assembler of the operand's type in the instruction, 
coding 
FIADD 
DWORD 
PTR 
[BX]. 
The 


corresponding 
ov~rrides 
for the other 
storage 


allocations are WORD PTR, QWORD PTR, and 
TBYTEPTR. 


The assembler does not, however, check the types 
of operands 
used in processor 
control 
instruc- 


tions. Coding FRSTOR [BP] implies that the pro~ 
grammer has set up register BP to point to the 
stack location where the processor's 94-byte state 
record has been previously saved. 


The initial values for 
8087 constants 
may be 


coded in several different 
ways. Binary integer 


constants may be specified as bit strings, decimal 
integers, octal integers, or hexadecimal 
strings. 


Packed decimal values are normally 
written as 


decimal 
integers, 
although 
the 
assembler 
will 


accept 
and 
convert 
other 
representations 
of 


integers. Rea! values may be written as ordinary 
decimal real numbers (decimal point required), as 
decimal numbers in scientific notation, or as hex- 
adecimal 
strings. 
Using hexadecimal 
strings is 


primarily 
intended 
for 
defining 
special values 
such as infinities, 
NANs, 
and 
nonnormalized 


numbers. 
Most 
programmers 
will 
find 
that 


ordinary decimal and scientific decimal provide 
the simplest way to initialize 
8087 constants. 


Figure S-20 compares several ways of setting the 
various 8087 data types to the same initial value. 


Note that preceding 8087 variables and constants 
with the ASM-86 EVEN directive ensures that the 
operands will be word-aligned 
in memory. This 


will produce the best performance 
in 8086-based 


systems, and is good practice even for 8088 soft- 
ware, in the event that the programs 
are trans- 


ferred to an 8086. All 8087 data types occupy 
integral numbers of words so that no storage is 
"wasted" 
if blocks 
of 
variables 
are 
defined 
together 
and 
preceded 
by 
a 
single 
EVEN 


declarative, 


The ASM-86 RECORD and STRUC (structure) 
declaratives can be very useful in NDP program- 
ming. The record facility can be used to define the 
bit fields of the control, 
status, and tag words. 


Figure S-21 shows one definition 
of the status 
word and how it might be used in a routine that 
polls the 8087 until it has completed an instruc- 
tion. 


Because structures 
allow different 
but 
related 


data types to be grouped together, they often pro- 
vide a natural way to represent "real world" data 
organizations. 
The 
fact 
that 
the 
structure 


template may be "moved" 
about in memory adds 


to its flexibility. Figure S-22 shows a simple struc- 


Directive 
Interpretation 
8087 Data Types 


OW 
Define Word 
Word integer 


DO 
I 
Define Doubleword 
Short integer, short real 


'., , 
DQ 
Define Quadword 
Long integer, long real 


DT 
Define Tenbyte 
Packed decimal, temporary real 


; 
EVEN 
FORCE 
WORD 
ALIGNMENT 
WORD 
INTEGER 
OW 
11111111100DD10B 
BIT 
STRING 


SHORT 
INTEGER 
00 
OFFFFFF82H 
HEX 
STRING 
MUST 
START 
WITH 
DIGIT 
LONG 
INTEGER 
DQ 
-126 
ORDINARY 
DECIMAL 
SHORT 
REAL 
DO 
-126.0 
NOTE 
PRESENCE 
OF 
LONG 
REAL 
DO 
-1.26E2 
• 'SCIENTIFIC" 
PACKiD 
DECIHAL 
DT 
-126 
.; ORDINARY 
DECIMAL 
INTEGER 
IN 
THE 
FOLLOWING, 
SIGN 
AND 
EXPONENT 
IS 
'C005', 
SIGNIFICAND 
IS 
'7EOO 
... 00', 
'R' 
INFORMS 
ASSEMBLER 
THAT 
THE 
STRING 
REPRESENTS 
A 
REAL 
DATA 
TYPE. 


; RESERVE 
SPACE 
FOR 


STATUS_WORD 
; 
LAY 
OUT 
STATUS 
STATUS 
RECORO 


& 
BUSY: 
1, 


& 
COND 
CODE3: 
1, 


& 
STACK 
TOP: 
3, 


& 
COND 
CODE2: 
1, 


& 
COND-CODE1: 
1, 


& 
COND-CODEO: 
1, 


& 
INT 
REQ: 
1, 


& 
RESiRVED: 
1, 


& 
P 
F LA G : 
1 , 


& 
U-FLAG: 
1, 


& 
O-FLAG: 
1, 


& 
Z-FLAG: 
1, 


& 
D-FLAG: 
1, 


& 
I-FLAG: 
1 


; 
POLL-STATUS 
WORD 
UNTIL 
8087 
IS 
NOT 
BUSY 
POLL: 
FNSTSW 
STATUS 
WORD 
TEST 
STATUS-WORD, 
MASK 
BUSY 
JNZ 
POLL- 


STATUS 
WORD 


OW 
? 


WORD 
FIELDS 


N 
OBS 
DO 
;SHORT 
INTEGER 
MiAN 
DQ 
; LONG 
REAL 
MODE 
OW 
;WORD 
INTEGER 
STD 
OEV 
DQ 
;LONG 
REAL 
;ARRAY 
OF 
OBSERVATIONS 
WORD 
INTEGER 
TEST 
SCORES 
OW 
1000 
DUP 
(?) 
SAMPLE 
iNDS 


ture that might be used to represent data con- 
sisting of a series of test score samples. A struc- 
ture could also be used to define the organization 
of the information 
stored 
and loaded 
by the 


FSTENV and FLDENV instructions. 


Addressing 
Modes 


8087 memory data can be accessed with any of the 
CPU's 
five memory 
addressing 
modes. 
This 
means that 8087 data types can be incorporated in 


data aggregates ranging from simple to complex 
according to the needs of the application. 
The 
addressing modes, and the ASM-86 notation used 
to specify them in instructions, 
make the access- 


ing of structures, arrays, arrays of structures, and 
other organizations 
direct and straightforward. 
Table S-22 gives several examples of 8087 instruc- 
tions coded with operands that illustrate different 
addressing modes. 


Intel offers two software products 
that provide 


the 
functional 
equivalent 
of 
an 
8087, 


implemented 
in 
8086/8088 
software. 
The 
full 


emulator (E8087) emulates all 8087 instructions. 
The partial emulator (PE8087) is a smaller version 
that implements only the instructions 
needed to 


support PL/M-86 
programs. 
The full emulator 
adds about 
16k bytes to a program, 
while the 


partial 
emulator 
executes 
in 
about 
8k. 
Any 


emulated program 
will deliver the same results 


(except for timing) 
if it is executed 
on 8087 
hardware. 


The emulators 
may be viewed as conSlStmg of 


emulated 
hardware 
and emulated 
instructions. 
The emulators 
establish 
in CPU 
memory 
the 


equivalent of the 8087 register stack, control, and 
status words and all other programmer-accessible 
elements of the NDP architecture. 
The emulator 
instructions utilize the same algorithms 
as their 


hardware counterparts. 
Emulator instructions are 


actually 
implemented 
as 
CPU 
interrupt 
pro- 


cedures. During relocation and linkage the 8087 
machine instructions 
generated 
by the ASM-86 
and PLlM-86 translators are changed to software 
interrupt 
(INT) instructions 
which invoke these 


procedures as the CPU processes its instruction 
stream. 


Since the decision to produce 
real or emulated 


8087 instructions is made at link time, a program 
may be switched from one mode to the other 
without retranslating 
the source code. When the 
PLlM-86 
compiler or ASM-86 assembler places 


an 
8087 machine 
instruction 
into 
an 
object 
module, it also inserts a special external reference. 
This reference is satisfied by linking the object 
module to one of two Intel-supplied libraries: 
the 


real library, or the emulator 
library. If the real 


library is specified, LINK-86 simply deletes the 
external 
references, 
leaving 
the 
original 
8087 


machine instructions. 


To run on an emulated 8087, the object program 
is linked to the emulator library and to a file con- 
taining the code of either the full or the partial 
emulator. 
LINK-86 then adds the emulator code 


to the program 
and changes the 8087 machine 


instructions (and their preceding WAITs) to CPU 
software 
interrupt 
instructions. 
Any 
FWAIT 


instructions are also changed to CPU NOPs. 


Note that an explicitly-coded CPU WAIT instruc- 
tion will not be changed; if it is executed under 
emulation, the CPU will wait forever. This is why 


ALPHA 
is a simple scalar (mode is 
direct). 


BETA is a field in a structure that is 
"overlaid" 
on 
ALPHA 
(mode 
is 
direct). 


BXcontains the address of a long real 
variable (mode is register indirect). 


ALPHA is an array and 81 contains the 
offset of an array element from the 
start of the array (mode is indexed). 


BP 
contains 
the 
address 
of 
a 
structure on the CPU stack and BETA 
is a field in the structure 
(mode is 
based). 


BXcontains the address of a packed 
decimal array and 01 contains the off- 
set 
of an array element 
(mode 
is 
based indexed). 


the FWAIT mnemonic 
should always be used 


when the external processor that the CPU is to 
wait for is an 8087. 


In order to be compatible 
with E8087, ASM-86 


programs 
should 
observe 
the 
following 


conventions: 


• 
Their 
stack 
segment 
and 
class should 
be 


named STACK. 


• 
Interrupt 
pointer 
(vector) 
16 should 
be 


designated for the user's exception handler 
interrupt procedure. 


• 
The external 
procedure 
INIT87 
should be 


called in the program's 
initialization (power- 


up) sequence. If the emulator is being used, 
this procedure 
will initialize CPU interrupt 


pointers 20-31 to the addresses of emulator 
procedures 
and will execute an (emulated) 


FINIT 
instruction. 
If the program 
is not 


being emulated, 
INIT87 simply executes the 


FINIT instruction. 


PLlM-86 
automatically 
observes corresponding 


conventions. 


Figures S-23 and S-24 show the PLlM-86 
and 
ASM-86 code for a simple 8087 program, called 
ARRSUM. 
The 
program 
references 
an array 
(X$ARRAY), 
which contains 
0-100 short 
real 


values; the integer variable N$OF$X indicates the 
number 
of array 
elements 
the program 
is to 


consider. 
ARRSUM 
steps through 
X$ARRA Y 


accumulating three sums: 


• 
SUM$X, the sum of the array values; 


• 
SUM$INDEXES, 
the sum 
of each array 
value times its index, where the index of the 
first element is 1, the second is 2, etc.; 


• 
SUM$SQUARES, 
the sum of each array 
element squared. 


(A true program, 
of course, would go beyond 


these steps to store and use the results of these 
calculations.) 
The control 
word is set with the 


recommended values: projective closure, round to 
nearest, 64-bit precision, interrupts enabled, and 
all exceptions masked except invalid operation. It 


is assumed that an exception handler has been 
written to field the invalid operation, if it occurs, 
and that it is invoked by interrupt 
pointer 
16. 


Either version of the program 
will run on an 


actual or an emulated 8087 without altering the 
code shown. 


The PLlM-86 version of ARRSUM (figure S-23) 
is very straightforward 
and illustrates how easily 


the 8087 can be used in this language. 
After 


declaring 
variables 
the 
program 
calls built-in 


procedures 
to 
initialize 
the 
processor 
(or 
its 


emulator) and to load the control word. The pro- 
gram clears the sum variables 
and then steps 


through X$ARRAY with a DO-loop. 
The loop 


control takes into account PL/M-86's 
practice of 


considering the index of the first element of an 
array 
to 
be 
O. 
In 
the 
computation 
of 


SUM$INDEXES, 
the built-in procedure FLOAT 
converts 
I+ 1 from integer to real because the 


language 
doe$ 
not 
support 
"mixed 
mode" 


arithmetic. One of the strengths of the NDP, of 


ISIS-II 
PLfM-86 
DEBUG 
V2.1 
COMPILATION 
OF 
MODULE 
ARRAYSUM 
OBJECT 
MODULE 
PLACED- IN 
:F4:ARRSUM.OBJ 
COMPILER 
INVOKED 
BY: 
:FO:PLM86 
:F4:ARRSUM.P86 
XREF 


ARRAY$SUM: 
DO, 


2 
DECLARE 
(SUM$X.SUM$INDEXES.SUM$SQUARES) 
REAL; 


3 
DECLARE 
X$ARRAY 
(100) 
REAL, 


4 
DECLARE 
(N$OF$X.I) 
INTEGER; 
5 
DECLARE 
CONTROL$87 
LITERALLY 
•033EH' ; 


fo 
ASSUME 
X$ARRAY 
AND 
N$OF$X 
ARE 
INITIALIZED 
*f 


fo 
PREPARE 
THE 
8087. 
OR 
ITS 
EMULATOR 
Of 
CALL 
INIT$REAL$MATH$UNIT; 


CALL 
SET$REAL$MODE(CONTROL$87), 


f* 
CLEAR 
SUMS 
*f 
SUM$X. 
SUM$INDEXES. 
SUM$SQUARES 
= 0.0, 


f* 
LOOP 
THROUGH 
X$ARRAY. 
ACCUMULATING 
SUMS 
*f 
DO 
I = 
0 TO 
UOF$X 
- 
1, 
SUM$X 
= SUM$X 
+ X$ARRAY(I), 
SUM$INDEXES 
= 
SUM$INDEXES 
+ 


(X$ARRAY(I) 
° FLOAT(I 
+ 1»), 
SUM$SQUARES 
= SUM$SQUARES 
+ 
(X$ARRAY(I) 
° X$ARRAY(I)), 
END, 


fo 
ETC ...of 


14 
END 
ARRAY$SU)l, 


9 
1 
10 
2 


11 
2 


0002H 
151 
ARRAYSUM 
• 
PROCEDURE 
STACK=0002H 
CONTROL87. 
LITERALLY 
7 
FLOAT. 
BUILTIN 
11 


4 
019EH 
2 
1.. 
INTEGER 
9 
INITREALMATHUNIT 
BUILTIN 
6 
4 
019CH 
2 
NOFX 
• 
INTEGER 
9 
SETREALMODE. 
BUILTIN 
7 
2 
0004H 
4 
SUMINDEXES 
REAL 
8 
11 


2 
0008H 
4 
SUMSQUARES 
REAL 
8 
12 
2 
OOOOH 
4 
SUMX 
• 
REAL 
8 
10 
3 
OOOCH 
400 
XARRAY 
• 
REAL 
ARRAY(100) 


MODULE 
INFORMATION: 
, 


CODE 
AREA 
SIZE 
= 0099H 
1530 
CONSTANT 
AREA 
SIZE 
= 0004H 
40 


VARIABLE 
AREA 
SIZE 
= 01AOH 
4160 
MAXIMUM 
STACK 
SIZE 
= 0002H 
20 


33 
LINES 
READ 


o 
PROGRAM 
ERROR(S) 


END 
OF 
PL/M-86 
COMPILATION 


course, is that it does support arithmetic on mixed 
data types, and assembly language programmers 
can take advantage of this facility. 


The ASM-86 version (figure S-24) defines the 
external procedure INIT87, which makes the dif- 
ferent initialization requirements of the processor 
and its emulator transparent 
to the source code. 


After defining the data, and setting up the seg- 


ment registers and stack pointer, 
the program 


cans INIT87 and loads the control 
word. The 


computation 
begins with the next three instruc- 


tions, 
which 
clear 
three 
registers 
by loading 


(pushing) zeros onto the stack. As shown in figure 
S-25, these registers remain at the bottom of the 
stack throughout 
the computation 
while tem- 


porary values are pushed on and popped off the 
stack above them. 


ISIS-II 
8086/8087/8088 
MACRO 
ASSEMBLER 
V3.0 
ASSEMBLY 
OF 
MODULE 
ARRSUM 


OBJECT 
MODULE 
PLACED 
IN 
:Fl :ARRSUM.OBJ 
ASSEMBLER 
INVOKED 
BY: 
:FO:ASM86 
:F1:ARRSUM.A86 
XREF 


LINE 
SOURCE 


1 
;DEFINE 
INITIALIZATION 
ROUTINE 
2 
EXTRN 
INIT87 :FAR 


3 
4 
;ALLOCATE 
SPACE 
FOR 
DATA 


5 
DATA 
SEGMENT 
PUBLIC 
'DATA' 


6 
CONTROL 
87 
0111 
033EH 


7 
N OF 
X - 
0111 
? 


8 
X:::~RtlAY 
DO 
100 
DUP 
(?) 


9 
SUM 
X 
DO 
10 
SUM-INDEXES 
DO 
11 
Su.~-SQU ARES 
DO 
12 
DATA 
ENDS 


0000 
3E03 
0002 
???? 


0004 
(ioo 
???????? 
) 
0194 
???????? 
0198 
???????? 
019C 
???????? 


8086/8087/8088 
MACRO 
ASSEMBL.":R 
ARRSUM 


LOC 
OBJ 
LINE 
SOURCE 


13 
14 
;ALLOCATE 
C PU 
STACK 
SPACE 


15 
STACK 
S.":GMENT STACK 
•S'rACK' 


0000 
(200 
16 
DII 
200 
DUP 
(7) 
???1 


17 
18 
; LABEL 
INITIAL 
TOP 
OF 
STACK 


0190 
19 
S'rACK TOP 
LABEL 
IIORD 


20 
STACK- 
ENDS 
21 
22 
CODll 
SEGMENT 
PUBLIC 
•CODE' 


23 
ASSUME 
CS:CODll,DS:DATA,SS:STACK,ES:NOTHING 


24 


0000 
25 
START: 
0000 
BB---- 
R 
26 
MOV 
AX,DATA 


0003 
8ED8 
27 
MOV 
DS,AX 
0005 
BB---- 
R 
28 
MOV 
AX,STACK 
0008 
8EDO 
29 
MOV 
SS,AX 
OOOA 
BC9001 
R 
30 
MOV 
SP,OFFSET 
STACK_TOP 


31 
32 
jASSUME 
X 
ARRAY & N OF 
X 
ARE 
INITIALIZED. 


33 
; 
NOTE: 
PROGRAM 
ZEROS 
N OF 
X 


34 
;PREPARE 
THE 
8087 
OR 
ITS 
EMULATOR. 
35 
OOOD 
9AOOOO---- 
E 
36 
CALL 
INIT87 
0012 
9BD92EOOOO 
R 
37 
FLDCII 
CONTROL_87 


38 
39 
;CLEAR 
3 REGISTERS 
TO 
HOLD 
RUNNING 
SUMS. 
40 
0017 
9BD9EE 
41 
FLDZ 
OOlA 
9BD9EE 
42 
FLDZ 
cr 


001D 
9BD9EE 
43 
FLDZ 


44 
45 
;SETUP 
CX 
AS 
LOOP 
COUNTllR & SI 
AS 
INDEX 
TO 
X_ ARRAY. 


46 
0020 
8BOEC200 
R 
47 
MOV 
CX,N 
OF 
X 
0024 
E329 
48 
JCXZ 
POP 
llESULTS 
;EXIT 
EARLY 
IF 
X_ARRAY 
EMPTY 
0026 
BB0400 
49 
MOV 
AX,'l'YPE lC_ARRAY 
0029 
F7E9 
50 
IMUL 
CX 
002B 
8BFO 
51 
MOV 
SI,AX 
52 
53 
;SI 
NOli CO~TAINS 
INDEX 
OF 
LAST 
ELEMllNT 
+ 1. 
54 
;LOOP 
THRU 
X_ARRAY 
ACCUMULATING 
SUMS. 
002D 
55 
SUM_NEXT: 
002D 
83EE04 
56 
SUB 
SI, TnE 
X ARRAY 
BACKUP 
ONE 
ELEMENT 


0030 
9BD9840400 
R 
57 
FLD 
X 
ARRAY[ sl] 
PUSH 
IT 
ONTO 
STACK 


0035 
9BDCC3 
58 
FADD 
S'I'(3),ST 
ADD 
INTO 
SUM 
OF 
X 


0038 
9BD9CO 
59 
FLD 
ST 
DUPLICATE 
X ON 
TOP 


003B 
9BDCC8 
60 
FMUL 


~~I~LsT 
SQUARE 
IT 


003E 
9BDEC2 
61 
FADDP 
ADD 
INTO 
SUII OF 
SQUARES 
62 
AND 
DISCARD 


0041 
9BDEOE0200 
R 
63 
FIMUL 
N OF 
X 
GET 
X 
TIMES 
ITS 
INDEX 
0046 
9BDEC2 
64 
FADDP 
S'I'(2T,ST 
ADD 
INTO 
SUM 
OP 
(INDEX 
* X) 
65 
AND 
llISCARD 


0049 
FFOE0200 
R 
66 
DEC 
N OF 
X 
REDUCE 
INDEX 
FOR 
NEXT 
ITERATION 
004D 
E2DE 
67 
LOOP 
SUM_WEXT 
CO~TINUE 


68 
69 
;POP 
R~NING 
SUMS 
INTO 
MEI~ORY 
004F 
70 
POP_RESULTS: 
004P 
9BD91E9COI 
R 
71 
FSTP 
SUM 
SQUARES 
0054 
9BD91 E9801 
R 
72 
PSTP 
SUM-INDEXES 
0059 
9BD91 E9401 
R 
73 
FSTP 
SUM=X 


74 
75 
;ETC .•. 
76 
77 
78 
CODE 
ENDS 
79 
0000 
80 
llND 
START 


NAME 
TYPE 
VALUE 
ATTRIBUTES, 
XREFS 


??SEG 
SEGMENT 
SIZE=OOOOH 
PARA 
PUBLIC 
CODE. 
SEGMENT 
SIZE=005EH 
PARA 
PUBLIC 
'CODE' 
22# 
23 
78 
CONTROL_87. 
V 
WORl)- 
OOOOH 
DATA 
6# 
37 
DATA. 
SEGMENT 
SIZE=OlAOH 
PARA 
PUBLIC 
'DATI.' 
5# 
12 
23 
26 


INIT87. 
L FAR 
OOOOH 
EXTRN 
2# 
36 


N 
OF 
X. 
V 
WORD 
0002H 
DATA 
7# 
47 
63 
66 
pnp 'REsuimi 
L NEAR 
004FH 
CODE 
48 
70# 


STA'C'K • 
SEGMENT 
SIZE=0190H 
PARA 
STACK 
'STACK' 


STACK 
TOP 
. 
V 
WORD 
0190H 
STACK 
19# 
30 


START- 
• 
L NEAR 
OOOOH 
CODE 
25# 
80 


SUM 
INDEXES 
V DWORD 
0198H 
DATA 
10# 
72 


SU,"NEXT. 
L NEAR 
002DH 
CODE 
55# 
67 
SUM-SQUARES 
V DWORD 
019CH 
DATA 
11# 
71 
SU,"X 
• 
V DWORD 
0194H 
DATA 
9# 
73 


X_AllRAY 
: • 
V 
DWORD 
0004H 
DATA 
8# 
49 
56 
57 


The program uses the CPU LOOP instruction to 
control its iteration through X_ARRAY; 
register 


CX, which LOOP automatically 
decrements, 
is 


loaded with N_OF _X, 
the number 
of array 


elements to be summed. 
Register SI is used to 


select (index) the array elements. The program 
steps 
through 
X_ARRAY 
from 
"back 
to 


front", 
so SI is initialized to point at the element 


just beyond the first element to be processed. The 
ASM-86 TYPE operator is used to determine the 
number of bytes in each array element. This per- 
mits changing X_ARRAY 
to a long real array by 


simply changing its definition 
(DD to DQ) and 


re-assembling. 


Figure S-25 shows the effect of the instructions in 
the program loop on the NDP register stack. The 
figure 
assumes 
that 
the 
program 
is 
in 
its 


first iteration, 
that N_OF _X 
is 20, and that 


X_ARRAY(l9) 
(the 20th element) contains the 


value 2.5. When the loop terminates, 
the three 


sums are left as the top stack elements so that the 
program 
ends 
by 
simply 
popping 
them 
into 


memory variables. 


This section describes features of the 8087 which 
will be of interest to groups of users who have 
special requirements. 
Most users will not need to 


understand 
this material 
in detail 
in order 
to 


utilize the NDP successfully. Most readers, then, 
can either browse this section, or skip it altogether 
in favor of the programming 
examples in section 


S.10. 


The first four topics in this section cover the 
8087's generation and handling of nonnormalized 
real values, zeros, infinities and NANs. In the 
great 
majority 
of 
applications, 
these 
special 


values will either not appear at all, or in the case 
of zeros, will function according to the normal 
rules of arithmetic. Next the bit encodings of each 
data type are summarized in table form, including 
special values. This information 
may be of use to 


programmers who are sorting these data types or 
are decoding unformatted memory dumps or data 
monitored from the bus. At the end of the section 
is a table that lists all 8087 exception conditions 
by class, and the processor's masked response to 
each exception. This information 
will principally 


be of use to writers of exception handlers and to 
anyone else interested 
in ascertaining 
the exact 


conditions under which the NDP signals a given 
type of exception. 


ST(O) 


ST(1) 


ST(2) 


0.0 
SU 


0.0 
SU 


0.0 
SU 


M_SQUARES 


M_INDEXES 


- 


ST(O) 
2.5 


T(1) 


ST(2) 
0.0 


ST(3) 
0.0 


X._ARRAY 
(19) 


SUM_SQUARES 


ST(O) 


ST(1) 


~ 


2.5 
X_A 


0.0 
SUM 


0.0 
SUM 


2.5 
SUM 
---- 
RRAY 
(19) 
ST 


SQUARES 
ST 


- 
fMUL 
ST , ST 
- 
---- - 
ST(O) 
6.25 
X_ARRAY(19)2 


ST(1) 
2.5 
X_ARRAY(19) 


ST(2) 
0.0 
SUM_SQUARES 


ST(3) 
0.0 
SUM_INDEXES 


ST(4) 
2.5 
SUM_X 
..- 
..- 


ST(O) 


ST(1) 


ST(2) 


ST(3) 


f I MU L 
N 
Of 
X 


50.0 
X_A 


6.25 
SUM 


0.0 
SUM 


2.5 
SUM 


(0) 
2.5 


(1) 
2.5 


(2) 
0.0 


(3) 
0.0 


(4) 
2.5 


X_ARRAY 
(19) 


X_ARRAY 
(19) 


ST(O) 
2.5 


ST(1) 
6.25 


ST(2) 
0.0 


ST(3) 
2.5 
.- 


fADDP 
ST(2),ST 


~§§'25 
SUM_SQUARES 


ST(1) 
50.0 
SUM_INDEXES 


sT(2) 
2.5 
SUM_X 


As discussed in section S.3, the 8087 generally 
stores 
nonzero 
real 
numbers 
in 
normalized 
floating point form; that is, the integer (leading) 
bit of the significand 
is always a 1. This bit is 
explicitly stored in the temporary real format, and 
is implicit in the short and long real forms. Nor- 
malized storage allows the maximum number of 
significant digits to be held in a significand of a 
given width, because leading zeros are eliminated. 


Denormals 


A denormal is the result of the NDP's 
masked 
response to an underflow exception. Underflow 
occurs when the exponent of a true result is too 
small to be represented in the destination format. 
For example, a true exponent of -130 will cause 
underflow if the destination is short real, because 
-126 is the smallest exponent 
this format 
can 
accommodate. 
(No underflow would occur if the 
destination 
were long or temporary 
real since 
these can handle exponents down to -1023 and 
-16,383, respectively.) 


The NDP's unmasked response to underflow is to 
stop and request an interrupt if the destination is 
a memory operand. If the destination is a register, 
the processor adds the constant 24,576 (decimal) 
to the true result's exponent, 
returns the result, 
and then 
requests 
an interrupt. 
The constant 
forces the exponent into the range of the tem- 
porary real format, and an exception handler can 
subtract 
out the constant 
to ascertain 
the true 


exponent. 
Thus, 
execution 
always stops when 


there is an unmasked underflow. 


The intent of the masked response to underflow is 
to allow computation 
to continue without pro- 


gram intervention, while introducing an error that 
carries about the same risk of contaminating 
the 


final result as roundoff 
error. Roundoff 
(preci- 


sion) errors 
occur 
frequently 
in real number 


calculations; 
sometimes they spoil the result of 


computation, 
but often they do not. Recognizing 


that roundoff 
errors are often non-fatal, 
com- 


putation 
usually proceeds and the programmer 


inspects the final result to see if these errors have 
had 
a significant 
effect. 
The 
8087's 
masked 


underflow response allows programmers 
to treat 


under flows in a similar manner; the computation 
continues and the programmer 
can examine the 
final result to determine if an underflow has had 
important 
consequences. 
(If the underflow 
has 


had a significant effect, an invalid operation will 
probably be signalled later in the computation.) 


Most 
computers 
underflow 
"abruptly"; 
they 
simply return a zero result, which is likely to pro- 
duce an unacceptable 
final result if computation 


continues. 
The 
8087, 
on 
the 
other 
hand, 
underflows 
"gradually" 
when 
the 
underflow 


exception 
is 
masked. 
Gradual 
underflow 
is 


accomplished by denormalizing the result until it 
is just within the exponent range of the destina- 
tion. Denormalizing means incrementing the true 
result's exponent and inserting a corresponding 
leading zero in the significand, shifting the rest of 
the significand one place to the right. Table 8-23 
illustrates how a result might be denormalized to 
fit a short real destination. 


Denormalization 
produces a denormal or a zero. 


Denormals 
are 
readily 
identified 
by 
their 


exponents, 
which are always the minimum 
for 


their formats; in biased form, this is always the 
bit string: 00... 00. This same exponent value is 
also assigned to the zeros, but a denormal has a 
nonzero significand. A denormal in a register is 
tagged special. 


The denormalization 
process may cause the loss 


of low-order significand bits as they are shifted 
off the right. In a severe case, all the significand 
bits of the true result are shifted out and replaced 
by the leading zeros. In this case, the result of 
denormalization 
is a true zero, and if the value is 


in a register, it is tagged as such. However, this is 
a comparatively rare occurrence, and in any case 
is no worse than "abrupt" 
underflow. 


Denormals 
are 
rarely 
encountered 
in 
most 


applications. 
Typical 
debugged 
algorithms 


generate 
extremely 
small 
results 
during 
the 


evaluation 
of intermediate 
subexpressions; 
the 


final result is usually of an appropriate magnitude 
for 
its 
short 
or 
long 
real 
destination. 
If 
intermediate results are held in temporary real, as 
is recommended, 
the great range of this format 


Operation 
Sign 
Exponent(1) 
Significand 


True Result 
0 
-129 
1a01011100 ... 00 


Denormalize 
0 
I·· 
-128 
oa 101011100 ... 00 


Denormalize 
0 
-127 
oa01010111 00 ... 00 


Denormalize 
0 
-126 
0a00101 0111 00 ... 00 
Denormal Result(2) 
0 
-126 
0a001010111 00 ... 00 


Notes: 
{1)expressed as unbiased, decimal number 


(2)Before storing, significand 
is rounded to 24 bits, integer bit is dropped, and exponent 
is 


biased by adding 126. 


makes underflow 
very unlikely. Denormals 
are 


likely to arise only when an application generates 
a great many intermediates, 
so many that they 


cannot 
be 
held 
on 
the 
register 
stack 
or 
in 


temporary 
real 
memory 
variables. 
If 
storage 


limitations force the use of short or long reals for 
intermediates, 
and small values are produced, 
underflow 
may 
occur, 
and 
if 
masked,. 
may 


generate denormals. 


Accessing a denormal may produce an exception 
as shown in table S-24. (The denormalized excep- 
tion signals that a denormal 
has been fetched.) 


Denormals may have reduced significance due to 
lost low-order bits, and an option of the proposed 
IEEE 
standard 
precludes 
operations 
on 
non- 


normalized 
operands. 
This 
option 
may 
be 


implemented in the form of an exception handler 
that responds to unmasked denormalized excep- 
tions. Most users will mask this exception so that 
computation 
may proceed; any loss of accuracy 
will be analyzed by the user when the final result 
is delivered. 


As table S-24 shows, the division and remainder 
operations do not accept denormal divisors and 
raise the invalid operation exception. Recall, also, 
that 
the 
transcendental 
instructions 
require 


normalized operands and do not check for excep- . 
tions. In all other cases, the NDP converts denor- 
mals to unnormals, 
and the unnormal arithmetic 


rules then apply. 


An unnormal is the "descendent" 
of a denormal 


and therefore of a masked underflow 
response. 


An unnormal may exist only in the temporary real 
format; it may have any exponent that a normal 
may have, but it is distinguished 
from a normal 


by the integer bit of its significand, 
which is 


always O. An unnormal 
in a register is tagged 
valid. 


Unnormals allow arithmetic to continue follow- 
ing an underflow while still retaining their identity 
as numbers which may have reduced significance. 
That is, unnormal 
operands 
generate unnormal 


results, so long as their unnormality has a signifi- 
cant effect on the result. 
Unnormals 
are thus 


prevented 
from 
"masquerading" 
as normals, 


numbers 
which have full significance. 
On the 


other hand, if an unnormal 
has an insignificant 


effect on a calculation with a normal, the result 
will be normal. 
For example, 
adding 
a small 


unnormal 
to a large· normal 
yields a normal 


result. The converse situation yields an unnormal. 


Table S-25 shows how the instruction 
set deals 
with unnormal operands. Note that the unnormal 
may be the original 
operand 
or a temporary 


created by the 8087 from a denormal. 


FLD(shortllong real) 


arithmetic (except following) 


Division or F.PREMwith 
denormal divisor 


Load as equivalent unnormal 


Convert (ina work area) denormal to equivalent 
unnormal and proceed 


Convert (in a work area) denormal to equivalent 
unnormal and proceed 


Return real indefinite 


ono.,t.. ••••• r- 
.,... •....•-,. 
------ 
Operation 
Result 


Division (un normal dividend 
only) 


FPREM (unnormal 
dividend 
only) 


Division!FPREM 
(unnormal 
divisor) 


Compare! 
FTST 


FSQRT 


FST, FSTP (shortllong 
real 
destination) 


FSTP (temporary 
real destination) 


FIST, FISTP, FBSTP 


FLD 


FXCH 


Transcendental 
instructions 


As discussed 
in section 
S.3, 
the real and 
packed 


decimal 
data types support 
signed zeros, 
while the 


binary 
integers 
represent 
a single 
zero, 
signed 


positive. 
The 
signed 
zeros 
behave, 
however, 
as 
though 
they 
are 
a single 
unsigned 
quantity. 
If 


necessary, 
the FXAM 
instruction 
may be used to 


determine 
a zero's 
sign. 


The zeros discussed 
above 
are called true zeros; 
if 


one of them 
is loaded 
or generated 
in a register, 


the register 
is tagged 
zero. 
Table 
S-26 
lists 
the 


results 
of 
instructions 
executed 
with 
zero 


Normalization 
of operand'with 
larger 
absolute 
value determines 
normalization 
of result. 


If 
either 
operand 
is 
unnormal, 
result 
is 


unnormal. 


Result is unnormal. 


Result is normalized. 


Signal invalid operation. 


Normalize 
as much as possible 
before 
making 
comparison. 


Normalize 
as 
much 
as 
possible 
before 
rounding. 


Signal invalid operation. 


Ifvalue is above destination's 
underflow 
boundary, 
then 
signal 
invalid operation; 
else 
signal underflow. 


Store as usual. 


Signal invalid operation. 


Load as usual. 


Exchange 
as usual. 


Undefined; 
operands 
must 
be normal 
and are 
not checked. 


operands 
and also shows how a true zero may be 


created 
from 
nonzero 
operands. 
(Nonzero 


operands 
are denoted 
"X" 
or "Y" 
in the table.) 


Only 
the 
temporary 
real 
format 
may 
contain 
a 
special 
class 
of 
values 
called 
pseudo-zeros. 
A 


pseudo-zero 
is an unnormal 
whose 
significand 
is 


all zeros, 
but whose 
(biased) 
exponent 
is nonzero 


(true 
zeros 
have 
a zero 
exponent). 
Neither 
is a 


pseudo-zero's 
exponent 
all 
ones, 
since 
this 


encoding 
is reserved 
for infinities 
and 
NANs. 
A 


pseudo-zero 
result 
will 
be 
produced 
if 
two 


unnormals, 
containing 
a total 
of more 
than 
64 


leading 
zero 
bits 
in 
their 
significands, 
are 


multiplied 
together. 
This is a remote 
possibility 
in 


most applications, 
but it can happen. 


FLD, FBLD (1) 
+0 
-0 
FILD (2) 


+0 
FST,FSTP 


+0 
-0 
+X (3) 
-X (3) 


FBSTP 
+0 
-0 
FIST, FISTP 
+0 
-0 
+X (4) 
-X (4) 


Addition 
+0 plus +0 
-0 plus-O 
+0 plus -0, -0 plus +0 
-X plus +X, +X plus-X 
±Oplus ±X, ±X plus ±O 


Subtraction 


+0 minus-O 
-0 minus +0 
+0 minus +0, -0 minus-O 
+X minus +X, -X minus-X 
±Ominus ±X, ±X minus ±O 


Multiplication 


+0· +0, -0·-0 
+0· -0, -0· 
+0 


+0· +X, +X· +0 
+0. -X, -X. 
+0 
-0· 
+X, +X·-O 


-0· 
-X, -X·-O 


+X· +Y, -X.-Y 
+X· -Y, -X. 
+Y 


+0 
-0 
'0 (5) 
'0 (5) 
tX 
(6) 


+0 
-0 
'0 (5) 
'0 (5) 
tX 
(6) 


+0 
-0 
+0 
-0 
-0 
+0 
+0, underflow 
(7) 
-0, underflow 
(7) 


'Division 
±O';- ±O 
±X.;- ±O 
+0';- +X, -O';--X 
+0';- -X, -0';- +X 
-X';- -Y, +X.;- +Y 
-X';- +Y, +X';--Y 


FPREM 
±Orem ±O 
±X rem ±O 
+0 rem +X, +0 rem -X 
-0 rem +X, -0 rem -X 
+X rem +Y, +X rem-Y 
-X rem -V, -X rem +Y 


FSQRT 
-0 
+0 


Compare 
±O:+X 
±O:±O 
±O:-X 


FTST 
±O 
FCHS 
+0 
-0 
FABS 
±O 
F2XM1 
+0 
-0 
FRNDINT 
+0 
-0 
FXTRACT 
+0 
-0 


Invalid operation 
Zerodivide 
+0 
-0 
+0, underflow 
(8) 


-0, underflow 
(8) 


Invalid operation 
Invalid operation 
+0 
-0 
+0 (9) 


-0 
(9) 


A<B 
A=B 
A>B 


Both +0 
Both -0 


Notes: 


(1) Arithmetic 
and compare operations with real memory operands interpret the memory operand signs in 
the same way. 


(2) Arithmetic and compare operations with binary integers interpret the integer sign in the same manner. 


(3) Severe underflows 
in storing to short or long real may generate zeros. 


(4) Small values (Ixl < 1) stored into integers may round to zero. 


(5) Sign is determined 
by rounding mode: 


• = + for nearest, up or chop 
• = - for down 


(6) t = sign of X. 


(7) Very small values of X and Y may yield zeros, after rounding of true result. NDP signals underflow 
to 
warn that zero has been yielded by nonzero operands. 


(8) Very small X and very large Y may yield zero, after rounding of true result. NDP signals underflow 
to 
warn that zero has been yielded from nonzero operands. 


(9) When Y qivides into X exactly. 


Pseudo-zero 
operands 
behave 
like unnormals, 
except in the following cases where they produce 
the same results as true zeros: 


• 
compare and test instructions 


• 
FRNDINT (round to integer) . 


• 
division, .where the dividend is either .a true 
zero 
or 
a pseudo-zero 
(the 
divisor 
is a 
pseudo-zero) . 


In addition and subtraction of a pseudo-zero and 
a true zero or another pseudo-zero, 
the pseudo- 
zero(s) behave like unnormals, 
except for the 
determination 
of the result's 
sign. The sign is 
determined as shown in table S-26 for two true 
zero operands. 


The real formats support signed representations 
of infinities. 
These values are encoded 
with a 
biased exponent of all ones and a significand of 


IAOO ... OO; if the infinity 
is in a register, 
it is 
tagged 
special'. 
The 
significand 
distinguishes 
infinities from NANs, induding real indefinite. 


A programmer may code an infinity, orit may be 
created by the NDP as its masked response to an 
overflow or a zerodivide exception. 
Note that 
when 
rounding 
is up 
or 
down,_ the 
masked 
response 
may 
create 
the 
largest 
valid 
value 
representable in the destination rather than infin- 
ity. See table 
S-33 for 
details. 
As operands, 
infinities behave somewhat differently depending 
on how the infinity control field in the control 
word is set (see table S-27). When the projective 
model of infinity is selected, the infinities behave 
as a single unsigned representation; 
because of 
this; infinity cannot be compared with any value 
except infinity. In affine mode, the signs of the 
infinities 
are 
observed, 
and 
comparisons 
are 
possible. 


Operation 
Projective 
Result 
Affine 
Result 


Addition 


+00 plus +00 
Invalid operation 
+00 


-00 plus-oo 
Invalid operation 
-00 


+00 plus-oo 
Invalid operation 
Invalid operation 
-00 plus +00 
Invalid operation 
Invalid operation 
±ooplus ±X 
'00 
'00 
±X plus ±oo 
'00 
'00 


Subtraction 


+00 minus -00 
Invalid operation 
+00 


-00 minus +00 
Invalid operation 
-00 


+00 minus +00 
Invalid operation 
Invalid operation 


-00 minus -00 
Invalid operation 
Invalid operation 
±oominus±X 
'00 
'00 
±X minus±oo 
too 
too 


Multiplication 
. 


±oo • ±oo 
E900 
E900 
±oo. ±Y 
E900 
E900 
±O. 
±OO, ±oo • ±O 
Invalid operation 
Invalid operation 


Operation 
Projective 
Result 
Affine 
Result 


Division 


±oo 7 ±oo 
Invalid operation 
Invalid operation 
±oo-;-±X 
E900 
E900 
±X -;-±oo 
E90 
E90 


FSQRT 
-00 
Invalid operation 
Invalid operation 
+00 
Invalid operaton 
+00 


FPREM 
±oorem ±oo 
Invalid operation 
Invalid operation 
±oorem ±X 
Invalid operation 
Invalid operation 
±Y rem ±oo 
'Y 
'Y 
±Orem ±oo 
'0 
'0 


FRNDINT 
±oo 
'00 
'00 


FSCALE 
±ooscaled by ±oo 
Invalid operation 
Invalid operation 
±ooscaled by ±X 
'00 
'00 
±Oscaled by ±oo 
'0 
'0 
±Y scaled by ±oo 
Invalid operation 
Invalid operation 


FXTRACT 
±oo 
Invalid operation 
Invalid operation 


Compare 
j 


±oo: ±oo 
A=B 
-00 < +00 
±oo:±Y 
A ? B (and) invalid 
operation 
-00 < Y < +00 
±oo: ±O 
A ? B (and) invalid 
operation 
-00 < 0 < +00 


FTST 
A? B (and) invalid 
±oo 
operation 
'00 


Notes: 
X = zero or nonzero operand 


, Y = nonzero operand 
. 


• = sign of original operand 
t = sign is complem~nt of original operand's sign 


E9= sign is "exclusive 
or" original operand signs (+ if operands had same sign, 


- if operands had different signs) 


A NAN (Not-A-Number) 
is a member of a class 
of special values that exist in the real formats 
only. A NAN has an exponent of 11.. .11B, may 
have either sign, and may have any significand 
except 
1<\00... OOB, which 
is assigned 
to 
the 
infinities. A NAN in a register is tagged special. 


The 8087 will generate 
the special NAN, 
real 
indefinite, 
as its masked response to an invalid 
operation 
exception. 
This 
NAN 
is 
signed 


negative; its significand 
is encoded 
1<\100... 00. 
All other NANs represent 
programmer-created 
values. 


Whenever the NDP uses an operand 
that is a 
NAN, 
it signals invalid operation. 
Its masked 
response to this exception is to return the NAN as 
the operation's 
result. 
If both operands 
of an 
instruction are NANs, the result is the NAN with 
the larger absolute value. In this way, a NAN that 
enters 
a computation 
propagates 
through 
the 
computation 
and will eventually be delivered as 


scendental 
instructions 
do 
not 
check 
their 
operands, and a NAN will produce an undefined 
result. 


By unmasking 
the invalid operation 
exception, 
the programmer 
can use NANs to trap to the 
exception 
handler. 
The 
generality 
of 
this 
approach and the large number of NAN values 
that are available, provide the sophisticated pro- 
grammer with a tool that can be applied to a 
variety of special situations. 


For example, 
a compiler 
could use NANs to 


references to uninitialized (real) array elements. 
The compiler could pre-initialize each array ele- 
ment with a NAN whose significand contained 
the index (relative position) of the element. If an 
application program attempted 
to access an ele- 


ment that it had not initialized, it would use the 
NAN placed there by the compiler. If the invalid 
operation exception were unmasked, an interrupt 
would occur, and the exception handler would be 
invoked. The exception handler could determine 
which 
element 
had 
been 
accessed, 
since the 


operand address field of the exception pointers 
would point to the NAN, and the NAN would 
contain the index number of the array element. 


NANs could also be used to speed up debugging. 
In its early testing phase a program often contains 
multiple errors. An exception handler could be 
written to save diagnostic information in memory 
whenever 
it 
was 
invoked. 
After 
storing 
the 


diagnsotic data, 
it could supply a NAN as the 


result of the erroneous instruction, and that NAN 
could point to its associated diagnostic area in 
memory. 
The 
program 
would 
then 
continue, 


•.••.••.U.1.&1I6 
a 
\,.ul1'Cl'CUl 
1"1/"\.1"1lOr 
eacn 
error. 
When 


the program 
ended, 
the NAN results could be 


used to access the diagnostic data saved at the 
time the errors occurred. Many errors could thus 
be diagnosed and corrected in one test run. 


Tables S-28 through S-31 summarize how various 
types of values are encoded in the seven NDP data 
types. In all tables, the less significant bits are to 
the right and are stored in the lowest memory 
addresses. The sign bit is always the left-most bit 
of the highest-addressed byte. 


Notice that in every format one encoding is inter- 
preted as representing the special value indefinite. 
The 8087 produces this encoding as its response to 
a masked invalid operation exception. In the case 
of the reals, indefinite 
can be loaded and stored 


like any NAN and it always retains its special 
identity; programmers are advised not to use this 
encoding for any other purpose. Packed decimal 
indefinite may be stored by the NDP in a FBSTP 
instruction; 
attempting 
to use this encoding in 


a 
FBLD 
instruction, 
however, 
will have 
an 


undefined result. In the binary integers, the same 
encoding may represent either indefinite 
or the 


la:rgest negative number supported by the format 
(-215, 
-231 
or -263). 
The 8087 will store this 


encoding as its masked response to an invalid 
operation, or when the value in a source register 
represents, 
or rounds 
to, 
the largest 
negative 


integer representable by the destination. 
In situa- 


tions where its origin may be ambiguous, 
the 


invalid operation exception flag can be examined 
to see if the value was produced by an exception 
response. When this encoding is loaded, or used 
by an integer arithmetic or compare operation, it 
is always interpreted as a negative number; thus 
indefinite 
cannot 
be 
loaded 
from 
a 
packed 


decimal or binary integer. 


Class 
Sign 
Magnitude 


en 
(Largest) 
0 
11... 11 


Q) 
• 
• 
~ 
'iii 
• 
• 


0 
• 
• 
11. 
(Smallest) 
0 
00...01 
Zero 
0 
00... 00 


en 
(Smallest) 
1 
11...11 


Q) 
• 
• 


= 
• 
• 
I'll 
Cl 
• 
• 
Q)z 
(Largestllndefinite *) 
1 
00...00 


Word: /.-15 bits-.l 
Short: ~31 
bits-.l 
Long: ~63 
bits~ 


* If this encoding is used as a source operand 


(as in an integer load or integer arithmetic 
instruction), 
the 
8087 
interprets 
it as 
the 


largest negative number representable 
in the 


format: _215, _231, or -263. The 8087 willdeliver 
this encoding to an integer destination in two 
cases: 


1) 
if the 
result 
is the 
largest 
negative 


number, 


2) 
as the 
response 
to a masked 
invalid 


operation 
exception, 
in which case 
it 


represents 
the 
special 
value 
integer 


indefinite. 


Table S-32 lists every exception condition that the 
NDP 
detects 
and 
describes 
the 
processor's 


response when the relevant exception mask is set. 
The unmasked responses are described in table 
S-6. Note 
that 
if an 
unmasked 
overflow 
or 


underflow occurs in an FST or FSTP instruction, 
no result if stored, and the stack and memory are 
left as they existed before 
the instruction 
was 


executed. 
This gives an exception 
handler 
the 


opportunity to examine the offending operand on 
the stack top. 


When rounding is directed (the RC field of the 
control word is set to "up" 
or "down"), 
the 8087 


handles a masked 
overflow 
differently 
than it 


does for 
the 
"nearest" 
or 
"chop" 
rounding 


modes. 
Table 
S-33 shows the NDP's 
masked 


response when the true result is too large to be 
represented in it's destination real format. For a 
normalized result, the essence of this response is 
to deliver 00 or the largest valid number represen- 
table in the destination format, as dictated by the 
rounding mode and the sign of the true result. 
Thus, 
when RC=down, 
a positive overflow 
is 


rounded 
down to the largest positive number. 


Conversely, when RC=up, a negative overflow is 
rounded 
up to the largest negative number. 
A 


properly 
signed 
00 
is returned 
for 
a positive 


overflow with RC=up, 
or a negative overflow 


with RC=down. 
For an unnormalized 
result, the 
action is similar except that the the unnormal 
character of the result is preserved if the sign and 
rounding mode do not indicate that 
00 should be 


delivered. 


In all masked 
overflow 
responses 
for directed 


rounding, 
the overflow flag is not set, but the 


precision exception is raised to signal that the 
exact true result has not been returned. 
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TableS-29. 
Packed Decimal Encodings 


Class 
Sign 
digit 


(Largest) 
0 
0000000 
1 o 
0 
1 
1 o 
0 
1 
1 
001 
0 
0 
o 
0 
UI 
CIl 
• 
• 
• 
= 
• 
• 
• 
.iij 
• 
• 
• 
0 
(Smallest) 
0 
0000000 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
~ 


Zero 
0 
0000000 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


Zero 
1 
0000000 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
UI 
CIl 
(Smallest) 
1 
0000000 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
~ 
ftI 
• 
• 
• 
Cl 
• 
• 
• 
CIl 
Z 
• 
• 
• 
(Largest) 
0000000 
0 
0 
0 
0 
1 o 
0 
1 
1 
001 
1 
001 


Indefinite· 
1 
1111111 
1 
1 
1 
1 
1 
U U U U U U U U 
U U U U 


1.....-1 
byte -I- 
9 bytes 
-I 


• The packed decimal indefinite encoding is stored by FBSTP in response to a masked invalid 
operation 
exception. 
Attempting 
to load this value via FBLD produces 
an undefined 
result. 


Note: "UUUU" 
means bit values are undefined and may contain any value. 


Class 
Sign 
Biased 
Significand* 
Exponent 
aff...ff 


0 
11 ... 11 
11 ... 11 
• 
• 
• 
NANs 
• 
• 
• 
• 
• 
• 
0 
11 ... 11 
00 ... 01 


00 
0 
11 ... 11 
00 ... 00 


UI 
0 
11 ... 10 
11 ... 11 
CIl 
= 
• 
• 
• 
0iij 
Normals 
• 
• 
• 
0 
• 
• 
• 
~ 
0 
00 ... 01 
00 ... 00 
UI 
iG 
0 
00 ... 00 
11 ... 11 
CIl 
II: 
• 
• 
• 
Denormals 
• 
• 
• 
• 
• 
• 
0 
00 ... 00 
00 ... 01 
Zero 
0 
00 ... 00 
00 ... 00 


Class 
Sign 
Biased 
Significand* 


Exponent 
Aff .•. ff 


Zero 
1 
00 ... 00 
00 ... 00 
. 
1 
00 ... 00 
00 ... 01 
• 
• 
• 
Denormals 
• 
• 
• 
VI 
• 
• 
• 
~ 
1 
00 ... 00 
11 ... 11 
Qla: 
1 
00 ... 01 
00 ... 00 
• 
• 
• 


VI 
Normals 
• 
• 
• 
Ql 
• 
• 
• 
~ 
I 
" 
1 
11 ... 10 
11 ... 11 
'"C) 
Ql 
00 
1 
11 ... 11 
00 ... 00 
z 
1 
11 ... 11 
00 ... 01 
• 
• 
• 
• 
• 
• 
• 
• 
• 
VI~ I Indefinite 
1 
11 ... 11 
10 ... 00 
z 
• 
• 
• 
• 
• 
• 
• 
• 
• 
, 


1 
11 ... 11 
11 ... 11 
Short: ~ 
8 bits .1. 
23bits--+-l 
Long: ~11 
bits~52bits~ 


Class 
Sign 
Biased 
Significand 


- 
- 
Exponent 
IAft ..ft 


0 
11 ... 11 
111 ... 11 


III 
• 
• 
• 
Ql 
NANs 
• 
• 
• 
~ 
·iii 
• 
• 
• 
0 
0 
11 ... 11 
100 ... 01 
Q. 


00 
0 
11 ... 11 
100 ... 00 
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Class 
Sign 
Biased 
Significand 
Exponent 
laft ...ft 


0 
11 ... 10 
Normals 
• 
• 
111 ... 11 
• 
• 
• 
• 
• 
• 
• 
• 
• 
• 
• 
• 
• 
• 
100 ... 00 
• 
• 
• 
• 
Unnormals 


Ul 
• 
• 
ell 
~ 
• 
• 
011.. .11 
'iij 
• 
• 
• 
0 
D. 
• 
• 
• 
• 
• 
• 
0 
00 ... 01 
000 ... 00 


Denormals 


0 
00 ... 00 
011...11 
• 
• 
• 
• 
• 
• 
• 
• 
• 
0 
00 ... 00 
000 ... 01 


Zero 
0 
00 ... 00 
000 ... 00 
Reals 
Zero 
1 
00 ... 00 
000 ... 00 


Denormals 


00 ... 000 
000 ... 01 
• 
• 
• 
• 
• 
• 
• 
• 
00 ... 00 
011.. .11 


00 ... 01 
Unnormals 


Ul 
• 
• 
000 ... 00 
ell 
~ 
• 
• 
• 


<'II 
• 
• 
• 
Cl 
• 
• 
• 
ell 
Z 
• 
• 
011.. .11 
• 
• 
• 
• 
Normals 
• 
• 
• 
• 
100 ... 00 
.0 
• 
• 
• 
• 
• 
• 
• 
• 
11...10 
111 ... 11 


00 
11...11 
100 ... 00 
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Chiss 
S. 
Biased 
Significand 
Ign 
Exponent 
lI\ff ...ff 


1 
11...11 
100...00 
• 
• 
• 
• 
• 
• 


!Il 
• 
• 
• 
CII.:: 
NANs IIndefinite 
1 
11...11 
110...00 
-; 
Cl 
• 
• 
• 
CIIZ 
• 
• 
• 
• 
• 
• 
1 
11...11 
111...11 


Source register is tagged empty (usually 
due to stack underflow). 


Destination register is not tagged empty 
(usually due to stack overflow). 


One or both operands is a NAN. 


Return real indefinite 
(overwrite 
destination value). 


Return 
NAN with 
larger 
absolute 
value 


(ignore signs). 


Set condition codes "not comparable". 
(Compare and test operations only): 
one or both operands is a NAN. 


(Addition operations only): closure is 
affine and operands 
are opposite-signed 


infinities; 
or closure is projective and both 
operands are 00 (signs immaterial). 


(Subtraction operations only): closure is 
affine 
and 
operands 
are 
like-signed 


infinities; 
or closure is projective and both 
operands are 00 (signs immaterial). 


(Multiplication 
operations only): 00·0; or 
0·00. 


(Division operations only): 00-;-00;or 0 -;-0; 
or 0 -;-pseudo-zero; 
or divisor is denormal 
or unnormal. 


(FPREM instruction 
only): modulus 


(divisor) is unnormal or denormal; 
or dividend is 00. 


(FSQRT instruction 
only): operand is 
nonzero 
and 
negative; 
or 
operand 
is 
denormal or unnormal; or closure is affine 
and operand is -00; or closure is projective 
and operand is 00. 


Return real indefinite, 
set condition code 
= "complete 
remainder". 


(Compare operations only): closure is 
projective and 00 is being compared with 0 
or a normal, or 00. 


(FTST instruction 
only): closure is 
projective and qperand is 00. 


(FIST, FISTP instructions 
only): source 


register is empty, or a NAN, or denormal, 
or unnormal, or 00, or exceeds represent- 
able range of destination. 


(FBSTP instruction 
only): source register 


is 
empty, 
or 
a NAN, 
or 
denormal, 
or 


unnormal, 
or 
00, or exceeds 
18 decimal 


digits. 


(FST, FSTP instructions 
only): destination 


is short or long real and source register is 
an unnormal with exponent in range. 


(FXCH instruction 
only): one or both 


registers is tagged empty. 


Change empty register(s) to real indefinite 
and then perform exchange. 


(FLD instruction 
only): source operand is 


denormal. 


(Arithmetic 
operations only): one or both 
operands is denormal. 


(Compare and test operations only): one 
or both operands is denormal or un normal 
(other than pseudo-zero). 


(Arithmetic 
operations only): rounding is 


nearest or chop, and exponent of true 
result> 
16,383. 


(FST, FSTP instructions 
only): rounding is 


nearest or chop, and exponent of tru~ 
result> 
+127 (short real destination) 


or> +1023(long real destination). 


Convert (in a work area) the operand to the 
equivalent unnormal and proceed. 


Convert (in a work area) any denormal to 
the equivalent unnormal; normalize as 
much as possible, and proceed with 
operation. 


Return 00 signed with "exclusive 
or" of 


operand signs. 


Return properly signed ooandsignal 
precision exception. 


Return properly signed 
00 and signal 


precision exception. 


Underflow 


(Arithmetic 
operations only): exponent of 
Denormalize until exponent rises to 
true result <-16,382 (true). 
-16,382 (true), round significand to 64 bits. 
If denormalized 
rounded 
significand 
= 0, 


then return true 0; else, return denormal 
(tag = special, biased exponent =0). 


(FST, FSTP instructions 
only): destination 
Denormalize until exponent rises to -126 


is short real and exponent of true result 
(true), round significand to 24 bits, store 


<-126 
(true). 
true 0 if denormalized rounded significand 
= 0; else, store denormal (biased expo- 
nent = 0). 


(FST, FSTP instructions 
only): destination 
Denormalize until exponent rises to -1022 
is long real and exponent of true result 
(true), round significand to 53 bits, store 


<-1022 (true). 
true 0 if rounded denormalized significand 
= 0; else, store denormal (biased expo- 
nent = 0). 


Precision 


True rounding error occurs. 
No special action. 


Masked response to overflow exception 
No special action. 
earlier in instruction. 


True Result 
Rounding 
Result 
Delivered 
Normalization 
Sign 
Mode 


Normal 
+ 
Up 
+00 


Normal 
+ 
Down 
Largest finite positive number(1) 


Normal 
- 
Up 
Largest finite negative number(1) 


Normal 
- 
Down 
-00 


Unnormal 
+ 
Up 
+00 


Unnormal 
- 
Down 
Largest exponent, result's significand(2) 


Unnormal 
+ 
Up 
Largest exponent, result's significand(2) 


Unnormal 
- 
Down 
-00 


(1)The largest valid representable 
reals are encoded: 


exponent: 
11... 108 
significand: 
(1),\11... 108 


(2)The significand 
retains its identity as an unnormal; the true result is rounded as usual 


(effectively 
chopped toward 0 in this case). The exponent is encoded 11... 108. 


nnn_ 
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2'-32 and S-6). The code fragment, 
then, 


sets ZF and CF to the values of C3 and CO and 
then uses the CPU conditional jumps to test the 
flags. Table 2-15 shows how each conditional 
jump instruction tests the CPU flags. 
As discussed 
in section 
S.7, 
the 
comparison 
instructions 
post their results to the condition 
code bits of the 8087 status word. Although there 
are many ways to implement conditional branch- 
ing following a comparison, the basic approach is 
as follows: 


execute the comparison, 


store the status word, 


inspect the condition code bits, 


jump on the result. 


Figure S-26 is a code fragment that illustrates how 
two memory-resident long real numbers might be 
compared 
(similar code could be used with the 
FTST instruction). The numbers are called A and 
B, and the comparison is A to B. The comparison 
itself simply requires loading A onto the top of 
the 8087 register stack and then comparing it to B 
and popping the stack in the same instruction. 
The status word is written to memory and the 
code waits for completion 
of the store before 
attempting to use the result. 


There are four possible orderings of A and B, and 
bits C3 and CO of the condition 
code indicate 
which ordering holds. These bits are positioned in 
the upper byte of the status word so as to corres- 


The FXAM instruction updates all four condition 
code bits. Figure S-27 shows how a jump table 
can be used to determine the characteristics of the 
value examined. The jump table (FXAM_TBL) 
is initialized to contain the 16-bit displacement of 
16 labels, one for each possible condition 
code 
setting. Note that four of the table entries contain 
the same value, since there are four condition 
code settings that correspond to "empty." 


The program fragment performs the FXAM and 
stores the status word. It then manipulates 
the 
condition code bits to finally produce a number in 
register BX that equals the condition code times 
2. This involves zeroing the unused bits in the byte 
that contains the code, shifting C3 to the right so 
that it is adjacent 
to C2, and then shifting the 
code to multiply it by 2. The resulting value is 
used 
as 
an 
index 
which 
selects 
one 
of 
the 
displacements 
from 
FXAM_ 
TBL 
(the 
multiplication 
of the condition 
code is required 
because of the 2-byte length of each value in 
FXAM_ TBL). The unconditional 
JMP instruc- 
tion effectively vectors through the jump table to 
the labelled routine that contains code (not shown 
in the example) to process each possible result of 
the FXAM instruction. 


A 
DQ? 


B 
DQ? 
STAT 
87 
DW 
? 


F L D 
FCOMP 
FSTSW 
FWAIT 


A 


B 
STAT 
87 


;LOAD 
A 
ONTO 
TOP 
OF 
;COMPARE 
A:B, 
POP 
A 
;STORE 
RESULT 
;WAIT 
FOR 
STORE 


iLOAD 
CPU 
REGISTER 
AH 
WITH 
BYTE 
OF 
STATUS 
WORD 
CONTAINING 
CONDITION 
CODE 
MOV 
AH, 
BYTE 
PTR 
STAT_87+1 


iLOAD 
CONDITION 
CODES 
INTO 
CPU 
FLAGS 
SAHF 


iUSE 
CONDITIONAL 
JUMPS 
TO 
DETERMINE 
ORDERING 
OF 
A 
AND 
B 
JB 
A_LESS_OR_UNORDERED 
iCF 
(CO) 
= 
0 
JNE 
A 
GREATER 
A 
EQUAL: 
iCF 
(CO) 
= 
0, 
ZF 
(C3) 
= 


A 
GREATER: 
iCF 
(CO) 


A 
LESS_OR_UNORDERED: 
iCF 
(CO) 
= 
1, 
TEST 
ZF 
(C3) 
JNE 
A_LESS 
A_B_UNORDERED: 
iCF 
(CO) 
= 
1, 
ZF 
(C3) 
= 


A 
LESS: 
iCF 
(CO) 
= 
1, 
ZF 
(C3) 
0 


FXAM 
TBL 
OW 
POS _UNNORM, 
POS _NAN, 
NEG _UNNORM, 
& 
NEG _NAN, 
POS _NORM, 
POS _INFINITY, 
& 
NEG_NORM, 
NEG_INFINITY, 
POS _ZERO, 
& 
EMPTY, 
NEG _ZERO, 
EMPTY 
, POS _DENORM, 


& 
EMPTY, 
NEG _DENORM, 
EMPTY 
STAT 
87 
DW 
? 


iEXAMINE 
ST, 
STORE 
RESULT, 
WAIT 
FOR 
COMPLETION 
FXAM 
FSTSW 
STAT_8? 
FWAIT 
iCLEAR 
UPPER 
HALF 
OF 
BX, 
LOAD 
CONDITION 
CODE 
IN 
LOWER 
HALF 
MOV 
BH,O 
MOV 
BL, 
BYTE 
PTR 
STAT_8?+1 
iCOPY 
ORIGINAL 
IMAGE 
MOV 
AL,BL 
iCLEAR 
ALL 
BITS 
EXCEPT 
C2-CO 
AND 
BL,00000111B 
iCLEAR 
ALL 
BITS 
EXCEPT 
C3 
AND 
AL,01000000B 
iSHIFT 
C3 
TWO 
PLACES 
RIGHT 
SHR 
AL,1 
SHR 
AL,1 
iSHIFT 
C2-CO' 
ONE 
PLACE 
LEFT 
(MULTIPLY 
BY 
2) 
SAL 
BX,1 
iDROP 
C3 
BACK 
IN 
ADJACENT 
TO 
C2 
(OOOXXXXO) 
OR 
BL,AL 
iJUMP 
TO 
THE 
ROUTINE' 
'ADDRESSED" 
BY 
CONDITION 
CODE 
JMP 
FXAM 
TBL[BX] 


iHERE 
ARE 
THE 
JUMP 
TARGETS, 
ONE 
TO 
HANDLE 
EACH 
POSSIBLE 
RESULT 
OF 
FXAM 
POS 
UNNORM: 


There are many approaches 
to writing exception 
handlers. One useful technique is to consider the 
exception 
handler 
interrupt 
procedure 
as con- 
sisting of "prologue," 
"body" 
and "epilogue" 
sections of code. (For compatibility with the 8087 
emulators, 
this procedure should be invoked by 
interrupt pointer (vector) number 16.) 


At the beginning of the prologue, CPU interrupts 
have been disabled by the CPU's normal interrupt 
response mechanism. The prologue performs all 
functions that must be protected 
from possible 
interruption 
by higher-priority 
sources. Typically 
this 
will 
involve 
saving 
CPU 
registers 
and" 


transferring diagnostic information from the 8087 
to memory. When the critical processing has been 
completed, the prologue may enable CPU inter- 
rupts to allow higher-priority 
interrupt handlers 
to preempt the exception handler. 


The 
exception 
handler 
body 
examines 
the 
diagnostic information and makes a response that 
is 
necessarily 
application-dependent. 
This 
response may range from halting execution, 
to 
displaying a message, to attempting to repair the 
problem and proceed with normal execution. 


The epilogue essentially reverses the actions of the 
prologue, restoring the CPU and the NDP so that 
normal execution can be resumed. The epilogue 


must not load an unmasked exception flag into 
the 8087 or another 
interrupt 
will be requested 
immediately (assuming 8087 interrupts 
are also 
loaded as unmasked). 


Figures S-28 through 
S-30 show the ASM-86 
coding of three skeleton exception handlers. They 


, show how prologues and epilogues can be written 
for various situations, but only provide comments 
indicating 
where 
the 
application-dependent 
exception handling body should be placed. 


Figures S-28 and S-29 are very similar; their only 
substantial 
difference is their choice of instruc- 
tions to save and restore the 8087. The tradeoff 
here is between the increased diagnostic 
infor- 
mation 
provided 
by FNSA VE and 
the 
faster 
execution of FNSTENV. For applications that are 
sensitive to interrupt 
latency, or do not need to 
examine register contents, FNSTENV reduces the 
duration 
of the "critical 
region," 
during which 
the CPU 
will not recognize 
another 
interrupt 
request (unless it is a non-maskable interrupt). 


After the exception handler body, the epilogues 
prepare the CPU and the NDP to resume execu- 
tion from 
the point 
of interruption 
(i.e., 
the 
instruction 
following the one that generated the 
unmasked exception). Notice that the exception 
.flags in the memory image that is loaded into the 
8087 are cleared to zero prior to reloading 
(in 
fact, in these examples, 
the entire status word 


lllUH.:aungto tne mterrupt 
controller 
hardware 
(e.g., 8259A) that the interrupt 
has been pro- 


cessed. 
The 
actual 
processing 
done 
here 
is 
application-dependent, 
but 
might 
typically 
involve writing an "end of interrupt" 
command 
to the interrupt controller. 
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employed. The basic technique is to save the full 
8087 state and then to load a new control word in 
the prologue. Note that considerable care should 
be taken when designing an exception handler of 
this type to prevent 
the 
handler 
from 
being 


reentered endlessly. 


The examples in figures S-28 and S-29 assume 
that the exception handler itself will not cause an 
unmasked exception. Where this is a possibility, 


iSAVE 
CPU 
REGISTERS, 
ALLOCATE 
STACK 
SPACE 
iFOR 
8087 
STATE 
IMAGE 
PUSH 
BP 


MOV 
BP,SP 
SUB 
SP,94 
iSAVE 
FULL 
8087 
STATE, 
WAIT 
FOR 
COMPLETION, 
iENABLE 
CPU 
INTERRUPTS 
FNSAVE 
[BP-94] 
FWAIT 
STI 


iAPPLICATION-DEPENDENT 
EXCEPTION 
HANDLING 
iCODE 
GOES 
HERE 


iCLEAR 
EXCEPTION 
FLAGS 
IN 
STATUS 
WORD 
iRESTORE 
MODIFIED 
STATE 
i IMAGE 
MOV 
BYTE 
PTR 
[BP-92], 
OH 
FRSTOR 
[BP-94] 
iWAIT 
FOR 
RESTORE 
TO 
FINISH 
BEFORE 
RELEASING 
MEMORY 


FWAIT 
iDE-ALLOCATE 
STACK 
SPACE, 
RESTORE 
CPU 
REGISTERS 


MOV 
SP,BP 


iCODE 
TO 
SEND' 
'END 
OF 
INTERRUPT" 
COMMAND 
TO 
i8259A 
GOES 
HERE 


iRETURN 
TO 
INTERRUPTED 
CALCULATION 
IRET 
SAVE 
ALL 
ENDP 


;SAVE 
CPU 
REGISTERS, 
ALLOCATE 
STACK 
SPACE 
;FOR 
8087 
ENVIRONMENT 
PUSH 
BP 


MOV 
BP,SP 
SUB 
SP,14 
;SAVE 
ENVIRONMENT, 
WAIT 
FOR 
COMPLETION, 
;ENABLE 
CPU 
INTERRUPTS 
FNSTENV 
[BP-14J 
FWAIT 
.S T I 


;CLEAR 
EXCEPTION 
FLAGS 
IN 
STATUS 
WORD 
;RESTORE 
MODIFIED 
;ENVIRONMENT 
IMAGE 
MOV 
BYTE 
PTR 
[BP-12J, 
OH 
F LDENV 
[BP-14 
J 
;WAIT 
FOR 
LOAD 
TO 
FINISH.BEFORE 
RELEASING 
MEMORY 
FWAIT 
. 


;DE-ALLOCATE 
STACK 
SPACE, 
RESTORE 
CPU 
REGISTERS 
MOV 
SP,BP 


;CODE 
TO 
SEND 
I 'END 
OF 
INTERRUPT" 
COMMAND 
TO 
;8259A 
GOES 
HERE 


;RETURN 
TO 
INTERRUPTED 
CALCULATION 
IRET 
SAVE 
ENVIRONMENT 
ENDP 


Figure 8-29. 
Reduced Latency Exception Handler 


;SAVE 
CPU 
REGISTERS, 
ALLOCATE 
STACK 
SPACE 
FOR 
;8087 
STATE 
IMAGE 
PUSH 
BP 


MOV 
BP,SP 
SUB 
SP,94 
;SAVE 
STATE, 
LOAD 
NEW 
CONTROL 
WORD, 
WAIT 
;FOR 
COMPLETION, 
ENABLE 
CPU 
INTERRUPTS 
FNSAVE 
[BP-941 
FLDCW 
LOCAL_CONTROL 
FWA IT 
STI 
;CODE 
TO 
SEND' 
'END 
OF 
INTERRUPT" 
COMMAND 
TO 
;8259A 
GOES 
HERE 


;APPLICATION 
EXCEPTION 
HANDLING 
CODE 
GOES 
HERE. 
;AN 
UNMASKED 
EXCEPTION 
GENERATED 
HERE 
WILL 
;CAUSE 
THE 
EXCEPTION 
HANDLER 
TO 
BE 
REENTERED. 
;IF 
LOCAL 
STORAGE 
IS 
NEEDED, 
IT 
MUST 
BE 
;ALLOCATED 
ON 
THE 
CPU 
STACK. 


;CLEAR 
EXCEPTION 
FLAGS 
IN 
STATUS 
WORD 
;RESTORE 
MODIFIED 
STATE 
IMAGE 
MOV 
BYTE 
PTR 
[BP-921, 
OH 
FRSTOR 
[BP-941 
;WAIT 
FOR 
RESTORE 
TO 
FINISH 
BEFORE 
RELEASING 
MEMORY 


FWAIT 
;DE-ALLOCATE 
STACK 
SPACE, 
RESTORE 
CPU 
REGISTERS 
MOV 
SP,BP 


POP 
BP 
;RETURN 
TO 
POINT 
OF 
INTERRUPTION 
IRET 
REENTRANT 
ENDP 


iAPX 86/20 
iAPX 88/20 
NUMERIC DATA PROCESSOR 


• High Performance 
2-Chip Numeric 


Data Processor 


• Standard 
iAPX 86/10, 88/10 Instruction 


Set Plus Arithmetic, 
Trigonometric, 
Exponential, 
and Logarithmic 


Instructions 
For All Data Types 


• 
All 24 iAPX 86/10, 88/10 Addressing 
Modes Available 


• 
Conforms 
To Proposed 
IEEE Floating 


Point Standard 


• 
Support 
8 Data 'TYpes: 8-, 16-, 32-, 64- 


Bit Integers, 
32-, 64-, 80-Blt Floating 


Point; and 18-Digit BCD Operands 


• 
8x80-Bit 
Individually 
Addressable 


Register 
Stack plus 14 General 


Purpose 
Registers 


• 
7 Built-in 
Exception 
Handling 


Functions 


• 
MULTI BUS System 
Compatible 


Interface 


The Intel iAPX 86/20 and iAPX 88/20 are two-chip numeric data processors (NDP's). They provide the instruc- 
tions and data types needed for high-performance 
numeric applications. The NDP provides 100 times the 
performance of an iAPX 86/10, 88/10 CPU alone for numeric processing. The iAPX 86/20 consists of an iAPX 
86/10 (16-bit 8086 CPU) and a numeric processor extension (NPX), the 8087. The iAPX 88/20 consists of the 
NPX in conjunction 
with the iAPX 88/10 (8-bit 8088 CPU). The NDP conforms to the proposed IEEE Floating 
Point Standard. 


Both components of the iAPX 86/20 and iAPX 88/20 are implemented in N-channel, depletion load, silicon gate 
technology 
(HMOS), housed in two 40-pin packages. The iAPX 86/20, 88/20 adds 68 numeric processing 
instructions to the iAPX 86/10, 88/10 instruction 
set and eight 80-bit registers to the register set. 


MAX 
{~} 
MODE 
81186 


GNO 
Vcc 
GNO 
.. 
Vcc 
r:--------, 


(1.14)1.014 
1.015 
(1.14) 1.014 
" 


1.015 (All) 


(1.13)1.013 
All/S3 
(1.13) 1.013 
3 
31 
Al51S3 
I 
I 
(1.12) 1.012 
1.17154 
(1.12)1.012 · 
" 


1.17154 
I 
00 
I 
(1.11)1.011 
1.11155 
(1.11)1.011 
s 
AlllS5 
I 
I 
(1.10)1.010 
1.11154 
(1.10)1.010 · 


3S 
1.11154 
I 
REQUESTI 
QUEUE 
I 


(1.1)1.01 
BHElS7 
(1.1)1.01 
, 
,. 
- BHElS7 (HIGH) 


GRANT 
STATUS 
(1.8) ADS 
ROICi,.l 
(A') 1.08 · 
MN.toIX 
I 
I 
I 
1.07 
INT 
1.07 
, 
IlIl 


AD6 
Ramo 
AD6 " 
RaiGro 
I 
I 
ADS 
NC 
ADS 
" 
RaIGTl 
I 
I 


1.04 
NC 
1.04 
" 
I:OCK 
REQUEST/ 
QUEUE 
I 
GRANT 
STATUS 
1.03 
52 
1.03 
" 
52 


I 
I 
AD2 
51 
AD2 " 
51 
[§] 
I 


NPX 
I 
1.01 
so 
1.01 
.. 
so 
I 


ADO 
aso 
ADO " 
aso 
L 
_______ 
J"'el20"IJ20 
NC 
OSl 
NMI " 
OSl 


NC 
BUSY 
Itml 
" 
TEST 
CLK 
READY 
CLK " 
READY 
GNO 
RESET 
GNO 
20 
RESET 
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Symbol 


AD15-ADO 


A19/56, 
A18/55, 
A17/54. 
A16/53 


'tYpe 
Name and Function 


I/O 
Address Data: These lines constitute 
the time multiplexed 
memory 
address 
(T1) and data (T2. 


T3. Tw, T4) bus. AO is analogous 
to SHE for the lower 
byte of the data bus. pins 07-00. 
It is 


LOW during 
T1 when 
a byte is to be transferred 
on the lower 
portion 
of the bus in memory 


operations. 
Eight-bit 
oriented 
devices tied to the lower 
half of the bus would 
normally 
use AO 


to condition 
chip select functions. 
These lines are active 
HIGH. They are input/output 
lines for 


8087 
driven 
bus cycles 
and are inputs 
which 
the 8087 
monitors 
when 
the 8086/8088 
is in 


control 
of the bus. 
A15-A8:do 
not require 
an address 
latch 
in an iAPX 88/20. The 8087 will 
supply 
an address 
for the T1-T4 period. 


I/O 
Address 
Memory: 
During 
T1 
these 
are 
the 
four 
most 
significant 
address 
lines 


for 
memory 
operations. 
During 
memory 
operations, 
status 
information 
is available 
on 


these 
lines 
during 
T2. T3. Tw. 
and 
T4. 
For 8087 
controlled 
bus 
cycles, 
56, 
54, 
and 
53 


are reserved 
and currently 
one (HIGH). while 
55 is always 
LOW These lines are inputs 
which 


the 8087 monitors 
when 
the 8086/8088 
is in control 
of the bus. 


I/O 
Bus High Enable: During 
T1 the bus high enable 
signal 
(SHE) should 
be used to enable 
data 


onto the most significant 
half of the data bus. pins 015-08. 
Eight-bit 
oriented 
devices 
tied to 


the upper 
half of the bus would 
normally 
use SHE to condition 
chip select 
functions. 
SHE is 


LOW during 
T1 for read and write cycles when a byte is to be transferred 
on the high portion 
of 


the bus. The 57 status 
information 
is available 
during 
T2. T3. Tw. and T4' The signal 
is active 


LOW 57 is an input 
which 
the 8087 monitors 
during 
8086/8088 
controlled 
bus cycles. 


S2 
S1 
SO 
o (LOW) 
X 
X 
Unused 
1 (HIGH) 
0 
0 
Unused 
1 
0 
1 
Read Memory 
1 
1 
0 
Write 
Memory 
1 
1 
1 
Passive 


5tatus 
is driven 
active 
during 
T4, remains 
valid 
during 
T1 and T2, and 
is returned 
to the 


passive 
state (1,1,1) 
during 
T3 or during 
Tw when 
READY is HIGH. This status 
is used by the 


8288 Sus Controller 
to generate 
all memory 
access 
control 
signals. 
Any change 
in 52, 51, or 


50 during 
T4 is used to indicate 
the beginning 
of a bus cycle. and the return 
to the passive 


state in T3 or TW is used to indicate 
the end of a bus cycle. These signals 
are monitored 
by the 


8087 when 
the 8086/8088 
is in control 
of the bus. 


I/O 
Request/Grant: This request/grant 
pin is used by the NPX to gain control 
of the local bus from 


the CPU for operand 
transfers 
or on behalf 
of another 
bus master. It must be connected 
to one 


of the two processor 
request/grant 
pins. The request 
grant sequence 
on this pin is as follows: 


1. A pulse 
one clock 
wide 
is passed 
to the CPU to indicate 
a local bus request 
by either 
the 


8087 or the master 
connected 
to the 8087 RQ/GT1 
pin. 


2. The 8087 waits 
for the grant 
pulse and when 
it is received 
will either 
initiate 
bus transfer 


activity 
in the clock cycle following 
the grant or pass the grant out on the RQ/GT1 pin in this 


clock 
if the initial 
request 
was for another 
bus master. 


3. The 8087 will generate 
a release 
pulse to the CPU one clock 
cycle after the completion 
of 


the last 8087 bus cycle or on receipt 
of the release 
pulse from the bus master 
on RQ/GT1. 


inter 


Symbol 
'TYpe 
Name and Function 


RQ/GT1 
I/O 
Request/Grant: This request/grant pin is used by another local bus master to force the 8087 to 
request the local bus. If the 8087 is not in control of the bus when the request is made the 
request/grant sequence is passed through the 8087 on the RQ/GTO pin one cycle later. 
Subsequent grant and release pulses are also passed through the 8087 with a two and one 
clock delay, respectively, for resynchronization. AOIGi1 
has has an internal pullup resistor, 


and so may be left unconnected. If the 8087 has control of the bus the request/grant sequence 
is as follows: 
1. A pulse 1 CLK Widefrom another local bus master indicates a local bus request to the 8087 
(pulse 1). 


2. During the 8087's next T4or T1a pulse 1 CLK wide from the 8087 to the requesting master 
(pulse 2) indicates that the 8087 has allowed the local bus to float and that it will enter the 
"RQ/GT acknowledge" state at the next CLK. The 8087's control 
unit is disconnected 


logically from the local bus during "RQ/GTacknowledge." 


3. A pulse 1 CLK wide from the requesting master indicates to the 8087 (pulse 3) that the 
"RQ/GT" request is about to end and that the 8087 can reclaim the local bus at the next 
CLK. 
Each master-master exchange of the local bus is a sequence of 3 pulses. There must be one 
dead CLK cycle after each bus exchange. Pulses are active LOW 


QS1, 
I 
OS1, QSO: QS1 and QSO provide the 8087 with status to allow tracking 
of the CPU 
QSO 
instruction queue. 
~ 


QS1 
QSO 
o (LOW) 
0 
No Operation 
0 
1 
First Byte of Op Code from Queue 
1 (HIGH) 
0 
Empty the Queue 


1 
1 
SUbsequent Byte from Queue 


INT 
0 
Interrupt: 
This line is used to indicate that an unmasked exception has occurred during 


numeric instruction 
execution when 8087 interrupts are enabled. This signal is typically 


routed to an 8259A. INT is active HIGH. 


BUSY 
0 
Busy: This signal indicates that the 8087 NEU is executing a numeric instruction. It is con- 
nected to the CPU's TEST pin to provide synchronization. 
In the case of an unmasked 


exception BUSY remains active until the exception is cleared. BUSY is active HIGH. 


READY 
I 
Ready: READY is the acknowledgment 
from the addressed memory device that it will 


complete the data transfer. The RDYsignal from memory is synchronized by the 8284AClock 
Generator to form READY.This signal is active HIGH. 


RESET 
I 
Reset: RESETcauses the processor to immediately terminate its present activity. The signal 
must be active HIGH for at least four clock cycles. RESET is internally synchronized. 


CLK 
I 
Clock: The clock provides the basic timing for the processor and bus controller. It is asym- 
metric with a 33% duty cycle to provide optimized internal timing. 


Vcc 
Power: Vcc is the +5V power supply pin. 


GND 
Ground: GND are the ground pins. 


NOTE: 
For thepin descriptionsof the8086and8088CPU'sreferencethoserespectivedatasheets(iAPX86/10,iAPX88/10). 


-~ 


The iAPX 86/20 and iAPX 88/20 provide 
functions 
meant 
specifically 
for 
high 
performance 
numeric 
processing 
requirements. 
Trigonometric, 
logarith- 
mic, 
and 
exponential 
functions 
are built 
into 
the 
processor 
hardware. 
These functions 
are essential 


in scientific, 
engineering, 
navigational, 
or military 


applications. 


The NDP also has capabilities 
meant for business or 


commercial 
computing. 
An iAPX 86/20, 88/20 can 


process 
Binary 
Coded 
Decimal 
(BCD) numbers .up 


to 18 digits 
without 
roundo1f errors. 
It can also per-- 


form 
arithmetic' 
on 
integers 
as large 
as 64 bits 


(±101~. 
. 


Programs 
for the iAPX 86/20 and iAPX 88/20 can be 


written 
in ASM-86, 
the 
iAPX 86,88 assembly 
lan- 
guage, PUM-86, 
FORTRAN-86, 
and PASCAL-86, In- 


tel's high-level 
languages 
for iAPX 86, 88 systems. 


The remainder 
of the data sheet will concentrate 
on 


the numeric 
processor. extension 
(refered to as NPX 


or 8087). For iAPX 86/10 or iAPX 88/10 CPU details 
refer to those 
respective 
data sheets. 


The iAPX 86/20, 88/20 Numeric 
Data Processor's 
ar- 


chitecture 
is designed 
for 
high 
performance 


numeric 
computing 
in conjunction 
with 
general 


purpose 
processing. 


The 8087 
is a numeric 
processor 
extension 
that 


provides 
arithmetic 
and logical 
instruction 
support 


for a variety 
of numeric 
data types 
in iAPX 86/20, 


88/20 systems. 
It also executes 
numerous 
built-in 


transcendental 
functions 
(e.g., 
tangent 
and 
log 


functions). 
The 
8087 
executes 
instructions 
as a 


coprocessor 
to a maximum 
mode 8086 or 8088. It 


effectively 
extends the register and instruction 
set of 


an iAPX 
86/10 
or 88/10 
based 
system 
and 
adds 


several new data types as well. Figure 3 presents the 
registers 
of the iAPX 86/20. Table 2 shows the range 


of data types 
supported 
by the 
NDP. The 8087 is 
treated 
as an extension 
to the iAPX 86/10 or 88/10, 


providing 
register, 
data types, control, 
and instruc- 


tion 
capabilities 
at the hardware 
level. At the pro- 


grammers 
level the iAPX 86/20, 88/20 is viewed as a 


single 
unified 
processor. 


As a coprocessor 
to an 8086 or 8088, the 8087 is 


wired 
in parallel with the CPU as shown 
in Figure 4. 


The CPU's status 
(SO-S2) 
and queue 
status 
lines 


(OSO-OS1) enable the 8087 to monitor 
and decode 


'8087 


iAPX 86110, 88/10 
DATA 
FIELD 
TAG 
FIELD 
15 
FilE: 
o I 
79 
78 
64 
83 
0 
1 
0 


,"~' 


Rl 
SIGN 
EXPONENT 
SIGNIFICAND 


~ 


BX 
I 
R2 


ex 
I 
R3 
OX 
I 
R4 
~ 
I 
R5 
01 
I 
R8 
BP 
I 
R7 


sp 
I 
R8 
IL __ 
, 
15 


IP 
I 
CONTROL 
REGISTER 


FLAGS 
I 
STATUS 
REGISTER 
IL ____ 
--, 
INSTRUCTION 
POINTER 


~~I 
I 


I 
I 
OAT A POINTER 
I 
I 
I 
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Data 
Range 
Precision 
Formats 


Byte Integer 
102 
8 Bits 
17 
10 


Word Integer 
104 
16 Bits 
115 


Short Integer 
109 
32 Bits 
131 


Long Integer 
1018 
64 Bits 
163 


Packed BCD 
1018 
18 Digits 
S 


Short Real 
10:!:38 
24 Bits 
S 


Long Real 
10±308 
53 Bits 
S 
F52 


Temporary Real 
10±4932 
64 Bits 
S 


Integer: I 
Real: (_1)S(2E-BIAS)(FooF1"') 


Packed BCD; (-1)S(D17" 
.00) 
Bias=127 for Short Real 
1023 for Long Real 
16383 for Temp Real 


r--'" 
I 
INT 


8259A 
I 
PIC 
I 
L ~n_J 


8284A 
CLOCK 
GENERATOR 


ClK 


iAPX86 
BUS 
INTERFACE 
COMPONENTS 


MUlTIMASTER 
SYSTEM 
BUS 


ClK 
8087 
NOP 
I 
INT 
I 
RO/GT1 
I 
I 


1 
I 
r ~- -, 
I 
ROIGT 
I 
I 
I 
I 
L- 


- 
~ClK 
8089 ~-•. 
lOP 
I 
•.. 
....I 


inter 


instructions 
in synchrohization 
with 
the CPU and 
without 
any CPU overhead. 
Once started 
the 8087 
can process in parallel with and independent 
of the 
host CPU. For resynchronization, 
the NPX's BU8Y 
signal informs the CPU that the NPX is executing 
an 
instruction 
and the CPU WAIT instruction 
tests this 
signal 
to insure 
that the NPX is ready to execute 
subsequent 
instructions. 
The NPX can interrupt 
the 
CPU when 
it detects 
an error 
or exception. 
The 
8087's interrupt 
request 
line is typically 
routed 
to 
the 
CPU through 
an 8259A 
Programmable 
Inter- 
rupt 
Controller. 
(8ee 
Figure 
2 for 
8087 
pinout 
information.) 


The 8087 uses one of the request/grant 
lines of the 
iAPX 86, 88 architecture 
(typically 
RQ/GT1) to obtai n 
control 
of the local bus for data transfers. 
The other 
request/grant 
line is available for general system use 
(for instance by an I/O processor 
in LOCAL mode). A 
bus master 
can also be connected 
to the 8087's 
RQ/GT1 line. In this configuration 
the 8087 will pass 
the request/grant 
handshake 
signals 
between 
the 
CPU and the attached master when the 8087 is not in 
control 
of the bus and will relinquish 
the bus to the 
master directly 
when the 8087 is in control. 
In this 
way two additional 
masters can be configured 
in an 
iAPX 86/20, 88/20 system; 
one will share the 8086 
bus with the 8087 on a first come first served basis, 
and the second 
will be guaranteed 
to be higher 
in 
priority 
than the 8087. 


As Figure 4 shows, all processors 
utilize the same 
clock 
generator 
and system bus interface 
compo- 
nents (bus controller, 
latches, transceivers 
and bus 
arbiter). 


The 8087 bus' structure, 
operation 
and timing 
are 
identical 
to all other processors 
in the iAPX 86, 88 
series (maximum 
mode configuration). 
The address 
is time 
multiplexed 
with the data on the first 
16/8 
lines of the address/data 
bus. A16 through 
A19 are 
time multiplexed 
with fourstatus 
lines 83-86. 
83, 84 
and 86 are always one (high) 
for 8087 driven 
bus 
cycles while 85 is always zero (low). When the 8087 
is monitoring 
CPU bus cycles (passive mode) 86 is 
also 
monitored 
by 
the 
8087 
to 
differentiate 
8086/8088 activity 
from that of a local I/O processor 
or any other local bus master. (The 8086/8088 must 
be the only processor 
on the local bus to drive 86 
low.) 87 is multiplexed 
with and has the same value 
as BHE for all 8087 bus cycles. 


The first three status lines, 80-82, are used with an 
8288 bus controller 
to determine 
the type 
of bus 


Unused 
Unused 
Memory 
Data Read 
Memory 
Data Write 
Passive (no bus 
cycle) 


The NDP includes 
the standard 
iAPX 86/10, 88/10 
instruction 
set for general 
data manipulation 
and 
program 
control. 
It also 
includes 
68 numeric 
instructions 
for extended 
precision 
integer, floating 
point, 
trigonometric, 
logarithmic, 
and exponential 
functions. 
8ample 
execution 
times for several NDP 
functions 
are shown in Figure 4. Overall iAPX 86/20 
system 
performance 
is 100 times 
that 
of an iAPX 
86/10 class processor 
for numeric 
instructions. 


Any 
instruction 
executed 
by 
the 
NDP 
is 
the 
combined 
result of the CPU and NPX activity. 
The 
CPU and the NPX have specialized 
functions 
and 
registers 
providing 
fast concurrent 
operation. 
The 
CPU controls 
overall 
program 
execution 
while the 
NPX uses the coprocessor 
interface 
to recognize 
and perform 
numeric 
operations. 


Table 2 lists the eight 
data types 
the iAPX 86/20, 
88/20 supports 
and 
presents 
the format 
for each 
type. Internally, 
the NPX holds 
all numbers 
in the 
temporary 
real format. 
Load and store instructions 
automatically 
convert 
operands 
represented 
in 
memory as 16-, 32-, or 64-bi,t integers, 
32- or 64-bit 
floating 
point 
numbers 
or 18-digit 
packed 
BCD 
numbers 
into temporary 
real format and vice versa. 


The 
NDP also 
provides 
the 
capability 
to control 
round 
off, underflow, 
and overflow 
errors 
in each 
calculation. 


Computations 
in the NPX use the processor's 
regis- 
ter stack. 
These eight 
80-bit 
registers 
provide 
the 
equivalent 
capacity 
of 20 32-bit registers. 
The NPX 
register set can be accessed as a stack, with instruc- 
tions 
operating 
on the top one or two 
stack 
ele- 
ments, or as a fixed 
register 
set, with 
instructions 
operating 
on explicitly 
designated 
registers. 


Table 5 lists the 8087's instructions 
by class. All ap- 
pear as E8CAPE instructions 
to the host. Assembly 
language 
programs 
are written 
in A8M-86, the iAPX 
86, 88 assembly 
language. Table 3 gives the execu- 
tion times of some typical 
numeric 
instructions. 


inter 


Table 
3. Execution 
Times 
for Selected 
iAPX 86/20 
Numeric 
Instructions 
and Corresponding 
iAPX 86/10 Emulation 


Approximate 
Execution 
Time 
(}Ls) 
Floating 
Point 
Instruction 
iAPX 86t20 
iAPX 
86/10 
(5 MHz 
Clock) 
Emulation 


Add/Subtract 
17 
1,600 
Multiply 
(single 
I' 
precision) 
19 
1,600 
Multiply 
(extended 
precision) 
27 
2,100 
Divide 
39 
3,200 
Compare 
9 
1,300 
Load (double precision) 
10 
1,700 
Store (double precision) 
21 
1,200 
Square Root 
36 
19,600 
Tangent 
90 
13,000 
Exponentiation 
100 
17,100 


NUMERIC 
PROCESSOR 
EXTENSION 
ARCHITECTURE 


As shown 
in Figure 
5, the 8087 is internally 
divided 


into two processing 
elements, 
the control 
unit (CU) 


and 
the 
numeric 
execution 
unit 
(NEU). 
The 
NEU 
executes 
all 
numeric 
instructions, 
while 
the 
CU 
receives 
and decodes 
instructions, 
reads and writes 
memory 
operands 
and 
executes 
NPX 
control 
in- 


structions. 
The 
two 
elements 
are 
able 
to operate 
independently 
of one 
another, 
allowing 
the 
CU to 
maintain 
synchronization 
with 
the 
CPU 
while 
the 
NEU is busy 
processing 
a numeric 
instruction. 


The CU keeps the 8087 operating 
in synchronization 


with 
its host CPU. 8087 instructions 
are intermixed 
with CPU instructions 
in a single 
instruction 
stream. 
The CPU fetches 
all instructions 
from 
memory; 
by 


monitoring 
the status 
signals 
(So-S2, 
S6) emitted 
by 


the CPU, the NPX control 
unit determines 
when 
an 


rcONTROLUNIT - 
T NUMER~XECuTI;;;uNrr - 
- - - 
- - I 


I 


I 


I 


I 


I 
I 


DATA 
I 


I 


I 


I 
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(7) 


I 


(6) 
I 
T 
A 
(5) 
I 
I 


G 


REGISTER 
STACK 
(4) 


W 
(3) 
I 
I 
0 


STATUS 
R 
(2) 


D 
I 
EXCEPTION 
I 
(1) 


ADDRESS 
POINTERS 
(0) 


L 
,- 
80 BITS 
-' 
~ 
- - 
- 
- 
- 
- 
- 
- 
- 
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8086 instruction 
is being fetched. 
The CU monitors 


the Data bus in parallel 
with 
the CPU to obtain 
in- 


structions 
that pertain 
to the 8087. 


The CU maintains 
an instruction 
queue that is identi- 


cal to the queue in the host CPU. The CU automatic- 
ally determines 
if the CPU is an 8086 or an 8088 


immediately 
after reset (by monitoring 
the BHE/ S7 


line) and matches 
its queue 
length 
accordingly. 
By 


monitoring 
the CPU's queue status lines (OSO, OS1), 
the CU obtains 
and decodes 
instructions 
from the 


queue 
in synchronization 
with the CPU. 


A numeric 
instruction 
appears 
as an ESCAPE in- 


struction 
to the 8086 or 8088 CPU. Both the CPU and 


NPX decode 
and execute 
the ESCAPE instruction 


together. 
The 8087 only recognizes 
the numeric 
in- 
structions 
shown 
in Table 5. The start of a numeric 


operation 
is accomplished 
when the CPU executes 


the ESCAPE instruction. 
The instruction 
mayor 
may 


not identify 
a memory 
operand. 


The CPU does, however, 
distinguish 
between 
ESC 


instructions 
that reference 
memory 
and those that 


do not. If the instruction 
refers to a memory operand, 
the CPU calculates 
the operand's 
address using any 


one of its available addressing 
modes, and then per- 
forms a "dummy 
read" of the word at that location. 
(Any location 
within 
the 1M byte address 
space 
is 


allowed.) 
This is a normal 
read cycle except that the 


CPU ignores the data it receives. 
If the ESC instruc- 


tion does not contain 
a memory 
reference 
(e.g. an 


8087 stack operation), 
the CPU simply 
proceeds 
to 


the next instruction. 


An 8087 Instruction 
can have one of three memory 
reference 
options; 
(1) not 
reference 
memory; 
(2) 


load an operand 
word from memory into the 8087; or 


(3) store 
an 
operand 
word 
from 
the 
8087 
into 


memory. 
If no memory 
reference 
is required, 
the 


8087 simply 
executes 
its instruction. 
If a memory 


reference 
is required, 
the CU uses a "dummy 
read" 
cycle initiated 
by the CPU to capture 
and save the 


address 
that the CPU places on the bus. If the in- 


struction 
is a load, the CU additionally 
captures 
the 


data word 
when 
it becomes 
available 
on the local 


data bus. If data required 
is longer 
than one word, 
the CU immediately 
obtains 
the bus from the CPU 


using the requesVgrant 
protocol 
and reads the rest 


of the information 
in consecutive 
bus cycles. 
In a 


store operation, 
the CU captures and saves the store 


address as in a load, and ignores the data word that 
follows 
in the "dummy 
read" cycle. When the 8087 is 


ready to perform 
the store, the CU obtains 
the bus 


from the CPU and writes the operand 
starting 
at the 


specified 
address. 


The NEU executes 
all instructions 
that 
involve 
the 


register 
stack; 
these 
include 
arithmetic, 
logical, 


transcendental, 
constant 
and data transfer 
instruc- 


tions. The data path in the NEU is 84 bits wide (68 
fraction 
bits, 15 exponent 
bits and a sign bit) which 


allows internal 
operand 
transfers 
to be performed 
at 


very high speeds. 


When the NEU begins 
executing 
an instruction, 
it 


activates 
the 8087 BUSY signal. 
This signal 
can be 


used in conjunction 
with the CPU WAIT instruction 


to resynchronize 
both processors 
when the NEU has 


completed 
its current 
instruction. 


The iAPX 86/20 register 
set is shown 
in Figure 
3. 


Each of the eight data registers 
in the 8087's register 


stack 
is 80 bits wide 
and 
is divided 
into 
"fields" 


corresponding 
to the 
NDP's temporary 
real 
data 


type. 


At a given point in time the TOP field 
in the control 


word 
identifies 
the current 
top-of-stack 
register. 
A 
"push" 
operation 
decrements 
TOP by 1 and loads a 


value 
into the new top register. 
A "pop" 
operation 


stores the value 
from 
the current 
top 
register 
and 


then 
increments 
TOP by 1. Like iAPX 86/10,88/10 


stacks 
in memory, 
the 
8087 
register 
stack 
grows 
"down" 
toward 
lower-addressed 
registers. 


Instructions 
may address 
the data registers 
either 


implicitly 
or explicitly. 
Many instructions 
operate 
on 


the register 
at the top of the stack. 
These instruc- 


tions implicitly 
address the register pointed 
to by the 


TOP. Other 
instructions 
allow 
the 
programmer 
to 


explicitly 
specify 
the register 
which 
is to be used. 


Explicit 
register 
addressing 
is "top-relative." 


The status word shown in Figure 6 reflects 
the over- 


all state of the 8087; it may be stored in memory and 
then inspected 
by CPU code. The status 
word 
is a 


16-bit register divided 
into fields as shown 
in Figure 


6. The busy bit (bit 15) indicates 
whether 
the NEU is 


either 
executing 
an instruction 
or has an interrupt 


request pending 
(B = 1), or is idle (B = 0). Several 
instructions 
which 
store and manipulate 
the status 


word are executed 
exclusively 
by the CU, and these 


do not set the busy bit themselves. 


1111R Is set if any unmasked 
exception 
bit is set, cleared 
otherwise. 


l2)See Table 
3 for condltion 
code 
interpretation. 


l3lTop Values: 


000 = Register 
0 is Top of StacK. 


001 = Register 
~ is Top of Stack. 


INVALID 
OPERATION 


DENORMALlZEO 
OPERAND 


ZERO 
DIVIDE 


OVERFLOW 


UNDERFLOW 


PRECISION 


(RESERVED) 


INTERRUPT 
REQUEST(1) 


CONDITION 
CODEn) 


TOP 
OF STACK 
POINTERf.1) 


NEU 
BUSY 


The four 
numeric 
condition 
code bits '(CO-C3) are 


similar 
to the flags in a CPU: various 
instructions 


update 
these 
bits to reflect 
the outcome 
of NOP 


operations. 
The effect 
of these instructions 
on the 


condition 
code bits is summarized 
in Table 4. 


Bits 14-12 of the status word pojnt to the 8087 regis- 
ter 
that 
is the 
current 
top-of-stack 
(TOP) 
as 


described 
above. 


Bit 7 is the interrupt 
request bit. This bit is set if any 


unmasked 
exception 
bit is set and cleared 
other- 


wise. 


Bits 
5-0 
are 
set 
to 
indicate 
that 
the 
NEU 
has 


detected 
an exception 
while executing 
an instruc- 


tion. 


The tag word marks the content 
of each register as 


shown in Figure 7. The principal 
function 
of the tag 


word is to optimize 
the NOP's performance. 
The tag 


. word can be used, however, to interpret 
the contents 


of 8087 registers. 


The instruction 
and data pointers 
(see Figure 8) are 


provided 
for user-written 
error handlers. 
Whenever 


the 8087 executes an NEU instruction, 
the CU saves 


the 
instruction 
address, 
the 
operand 
address 
(if 


present) 
and the instruction 
opcode. 
8087 instruc- 


tions can store this data into memory. 


Instruction 
C3 
C2 
C, 
Co 
Interpretation 


Compare. 
Test 
0 
X 
X 
0 
A>B 
0 
X 
X 
1 
A<B 
1 
X 
X 
0 
A=B 
1 
X 
X 
1 
A? 
B (not 
comparable) 


Remainder 
0, 
0 
00 
02 
Complete 
reduction 
0, 
1 
00 
02 
Incomplete 
reduction 


Examine 
0 
0 
0 
0 
Valid. 
positive. 
un normalized 
0 
0 
0 
1 
Invalid. 
positive. 
exponent 
;t 0 


0 
0 
1 
0 
Valid. 
negative. 
unnormalized 
0 
0 
1 
1 
Invalid. 
negative. 
exponent 
;t 0 


0 
1 
0 
0 
Valid. 
positive. 
normalized 


0 
1 
0 
1 
Infinity. 
positive 
0 
1 
1 
0 
Valid. 
negative. 
normalized 
0 
1 
1 
1 
Infinity. 
negative 
1 
0 
0 
0 
Zero. 
positive 
1 
0 
0 
1 
Empty 
1 
0 
1 
0 
Zero, 
negative 
1 
0 
1 
1 
Empty 


1 
1 
0 
0 
Invalid, 
positive. 
exponent 
= 0 
1 
1 
0 
1 
Empty 
1 
1 
1 
0 
Invalid, 
negative, 
exponent 
= 0 
1 
1 
1 
1 
Empty 


X = value is not affected by instruction. 
o = Co. Ca. C, hold 3 LSBs of the quotient generated during a remainder operation. 


TAG VALUES: 
00 " VALID 
01 " ZERO 
10 " SPECIAL 
11 " EMPTY 


INSTRUCTION 
POINTER 
(15-0) 


INSTRUCTION 
POINTER I 0 I 
INSTRUCTION 
OPCODE 
(16-0) 
(19-16) 


DATA POINTER (15-{1) 


DATA POINTER 
(11-16) I 
0 


The 8087 provides several processing 
options which 


are selected by loading a word from memory into the 
control 
word. Figure 9 shows the format and encod- 


ing of the fields 
in the control 
word. 


The low order byte of this control 
word configures 


8087 interrupts 
and exception 
masking. 
Bits 5-0 of 


the control 
word contain 
individual 
masks for each 


of the six exceptions 
that the 8087 recognizes 
and 


bit 7 contains 
a general 
mask bit for all 8087 in- 


terrupts. 
The high order 
byte of the control 
word 


configures 
the 
8087 
operating 
mode 
including 


precision, 
rounding, 
and infinity 
controls. The preci- 


sion control 
bits (bits 9-8) 
can be used to set the 


8087 internal 
operating 
precision 
at less than the 


default of temporary 
real precision. 
This can be use- 


ful in providing 
compatibility 
with earlier generation 


arithmetic 
processors 
of smaller 
precision 
than the 


8087. The rounding 
control 
bits (bits 11-10) provide 


for directed 
rounding 
and true chop as well as the 


unbiased 
round 
to nearest 
mode specified 
in the 


proposed 
IEEE standard. 
Control over closure of the 


number 
space 
at infinity 
is also provided 
(either 


affine closure, 
±oo, or projective 
closure, 
00, is treated 


as unsigned, 
may be specified). 


I 
x x 
X I, C I 
R C I 
P C I M I x I PM I UM I OM I ZM I OM 11M I 


I 


(llPrecision 
Control 
00" 
24 bits 
01 = Reserved 
10 = 53 bits 
11 = 64 bits 


12lAounding 
Control 
00 = Round 
to Nearest 
or Even 


01 = Round 
Down 
(toward 
- ••) 
10 = Round 
Up (toward 
+ •• ) 
11 = Chop 
(truncate 
toward 
zero) 


The 8087 detects six different 
exception 
conditions 


that can occur during 
instruction 
execution. 
Any or 


all exceptions 
will cause an interrupt 
if unmasked 


and interrupts 
are enabled. 


If interrupts 
are disabled 
the 8087 will simply 
con- 


tinue 
execution 
regardless 
of whether 
the 
host 


clears the exception. 
If a specific 
exception 
class is 


masked 
and that 
exception 
occurs, 
however, 
the 


8087 will post the exception 
in the status 
register 


and perform 
an on-chip 
default 
exception 
handling 


procedure, thereby allowing 
processing 
to continue. 


The 
exceptions 
that 
the 
8087 
detects 
are 
the 


following: 


1. INVALID OPERATION: Stack overflow, 
stack un- 


derflow, 
indeterminate 
form 
(0/0, 00- 
00, etc.) or 


the use of a Non-Number 
(NAN) as an operand. 


An exponent value is reserved and any bit pattern 
with this value in the exponent 
field is termed 
a 


Non-Number 
and causes this exception. 
If this 


exception 
is masked, the 8087's default 
response 
is to generate a specific 
NAN called INDEFINITE, 


or to propagate 
already existing 
NANs as the cal- 


culation 
result. 


INVALID 
OPERATION 


DENORMALIZEO 
OPERAND 


ZERO 
DIVIDE 


OVERFLOW 


UNDERFLOW 


PRECISION 


(RESERVED) 


INTERRUPT 
MASK 
(1 = INTERRUPTS 
ARE MASKED) 


PRECISION CONTROl(1) 


ROUNDING 
CONTROL(21 


INFINITY 
CONTROL 
(0 :: PROJECTIVE, 
1 = AFFINE) 


(RESERVED) 


inter 


2. OVERFLOW: The result is too large in magnitude 


to fit the specified 
format. The 8087 will generate 
an encoding 
for 
infinity 
if this 
exception 
is 
masked. 


3. ZERO DIVISOR: The divisor is zero while the divi- 
dend 
is a non-infinite, 
non·zero 
number. 
Again, 
the 8087 will generate 
an encoding 
for infinity 
if 


this exception 
is masked. 


4. UNDERFLOW: 
The 
result 
is non-zero 
but 
too 


small in magnitude 
to fit in the specified 
format. If 


this 
exception 
is 
masked 
the 
8087 
will 
denormalize 
(shift right) the fraction 
until the ex- 
ponent 
is in range. This process is called gradual 


underflow. 


5. DENORMALIZED 
OPERAND: 
At least one of the 


operands 
or the result is denormalized; 
it has the 


smallest 
exponent 
but 
a non-zero 
significand. 


Normal processing 
continues 
if this exception 
is 


masked off. 


6. INEXACT RESULT: If the true result is not exactly 


representable 
in the specified 
format, 
the result 


is rounded 
according 
to the rounding 
mode, and 


this flag is set. If this exception 
is masked, 
pro- 


cessing 
will simply 
continue. 


inter 


Ambient Temperature Under Bias 
O"Cto 70"C 
Storage Temperature 
-65°C to +150"C 
Voltage on Any Pin with 
Respect to Ground ............•....... 
-1.0V 
to + 7V 
Power Dissipation 
3.0 Watt 


·NOTlCE: 
Stresses above those listed 
under 
Absolute 
Maximum 
Ratings may cause permanent 
damage to the 
device. This is a stress rating only and functional 
opera- 
tion of the device at these or any other conditions 
above 


those 
indicated 
in the 
operational 
sections 
of this 
specification 
is not implied. 
Exposure to absolute maxi- 
mum rating conditions 
for extended periods 
may affect 


device reliability. 


Symbol 
Parameter 
Min. 
Max. 
Units 
Teat Conditions 


Vll 
Input low 
Voltage 
-0.5 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
Vee +0.5 
V 


VOL 
Output low Voltage 
0.45 
V 
IOL = 2.0 mA 


VOH 
Output High Voltage 
2.4 
V 
IOH = -400 
IJ-A 


Ice 
Power Supply Current 
475 
mA 
TA = 25°C 


III 
Input leakage 
Current 
:!:10 
IJ-A 
OV..;; VIN";; Vee 


IlO 
Output leakage 
Current 
:!:10 
IJ-A 
0.45V ..;;VOUT ..;;Vee 


VeL 
Clock Input low 
Voltage 
-0.5 
+0.6 
V 


VeH 
Clock Input High Voltage 
3.9 
Vee + 1.0 
V 


CIN 
Capacitance 
of Inputs 
10 
pF 
fc = 1 MHz 


CIO 
Capacitance 
of I/O Buffer 
(AD0-15, A16-A19, BHE, S2-S0, 
.- 
15 
pF 
fc = 1 MHz 
RO/GT) and ClK 


COUT 
Capacitance 
of Outputs 
BUSY,INT 
10 
pF 
fc = 1 MHz 


A.C. CHARACTERISTICS 
(TA = O°Cto 70°C, Vee = +5V :!:10%) 


TIMING 
REQUIREMENTS 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Conditions 


TClCl 
ClK Cycle Period 
200 
500 
ns 


TClCH 
ClK 
low Time 
(% TClCl) 
- 15 
ns 


TCHCl 
ClK High Time 
('13 TClCl) 
+ 2 
ns 


TCH1CH2 
ClK 
Rise Time 
- 
10 
ns 
From 1.0V to 3.5V 


TCL2CL1 
ClK 
Fall Time 
10 
ns 
From 3.5V to 1.0V 


TDVCl 
Data In Setup Time 
30 
ns 


TClDX 
Data In Hold Time 
10 
ns 


TRYHCH 
READY Setup Time 
(% TClCl) 
- 15 
ns 


TCHRYX 
READY Hold Time 
30 
ns 


TRYlCl 
READY Inactive to ClK (See Note 3) 
-8 
ns 


TGVCH 
RO/GT Setup Time 
30 
ns 


TCHGX 
RO/GT Hold Time 
40 
ns 


TOVCl 
OS0-1 Setup Time 
30 
ns 


TClOX 
OSo-l 
Hold Time 
10 
ns 


TSACH 
Status Active Setup Time 
30 
ns 


TSNCl 
Status Inactive Setup Time 
30 
ns 


TlllH 
Input Rise Time (Except ClK) 
20 
ns 
From 0.8V to 2.0V 


TIHll 
Input Fall Time (Except ClK) 
12 
ns 
From 2.0V to 0.8V 


inter 


A.C. CHARACTERISTICS 
(Continued) 


TIMING RESPONSES 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test Conditions 


TCLML 
Command Active Delay (See Note 1) 
• 
10 
35 
ns 


TCLMH 
Command Inactive Delay (See Note 1) 
10 
35 
ns 


TRYHSH 
Ready Active to Status Passive (See Note 2) 
110 
ns 


TCHSV 
Status Active Delay 
10 
110 
ns 
')l1 


TCLSH 
Status Inactive Delay 
10 
130 
ns 


TCLAV 
Address Valid Delay 
! 
" .,.! 
10 
110 
ns 


TCLAX 
Address Hold Time 
10 
ns 


TCLAZ 
Address Float Delay 
TCLAX 
80 
ns 


TSVLH 
Status Valid to ALE High (See Note 1) 
15 
ns 


TCLLH 
CLK Low to ALE Valid (See Note 1) 
15 
ns 
TCHLL- 
ALE Inactive Delay (See Note 1) 
15 
ns 
CL = 20-100 
pF for all 


TCLDV 
Data Valid Delay 
10 
110 
ns 
8087 Outputs (in addition 


TCHDX 
Data Hold Time 
10 
ns 
to 8087 self-load) 


TCVNV 
Control Active Delay (See Note 1) 
5 
45 
ns 


TCVNX 
Control Inactive Delay (See Note 1) 
10 
45 
ns 


TCHBV 
BUSYand 
INT Valid Delay 
10 
150 
ns 


TCHDTL 
Direction 
Control Active Delay (See Note 1) 
50 
ns 


TCHDTH 
Direction 
Control Inactive Delay (See Note 1) 
30 
ns 


TCLGL 
RQ/GT Active Delay 
0 
85 
ns 
CL = 40 pF (in 


TCLGH 
RQ/GT 
Inactive Delay 
0 
85 
ns 
addition 
to 8087 self-load) 


TOLOH 
Output Rise Time 
20 
ns 
From 0.8V to 2.0V 


TOHOL 
Output Fall Time 
12 
ns 
From 2.0V to 0.8V 


NOTES: 
1. Signal at 8284Aor 8288shown for reference only. 
2. Applies only to T3 and wait states. 
3. Applies only to T2 state (8 ns into T3)' 


2.4--v.S_TEST 
POINTS_tY- 


0.4S~--------~~ 
IC'~100pF 


A.G. TESTING: 
INPUTS 
ARE DRIVEN 
AT 2.4V FOR A LOGIC "1" ANOQ.45V 
FOR 
A LOGtC "0." 
THE CLOCK 
IS DRIVEN 
AT 4.3V AND 
02SV 
TIMING 
MEASURE- 
MENTS 
ARE 
MADE 
AT , .5V FOR BOTH 
A lOGIC' 
". 
AND "0:" 


inter 


READY 
(8087 
INPUT) 
{ 
(SEE 
NOTE 
2) 


{M::~ 


DEN 


{ 
AM:: 


MWTC 


1. ALL SIGNALS 
SWITCH 
BETWEEN 
VOL AND VOH 
UNLESS 
OTHERWISE 
SPECIFIED. 


2. 
READY 
IS SAMPLED 
NEAR THE 
END OFT2. 
T3 ANDTWTO 
DETERMINE 
IFTW 
MACHINE 
STATES ARE TO BE INSERTED. 


3. 
THE 
LOCAL 
BUS FLQ<\TS ONLY 
IF THE 
8087 
IS RETURNING 
CONTROL 
TO THE 
808618088. 


4. ALE 
RISES 
AT LATER OF (TSVLH. 
TCLLH). 


5. 
STATUS 
INACTIVE 
IN STATE JUST 
PRIOR 
TO T 4. 


6. SIGNALS 
AT 8284A 
OR 8288 ARE SHOWN 
FOR 
REFERENCE 
ONLY. 


7. THE 
ISSUANCE 
OF 8288 
COMMAND 
AND CONTROL 
SIGNALS 
(MADe. 
MWTC, 
AMWC AND 
DEN) 
LAGS THE ACTIVE 
HIGH 
8288 
CEN. 


8. 
ALL TIMING 
MEASUREMENTSAAE 
MADE 
AT 1.5V UNLESS 
OTHERWISE 
NOTED. 


inter 


READY I 
IN~ 


_ 
r-~~g:~~~OCLK 
---v 
CYCLE 


TCLGL 
--r 
~7 


inter 


A015-ADo 
Al,/S,-A"/S, 
52.$,,50 
SHE/S7 


_________ 
808_7 
•••: 
•••: 
_ 
ALTERNATE 
MASTER 


(SEE 
NOTE) 


'",,:: 
\~--=--~=- 
-~=T-C=-H=-B:-,tf==========~- 


inter 


Table 5. 8087 Extensions to the 8086/8088 Instruction Set 


7 
6 
5 
4 
3 
2 
1 
0 
7 
6 
5 
4 
3 
2 
1 
0 
7 
6 
5 
4 
3 
2 
1 
0 
7 
6 
5 
4 
3 
2 
1 
0 


Data Transfer 


FLD '" LOAD 


Integer/Real 
Memory 
to ST(o) 
ESCAPE 
MF 
1 I 
MOD 
RIM 
(DISP-LO) 
(DISP-HI) 


Long Integer 
Memory 
10 Sl(o) 
ESCAPE 
1 I 
MOD 
1 
a 
1 
RIM 
(DISP-LO) 
(DISP-HI) 


Temporary 
Real Memory 
to 5T(0) 
ESCAPE 
0 
1 
1 I 
MOD 
1 
a 
1 
RIM 
(DISP-LO) 
(DISP-HI) 


BCD Memory 
10 ST(o) 
ESCAPE 
1 I 
MOD 
1 
RIM 
(DISP-LO) 
(DISP-HI) 


ST(i) 
to ST(o) 
ESCAPE 
1 I 
1 
0 
ST(I) 


FST = STORE 


ST(o) 
10 Integer/Real 
Memory 
ESCAPE 
MF 
1 I 
MOD 
1 
a 
RIM 
(DISP-LO) 
(DISP-HI) 


ST(O) 10 ST{i) 
ESCAPE 
1 
a 
1 I 
1 
0 
1 
0 
ST(I) 


FSTP 
" STORE 
AND 
POP 


Sl(o) 
to Inleger/Real 
Memory 
ESCAPE 
MF 
1 I 
MOD 
0 
1 
RIM 
(DISP-LO) 
(DISP-HI) 
J 


ST(o) 
to long 
Integer 
Memory 
ESCAPE 
1 I 
MOD 
1 
RIM 
(DISP-LO) 
(DISP-HI) 
I 


Sl{O) 
to Temporary 
Real Memory 
ESCAPE 
0 
1 
1 I 
MOD 
1 
RIM 
(DISP-LO) 
(DISP-HI) 
I 


Sl(o) 
to BCD 
Memory 
ESCAPE 
1 I 
MOD 
1 
1 
a 
RIM 
(DISP-LO) 
(DISP-HI) 
I 


ST(o) 
to ST(i) 
ESCAPE 
1 
a 
1 I 
1 
1 
a 
1 
1 
ST(I) 


FXCH 
'" Exchange 
5T(i) 
and SI{Ol 
ESCAPE 
a 
a 
1 
1 
a 
a 
1 
ST(I) 


Comparison 


FCOM 
" Compare 


Integer/Real 
Memory 
10 5T(0) 
ESCAPE 
MF 
a I 
MOD 
a 
1 
RIM 
(DISP-LO) 
(DISP-HI) 


STeil to ST(o) 
ESCAPE 
a 
a 
a I 
1 
a 
1 
a 
STP) 


FCOMP 
= Compare 
and Pop 


Integer/Real 
Memory 
to 5T(0) 
ESCAPE 
MF 
a I 
MOD 
a 
1 
RIM 
(DISP-LO) 
IDISP-HI) 


5T(I) 10ST(o) 
ESCAPE 
a 
a 
a I 
1 
a 
1 
1 
ST(I) 


FCOMPP 
= Compare 
5T(1) 
to ST(o) 
ESCAPE 
1 
a I 1 
1 
a 
1 
1 a 
1 I 
and Pop Twice 
FTST " Test 5T(0) 
ESCAPE 
a 
a 
1 I 1 
1 
a 
a 
1 
a 
a I 


FXAM 
:: Examine 
ST(o) 
ESCAPE 
a 
a 
1 I 1 
1 
a 
a 1 
1 I 


inter 


Arithmetic 
7 
6 
5 
4 
3 
2 
1 
0 
7 
6 
5 
4 
3 
2 
1 
0 
7 
6 
5 
4 
3 
2 
1 
0 
7 
6 
5 
4 
3 
2 
1 
0 


FADD 
" Addition 


Integer/Real 
Memory 
with ST(o) 
ESCAPE 
MF 
IMOD 
0 
0 
0 
AIM 
(DISP-LO) 
(DISP-HI) 


ST(i) 
and ST(o) 
ESCAPE 
d 
P 
o 11 
1 
0 
STeil 


FSUB 
" Subtraction 


Integer/Real 
Memory with ST(o) 
ESCAPE 
MF 
o IMOD 
1 
0 
A 
AIM 
(DISP-LO) 
(DISP-HI) 


ST(i) and ST(o) 
ESCAPE 
d 
P 
11 
1 
0 
A 
AIM 


FMUl 
= Multiplication 


Integer/Real 
Memory 
with 
ST(o) 
ESCAPE 
MF 
IMOD 
0 
1 
AIM 
(DISP-LO) 
(DISP-Hl) 


ST(i) 
and ST(o) 
ESCAPE 
d 
P 
11 
1 
0 
0 
1 
AIM 


FDIV 
= Division 


Integer/Real 
Memory 
with ST(o) 
ESCAPE 
MF 
01 MOD 
1 
1 
A 
AIM 
(DISP-LO) 
(DISP-Hl) 


ST(i) and ST(o) 
ESCAPE 
d 
P 
o 11 
1 
A 
AIM 


FSQRT 
= Square 
Root of ST(o) 
ESCAPE 
1 11 
0 
1 
1 


FSCALE 
= Scale ST(o) by ST(1) 
ESCAPE 
1 11 
1 
0 
1 I 


FPREM 
= Partial Remainder 
of 


C"£SCAPE 
ST(Ol .;- 5T(l) 
0 
0 
1 
11 
1 
0 
0 
0 


FRNDINT 
= Round ST(o) 
to Integer 
ESCAPE 
0 
0 
1 11 
1 
0 
0 


FXTRACT 
:: Extract 
Components 


11 
I 
of ST(o) 
ESCAPE 
1 
1 
0 
1 
0 


FABS 
= Absolute 
Value of ST(D) 
ESCAPE 
0 
0 
1 11 
1 
0 
0 
C 
0 
1 I 


FCHS 
:::Change 
Sign 
of ST(o) 
ESCAPE 
1 11 
1 
0 
0 
0 
0 I 


Transcendental 


FPTAN 
= Partial Tangent 
of ST(D) 
ESCAPE 
0 
0 
1 11 
1 
0 
0 
1 


FPATAN 
= Partial Arctangent 
of 
1 11 
I 
ST(o) + ST(1) 
ESCAPE 
0 
0 
1 
0 
0 
1 
1 


F2XM1 
= 2 5T/O)_1 
ESCAPE 
0 
0 
1 11 
1 
0 
0 
0 1 


FYL2X 
= ST(1)' 
L092 [5T(O)l 
ESCAPE 
0 
0 
1 11 
1 
0 
0 
0 
1 I 


FYL2XP1 
= ST(1) • L092 [ST(o) 
+ 1} 
ESCAPE 
0 
0 
1 11 
1 
0 
1 I 


Constants 


FlDZ 
'" lOAD 
+ 0.0 into ST(Ol 
ESCAPE 
111 
1 
0 
1 
1 
01 


FLD1 
= LOAD. 
1.0 into ST(O) 
ESCAPE 
0 
1 
1 
0 
1 
o I 


FlOPI 
'" LOAD" 
into ST(Ol 
ESCAPE 
0 
1 
1 
0 
1 
0 
1 
1 1 


FLDL2T" 
lOAD 
1092 10 into ST(O) 
ESCAPE 
0 
1 
0 
1 
0 
0 
1 I 


HOl2E 
= LOAD 
tog 2 e into ST(O) 
ESCAPE 
0 
1 
1 
0 
1 
0 
1 
I 


FLDLG2 
= lOAD 
109,0 
2 into ST(Ol 
ESCAPE 
0 
0 
1 
1 
0 
1 
1 
0 
0 I 


FLDlN2 
= lOAD 
loge 2 into ST(O) 
ESCAPE 
0 
1 
1 
0 
1 
1 
0 
1 I 


7 
6 
5 
4 
3 
2 
1 
0 
7 
6 
5 
4 
3 
2 
1 
'0 
7 
6 
5 
4 
3 
2 
1 
0 
7 
6 
5 
4 
3 
2 
1 
0 


Processor Control 


FINIT'" 
Initialize 
NQP 
ESCAPE 
0 
1 
1 
11 
1 
1 
0 
0 
0 
1 
1 I 


FEN I '" Enable 
Interrupts 
ESCAPE 
0 
1 
1 
11 
1 
1 
0 
0 
0 
0 
0 I 


FDI$I 
= Disable 
Interrupts 
ESCAPE 
0 
1 
1 
11 
1 
1 
0 
0 
0 
0 
1 I 


FLDCW 
= Load Control 
Word 
ESCAPE 
0 
0 
1 I MOD 
1 
0 
1 
RIM 
I 
(DISP-LO) 
1 
(DISP-HI) 
I 


F$TCW 
= Store Control 
Word 
ESCAPE 
0 
0 
1 
MOD 
1 , , 
RIM 
I 
(DISP-LO) 
I 
(DISP-HI) 
I 


fSTSW 
= Store Status Word 
ESCAPE 
1 
0 
1 
MOD 
1 , 
1 
RIM 
1 
(DISP-LO) 
I 
(DISP-HI) 
I 


FCLEX 
" Clear Exceptions 
ESCAPE 
0 
1 
1 
1 
1 , 
0 
0 
0 
1 
01 


FSTENV 
= Store 
Environment 
ESCAPE 
0 
0 
1 
MOD 
1 
1 
0 
RIM 
I 
(DISP-LO) 
1 
(DISP-HI) 
1 


FlDENV 
= load 
Environment 
ESCAPE 
0 
0 
1 
MOD 
1 
0 
0 
RIM 
I 
(DISP-LO) 
I 
(DISP-HI) 
I 


FSAVE = Save State 
ESCAPE 
1 
0 
1 
MOD 
, , 
0 
RIM 
I 
(DISP-LO) 
I 
(DISP-HI) 
1 


FRSTOR 
" Restore State 
I 
ESCAPE 
1 
0 
1 I MOD 
1 
0 
0 
RIM 
1 
(DISP-LO) 
1 
(DISP-HI) 
I 


FINCSTP 
= Increment 
Stack 
Pointer 
1 
ESCAPE 
0 
0 
1 
11 
1 
1 , 
0 
1 
1 
1 I 


FDECSTP'" 
Decrement 
Slack 
Pointer I 
ESCAPE 
0 
0 
1 
11 
1 
1 , 
0 
1 
1 
0 I 


FFREE = Free ST(i) 
I 
ESCAPE 
1 
0 
1 l' 


1 
0 
0 
0 
STII) 
1 


FNOP = No Operation 
I 
ESCAPE 
0 
0 
1 
11 
1 
0 
1 
0 
0 
0 
0 1 


FWAIT 
= CPU 
Wait for NDP 
I 1 
0 
0 
1 
1 
0 
1 
1 I 


. 
, 


if mod = 00 then DISP= 0·, disp-Iow and disp-high are absent 
if 
mod = 01 then 
DISP= disp-)ow 
sign-extended 
to 
16-bits, 
disp-high 
is absent 
if mod = 10 then DISP= disp-high; disp-Iow 
if mod = 11 then rim is treated as an ST(i) field 


ST(O)= 
ST(i) = 
Current stack top 
ith register below stack top 


Destination 
0- 
Destination is ST(O) 
1 - 
Destination is ST(i) 


Pop 
0- 
No pop 
1 - 
Pop ST(O) 


if rim = 000 then EA = (BX) + (SI) 
+ DISP 


if rim = 001 then EA = (BX) + (01) 
+ DISP 
if rim = 010 then EA = (BP) + (SI) 
+ DISP 


if rim = 011 then EA = (BP) + (01) 
+ DISP 
if rim = 100 then EA = (SI) 
+ DISP 
if rim = 101 then EA = (01) 
+ DISP 
if rim = 110 then EA = (BP) + DISp· 
if rim = 111 then EA = (BX) + DISP 


Reverse: 
When d = 1 reverse the sense of R. 


0- 
Destination (op) Source 
1 - 
Source (op) Destination 


MF = 
Memory Format 
00 - 
32-bit Real 
01 - 
32-bit Integer 
10 - 
64-bit Real 
11 - 
16-bit Integer 


-0 
:S ST(O) 
:S +0> 
_2'5,,; 
ST(1) < +2'5 and ST(1) integer 


O:S ST(O) 
:S 2-1 


0< 
ST(O) 
< 0> 


-0> < ST(1) < + 0> 
o '" 
IST(O)I 
< (2 - ..;2)/2 


_00 < ST(1) < 00 
0", 
ST(O)< n/4 
o '" ST(O)< ST(1)< + 00 


For FSQRT: 
For FSCALE: 
For F2XM1: 
For FYL2X: 


8087 Instructions, 
Encoding and Decoding 


APPENDIX 
A 
MACHINE 
INSJRUGJIG>N· ENCODING 
AND DECODING 


8087 machine 
instructions 
assume one of five 


different 
forms as shown in table A-I. 
In all 


cases, the instructions are at least two bytes long 
and begin with the bit pattern 
lIOIIB, 
which 


identifies the escape class of instructions. Instruc- 
tions 
which 
reference 
memory 
operands 
are 


encoded 
much 
like similar 
CPU 
instructions, 
since the CPU must calculate the operands effec- 
tive address from the information 
contained 
in 


the 
instruction. 
Section 
4.2 
discusses 
this 


encoding scheme in more detail, and in particular, 
shows how each memory 
addressing 
mode 
is 


encoded. 


Note that all instructions (except those coded with 
a "no-wait" 
mnemonic) 
are 
preceded 
by an 


assembler-generated 
CPU 
WAIT 
instruction 


(encoding: 
10011011 B). 
Segment 
override 


prefixes may also precede 8087 instructions in the 
instruction stream. 


~ 


Lower-addressed 
Byte 
Higher-addressed 
Byte 
0,1,or2 


r 


1 
1 
0 
1 
1 
OP-A 
1 
MOD 
1 
OP-B 
R/M 
DISPLA9 


1 
1 
0 
1 
1 
FORMAT OP-A 
MOD 
OP-B 
R/M 
DISPLAC 


r 


OP-A 1 


J 


1 
1 
0 
1 
1 
R 
P 
1 
OP-B 
REG 


1 
1 
0 
1 
1 
0 
0 
1 
1 
1 
1 
OP 


1 
1 
0 
1 
1 
0 
1 
1 
1 
1 
1 
OP 


(1)Memory transfers, 
including applicable 
processor control instructions; 
0, 1, or 2 displacement 
bytes may 
follow. 


(2)Memory arithmetic and compSlrison instructions; 
0, 1, or 2 displacement 
bytes may follow. 


(3)Stackarithmetic and comparison instructions. 


(4)Constant, transcendental, 
some arithmetic instructions. 


(5)Processor control instructions 
that do not reference memory. 


OP, OP-A, OP-B: Instruction opcode, possibly split into two fields. 


MOD: Same as CPU mode field; see table 4-8. 


R/M: Sames as CPU register/memory 
field; see table 4-10. 


FORMAT: Defines memory operand 
00= short real 
01= short integer 
10= long real 
11= word integer 


R: 0 = return result to stack top' 


1 = return result to'other register 


P: 0 = do not pop stack 


1 = pop stack after operation 


REG: register stack element 


000= stack top 
001= next on stack 
010= third stack element, etc, 


Table A-2 lists all 8087 machine 
instructions 
in 
the data bus. Users writing 
exception 
handlers 
binary 
sequence. 
This 
table 
may 
be 
used to 
may also find this information 
useful to identify 
"disassemble" 
instructions 
in 
unformatted 
the ?ffending 
instruction. 


memory 
dumps or instructions 
monitored 
from 


Table A-2. 
Machine 
Instruction 
Decoding 
Guide 


1st Byte 
2nd Byte 
Bytes 3,4 
ASM-86 Instruction 
Hex 
Binary 
Format 


08 
1101 1000 
MOOOO OR/M 
(disp-Io),(disp-hi) 
FAOO 
short-real 
08 
1101 1000 
MOOOO 1R/M 
(disp-Io),(disp-hi) 
FMUL 
short-real 
08 
1101 1000 
MOO01 
OR/M 
(disp-Io),(disp-hi) 
FCOM 
short-real 
08 
1101 1000 
MOO01 
1R/M 
(disp-Io),(disp-hi) 
FCOMP 
short-real 
08 
1101 1000 
M0010 
OR/M 
(disp-Io),(disp~hi) 
FSUS' 
short-real 
08 
1101 1000. 
M0010 
1R/M 
(disp-Io),(disp-hi) 
FSUBR 
short-real 
08 
1101 1000 
M0011 
OR/M 
(disp-Io),(disp-hi) 
FOIV 
short-real 
08 
1101 1000 
M0011 
1R/M 
(disp-Io),(disp-hi) 
FOIVR 
short-real 
08 
1101 1000 
1100 
OREG 
FAOO 
ST,ST(i) . 


08 
1101 1000 
1100 
1REG 
FMUL 
ST,ST(i) 


08 
1101 1000 
1101 
OREG 
FCOM 
ST(i) 
08 
1101 1000 
1101 
1REG 
FCOMP 
ST(i) 
08 
1101 1000 
1110 
OREG 
FSUS 
ST,ST(i) 
08 
1101 1000 
1110 
1REG 
FSUSR 
ST,ST(i) 
08 
1101 1000 
1111 
OREG 
FOIV 
ST,ST(i) 


08 
1101 1000 
1111 
1REG 
FOIVR 
ST,ST(i) 
09 
1101 1001 
MOOOO OR/M 
(disp-Io),(disp-hi) 
FLO 
shorl-real 
09 
1101 1001 
MOOOO 1R/M 
reserved 
09 
1101 1001 
MOO01. OR/M 
(disp-Io),(disp-hi) 
FST 
short-real 


1st Byte 
ASM-86 
Instruction 


Hex 
Binary 
2nd Byte 
Bytes 
3,4 
Format 


09 
1101 
1001 
MOO01 
1R/M 
(disp-Io),(disp-hi) 
FSTP 
short-real 


09 
1101 
1001 
M0010 
OR/M 
(disp-Io),(disp-hi) 
FLOENV 
14-bytes 
.. 


09 
1101 
1001 
M0010 
1R/M 
(disp-Io),(disp-hi) 
FLOCW 
2-bytes 
09 
1101 
1001 
M0011 
OR/M 
(disp-Io),(disp-hi) 
FSTENV 
14-bytes 


09 
1101 
1001 
M0011 
1R/M 
(disp-Io):(disp-hi) 
FSTCW 
2-bytes 
09 
1101 
1001 
1100 
OREG 
FLO 
ST(i) 


09 
1101 
1001 
1100 
1REG 
FXCH 
ST(i) 
'. 


09 
1101 
1001 
1101 
0000 
FNOP 
09 
1101 
1001 
1101 
0001 
reserved 
. 


09 
1101 
1001 
1101 
001- 
reserved 
09 
1101 
1001 
1101 
01-- 
reserved 
I 


09 
1101 
1001 
1101 
1REG 
*(1 ) 
09 
1101 
1001 
1110 
0000 
FCHS 
09 
1101 
1001 
1110 
0001 
FABS 
09 
1101 
1001 
1110 
001- 
reserved 
09 
1101 
1001 
1110 
0100 
FTST 
09 
1101 
1001 
1110 
0101 
FXAM 
09 
1101 
1001 
1110 
011- 
reserved 
09 
1101 
1001 
1110 
1000 
FL01 
09 
1101 
1001 
1110 
1001 
FLOL2T 
09 
1101 
1001 
1110 
1010 
FLOL2E 
09 
1101 
1001 
1110 
1011 
FLOPI 
09 
1101 
1001 
1110 
1100 
FLOLG2 
09 
1101 
1001 
1110 
1101 
FLOLN2 
09 
1101 
1001 
1110 
1110 
FLOZ 
09 
1101 
1001 
1110 
1111 
reserved 
09 
1101 


, 


1001 
1111 
0000 
F2XM1 
09 
1101 
1001 
1111 
0001 
FYL2X 
09 
1101 
1001 
1111 
0010 
FPTAN 
09 
1101 
1001 
1111 
0011 
FPATAN 
09 
1101 
1001 
1111 
0100 
FXTRACT 
09 
1101 
1001 
1111 
0101 
reserved 
09 
1101 
1001 
1111 
0110 
FOECSTP 
09 
1101 
1001 
1111 
0111 
FINCSTP 
09 
1101 
1001 
1111 
1000 
FPREM 
09 
1101 
1001 
1111 
1001 
FYL2XP1 
09 
1101 
1001 
1111 
1010 
FSQRT 
09 
1101 
1001 
1111 
1011 
reserved 
09 
1101 
1001 
1111 
1100 
FRNOINT 
09 
1101 
1001 
1111 
1101 
FSCALE 
09 
1101 
1001 
1111 
111- 
reserved 
OA 
1101 
1010 
MOOOO 
OR/M 
(disp-Io),(disp-hi) 
FIAOO 
short-integer 


OA 
1101 
1010 
MOOOO 
1R/M 
(disp-Io),(disp-hi) 
FIMUL 
short-integer 


OA 
1101 
1010 
MOO01 
OR/M 
(disp-Io),(disp-hi) 
FICOM 
short-integer 


OA 
1101 
1010 
MOO01 
1R/M 
(disp-Io),(disp-hi) 
FICOMP 
short-integer 


OA 
1101 
1010 
M0010 
OR/M 
(disp-Io),(disp-hi) 
FISUB 
short-integer 


OA 
1101 
1010 
M0010 
1R/M 
(disp-Io),(disp-hi) 
FISUBR 
short-integer 


1st Byte 
2nd Byte 
Bytes 3,4 
ASM-86 
Instruction 


Hex 
Binary 
Format 
. 


DA 
1101 
1010 
MOD11 
OR/M 
(disp-Io),(disp-hi) 
FIDIV 
short-integer 


DA 
1101 
1010 
MOD11 
1R/M 
(disp-Io),(disp-hi) 
FIDIVR 
short-integer 


DA 
1101 
1010 
11-- 
---- 
reserved 
DB 
1101 
1011 
MODOO 
OR/M 
(disp-Io),(disp-hi) 
FILD 
short-integer 


DB 
1101 
1011 
MODOO 
1R/M 
(disp-Io),(disp-hi) 
reserved 
DB 
1101 
1011 
MOD01 
OR/M 
(disp-Io),(disp-hi) 
FIST 
short-integer 
DB 
1101 
1011 
MOD01 
1R/M 
(disp-Io),(disp-hi) 
FISTP 
short-integer 


DB 
1101 
1011 
MOD10· 
OR/M 
(disp-Io),(disp-hi) 
reserved 
DB 
1101 
1011 
MOD10 
1R/M 
(disp-Io),(disp-hi) 
FLD 
temp-real 
DB 
1101 
1011 
MOD11 
OR/M 
(disp-Io),(disp-hi) 
reserved 
DB 
1101 
1011 
MOD11 
1R/M 
(disp-Io),(disp-hi) 
FSTP 
temp-real 
DB 
1101 
1011 
110- 
---- 
reserved 
DB 
1101 
1011 
1110 
0000 
FENI 
DB 
1101 
1011 
1110 
0001 
FDISI 
DB 
1101 
1011 
1110 
0010 
FCLEX 
DB 
1101 
1011 
1110 
0011 
FINIT 
DB 
1101 
1011 
1110 
01-- 
reserved 
DB 
1101 
1011 
1110 
1--- 
reserved 
DB 
1101 
1011 
1111 
---- 
reserved 
DC 
1101 
1100 
MODOO 
OR/M 
(disp-Io),(disp-hi) 
FADD 
long-real 
DC 
1101 
1100 
MODOO 
1R/M 
(disp-Io),(disp-hi) 
FMUL 
long-real 
DC 
1101 
1100 
MOD01 
OR/M 
(disp-Io),(disp-hi) 
FCOM 
long-real 
DC 
1101 
1100 
MOD01 
1R/M 
(disp-Io),(disp-hi) 
FCOMP 
long-real 
DC 
1101 
1100 
MOD10 
OR/M 
(disp-Io),(disp-hi) 
FSUB 
long-real 
DC 
1101 
1100 
MOD10 
1R/M 
(disp-Io),(disp-hi) 
FSUBR 
long-real 
DC 
1101 
1100 
MOD11 
OR/M 
(disp-Io),(disp-hi) 
FDIV 
long-real 
DC 
1101 
1100 
MOD11 
1R/M 
(disp-Io),(disp-hi) 
FDIVR 
long-real 
DC 
1101 
1100 
1100 
OREG 
FADD 
ST(i),ST 
DC 
1101 
1100 
1100 
1REG 
FMUL 
ST(i),ST 
DC 
1101 
1100 
1101 
OREG 
*(2) 
DC 
1101 
1100 
1101 
1REG 
*(3) 
DC 
1101 
1100 
1110 
OREG 
FSUB 
ST(i),ST 
DC 
1101 
1100 
1110 
1REG 
FSUBR 
ST(i),ST 
DC 
1101 
1100 
1111 
OREG 
FDIV 
ST(i),ST 
DC 
1101 
1100 
1111 
1REG 
FDIVR 
ST(i),ST 
DD 
1101 
1101 
MODOO 
OR/M 
(disp-Io),(disp-hi) 
FLD 
long-real 
DD 
1101 
1101 
MODOO 
1R/M 
reserved 
DD 
1101 
1101 
MOD01 
OR/M 
(disp-Io),(disp-hi) 
FST 
long-real 
DD 
1101 
1101 
MOD01 
1R/M 
(disp-Io),(disp-hi) 
FSTP 
long-real 
DD 
1101 
1101 
MOD10 
OR/M 
(disp-Io),(disp-hi) 
FRSTOR 
94-bytes 
DD 
1101 
1101 
MOD10 
1R/M 
(disp-Io),(disp-hi) 
reserved 
DD 
1101 
1101 
MOD11 
OR/M 
(disp-Io),(disp-hi) 
FSAVE 
94-bytes 
DD 
1101 
1101 
MOD11 
1R/M 
(disp-Io),(disp-hi) 
FSTSW 
2-bytes 
DD 
1101 
1101 
1100 
OREG 
FFREE 
ST(i) 
DD 
1101 
1101 
1100 
1REG 
*(4) 
DD 
1101 
1101 
1101 
OREG 
FST 
ST(i) 
DD 
1101 
1101 
1101 
1REG 
FSTP 
ST(i) 


1st Byte 
2nd Byte 
Bytes 3,4 
ASM-86 Instruction 


Hex 
Binary 
Format 


DD 
1101 1101 
111- 
---- 
reserved 
DE 
1101 1110 
MODOO OR/M 
(disp-Io),(disp-hi) 
FIADD 
word-integer 


DE 
1101 1110 
MODOO 1R/M 
(disp-Io),(disp-hi) 
FIMUL 
word-integer 


DE 
1101 1110 
MOD01 
OR/M 
(disp-Io),(disp-hi) 
FICOM 
word-integer 


DE 
1101 1110 
MOD01 
1R/M 
(disp-Io),(disp-hi) 
FICOMP 
word-integer 


DE 
1101 1110 
MOD10 
OR/M 
(disp-Io),(disp-hi) 
FI8UB 
word-integer 


DE 
1101 1110 
MOD10 
1R/M 
(disp-Io),(disp-hi) 
FI8UBR 
word-integer 


DE 
1101 1110 
MOD11 
OR/M 
(disp-Io),(disp-hi) 
FIDIV 
word-integer 


DE 
1101 1110 
MOD11 
1R/M 
(disp-Io),(disp-hi) 
FIDIVR 
word-integer 


DE 
1101 1110 
1100 
OREG 
FADDP 
8T(i),8T 
DE 
1101 1110 
1100 
1REG 
FMULP 
8T(i),8T 
DE 
1101 1110 
1101 
0--- 
*(5) 
DE 
1101 1110 
1101 
1000 
reserved 
DE 
1101 1110 
1101 
1001 
FCOMPP 
DE 
1101 1110 
1101 
101- 
reserved 
DE 
1101 1110 
1101 
11-- 
reserved 
DE 
1101 1110 
1110 
OREG 
F8UBP 
8T(i),8T 
DE 
1101 1110 
1110 
1REG 
F8UBRP 
8T(i),8T 
DE 
1101 1110 
1111 
OREG 
FDIVP 
8T(i),8T 
DE 
1101 1110 
1111 
1REG 
FDIVRP 
8T(i),8T 
DF 
1101 1111 
MODOO OR/M 
(disp-Io),(disp-hi) 
FILD 
word-integer 
DF 
1101 1111 
MODOO 1R/M 
(disp-Io),(disp-hi) 
reserved 
DF 
1101 1111 
MOD01 
OR/M 
(disp-Io),(disp-hi) 
FI8T 
word-integer 


DF 
1101 1111 
MOD01 
1R/M 
(disp-Io),(disp-hi) 
FI8TP 
word-integer 


DF 
1101 1111 
MOD10 
OR/M 
(disp-Io),(disp-hi) 
FBLD 
packed-decimal 
DF 
1101 1111 
MOD10 
1R/M 
(disp-Io),(disp-hi) 
FILD 
long-integer 
DF 
1101 1111 
MOD11 
OR/M 
(disp-Io),(d isp-hi) 
FB8TP 
packed-decimal 
DF 
1101 1111 
MOD11 
1R/M 
(disp-Io),(disp-hi) 
FI8TP 
long-integer 


DF 
1101 1111 
1100 
OREG 
*(6) 
DF 
1101 1111 
1100 
1REG 
*(7) 
DF 
1101 1111 
1101 
OREG 
*(8) 
DF 
1101 1111 
1101 
1REG 
*(9) 
DF 
1101 1111 
111- 
---- 
reserved 
, 


* The marked encodings 
are not generated 
by the language translators. 
If, however, 
the 8087 


encounters 
one one these encodings in the instruction 
stream, it will execute it as follows: 


(1) F8TP 
8T(i) 


(2) FCOM 
8T(i) 


(3) FCOMP 
8T(i) 


(4) FXCH 
8T(i) 


(5) FCOMP 
8T(i) 


(6) FFREE 
8T(i) and pop stack 


(7) FXCH 
8T(i) 


(8) F8TP 
8T(i) 


(9) F8TP 
8T(i) 


